Kevin J. Shih
Research Scientist, NVIDIA
Verified email at nvidia.com
Title
Cited by
Year
Image inpainting for irregular holes using partial convolutions
G Liu, FA Reda, KJ Shih, TC Wang, A Tao, B Catanzaro
Proceedings of the European Conference on Computer Vision (ECCV), 85-100, 2018
Cited by: 935; Year: 2018
Where to look: Focus regions for visual question answering
KJ Shih, S Singh, D Hoiem
Computer Vision and Pattern Recognition 2016, 2015
Cited by: 414; Year: 2015
Improving semantic segmentation via video propagation and label relaxation
Y Zhu, K Sapra, FA Reda, KJ Shih, S Newsam, A Tao, B Catanzaro
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019
Cited by: 188; Year: 2019
Graphical contrastive losses for scene graph parsing
J Zhang, KJ Shih, A Elgammal, A Tao, B Catanzaro
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019
Cited by: 98*; Year: 2019
Learning collections of part models for object recognition
I Endres, KJ Shih, J Jiaa, D Hoiem
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2013
Cited by: 87; Year: 2013
SDC-Net: Video prediction using spatially-displaced convolution
FA Reda, G Liu, KJ Shih, R Kirby, J Barker, D Tarjan, A Tao, B Catanzaro
Proceedings of the European Conference on Computer Vision (ECCV), 718-733, 2018
Cited by: 80; Year: 2018
Partial convolution based padding
G Liu, KJ Shih, TC Wang, FA Reda, K Sapra, Z Yu, A Tao, B Catanzaro
arXiv preprint arXiv:1811.11718, 2018
Cited by: 42; Year: 2018
Part localization using multi-proposal consensus for fine-grained categorization
KJ Shih, A Mallya, S Singh, D Hoiem
BMVC 2015, 2015
Cited by: 42; Year: 2015
Learning Interpretable Spatial Operations in a Rich 3D Blocks World
Y Bisk, KJ Shih, Y Choi, D Marcu
Proceedings of the Thirty-Second Conference on Artificial Intelligence (AAAI-18), 2018
Cited by: 39; Year: 2018
Flowtron: an autoregressive flow-based generative network for text-to-speech synthesis
R Valle, K Shih, R Prenger, B Catanzaro
arXiv preprint arXiv:2005.05957, 2020
Cited by: 34; Year: 2020
Unsupervised video interpolation using cycle consistency
FA Reda, D Sun, A Dundar, M Shoeybi, G Liu, KJ Shih, A Tao, J Kautz, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision, 892-900, 2019
Cited by: 26; Year: 2019
Aligned Image-Word Representations Improve Inductive Transfer Across Vision-Language Tasks
T Gupta, K Shih, S Singh, D Hoiem
arXiv preprint arXiv:1704.00260, 2017
Cited by: 13; Year: 2017
An interpretable model for scene graph generation
J Zhang, K Shih, A Tao, B Catanzaro, A Elgammal
arXiv preprint arXiv:1811.09543, 2018
Cited by: 12; Year: 2018
Unsupervised disentanglement of pose, appearance and background from images and videos
A Dundar, KJ Shih, A Garg, R Pottorf, A Tao, B Catanzaro
arXiv preprint arXiv:2001.09518, 2020
Cited by: 11; Year: 2020
Recognition of items depicted in images
K Shih, W Di, V Jagadeesh, R Piramuthu
US Patent App. 14/973,582, 2016
Cited by: 11; Year: 2016
Video prediction using spatially displaced convolution
G Liu, K Shih, R Kirby, J Barker, D Tarjan, A Tao, B Catanzaro
US Patent App. 16/360,853, 2019
Cited by: 8; Year: 2019
Learning discriminative collections of part detectors for object recognition
KJ Shih, I Endres, D Hoiem
IEEE transactions on pattern analysis and machine intelligence 37 (8), 1571-1584, 2014
Cited by: 8; Year: 2014
Revisiting image-language networks for open-ended phrase detection
BA Plummer, K Shih, Y Li, K Xu, S Lazebnik, S Sclaroff, K Saenko
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020
Cited by: 5; Year: 2020
Introduction to the 1st place winning model of OpenImages relationship detection challenge
J Zhang, K Shih, A Tao, B Catanzaro, A Elgammal
arXiv preprint arXiv:1811.00662, 2018
Cited by: 5; Year: 2018
One TTS alignment to rule them all
R Badlani, A Łancucki, KJ Shih, R Valle, W Ping, B Catanzaro
arXiv preprint arXiv:2108.10447, 2021
Cited by: 3; Year: 2021