Seth Dong Huk Park
Seth Dong Huk Park
Verified email at
Cited by
Cited by
Multimodal compact bilinear pooling for visual question answering and visual grounding
A Fukui, DH Park, D Yang, A Rohrbach, T Darrell, M Rohrbach
arXiv preprint arXiv:1606.01847, 2016
Multimodal explanations: Justifying decisions and pointing to the evidence
DH Park, L Anne Hendricks, Z Akata, A Rohrbach, B Schiele, T Darrell, ...
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018
Multimodal video description
V Ramanishka, A Das, DH Park, S Venugopalan, LA Hendricks, ...
Proceedings of the 24th ACM international conference on Multimedia, 1092-1096, 2016
Robust Change Captioning
DH Park, T Darrell, A Rohrbach
arXiv preprint arXiv:1901.02527, 2019
Learning a unified embedding for visual search at pinterest
A Zhai, HY Wu, E Tzeng, DH Park, C Rosenberg
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge …, 2019
Toward transformer-based object detection
J Beal, E Kim, E Tzeng, DH Park, A Zhai, D Kislyuk
arXiv preprint arXiv:2012.09958, 2020
Billion-Scale Pretraining with Vision Transformers for Multi-Task Visual Representations
J Beal, HY Wu, DH Park, A Zhai, D Kislyuk
arXiv preprint arXiv:2108.05887, 2021
Benchmark for Compositional Text-to-Image Synthesis
DH Park, S Azadi, X Liu, T Darrell, A Rohrbach
Novelty Detection with Rotated Contrastive Predictive Coding
DH Park, T Darrell
Discovering Non-monotonic Autoregressive Orderings with Variational Inference
X Li, B Trabucco, DH Park, M Luo, S Shen, T Darrell, Y Gao
International Conference on Learning Representations, 2020
Multimodal Explanations: Justifying Decisions and Pointing to the Evidence (Supplementary Material)
DH Park, LA Hendricks, Z Akata, A Rohrbach, B Schiele, T Darrell, ...
The system can't perform the operation now. Try again later.
Articles 1–11