Seth Dong Huk Park

Cited by

	All	Since 2019
Citations	3109	2613
h-index	10	10
i10-index	10	10

660

330

165

495

20162017201820192020202120222023202426 156 280 362 414 440 530 648 218

Public access

View all

4 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Trevor DarrellProfessor of Computer Science, U.C. BerkeleyVerified email at eecs.berkeley.edu
Anna RohrbachProfessor, TU Darmstadt, GermanyVerified email at tu-darmstadt.de
Marcus RohrbachProfessor for Multimodal Reliable AI, TU Darmstadt, GermanyVerified email at tu-darmstadt.de
Lisa Anne M HendricksDeepMindVerified email at google.com
Zeynep AkataProfessor at TUM and Director at Helmholtz MunichVerified email at helmholtz-munich.de
Bernt SchieleProfessor, Max Planck Institute for Informatics, Saarland Informatics Campus, Saarland UniversityVerified email at mpi-inf.mpg.de
Vasili RamanishkaSamsung Research AmericaVerified email at bu.edu
Abir DasAssistant Professor at IIT KharagpurVerified email at cse.iitkgp.ac.in
Kate SaenkoBoston UniversityVerified email at bu.edu

Seth Dong Huk Park

UC Berkeley

Verified email at eecs.berkeley.edu

Artificial Intelligence Deep Learning Vision & Language


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Multimodal compact bilinear pooling for visual question answering and visual grounding A Fukui, DH Park, D Yang, A Rohrbach, T Darrell, M Rohrbach arXiv preprint arXiv:1606.01847, 2016	1741	2016
Multimodal explanations: Justifying decisions and pointing to the evidence DH Park, L Anne Hendricks, Z Akata, A Rohrbach, B Schiele, T Darrell, ... Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018	535*	2018
Toward transformer-based object detection J Beal, E Kim, E Tzeng, DH Park, A Zhai, D Kislyuk arXiv preprint arXiv:2012.09958, 2020	204	2020
More control for free! image synthesis with semantic diffusion guidance X Liu, DH Park, S Azadi, G Zhang, A Chopikyan, Y Hu, H Shi, A Rohrbach, ... Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2023	164*	2023
Multimodal video description V Ramanishka, A Das, DH Park, S Venugopalan, LA Hendricks, ... Proceedings of the 24th ACM international conference on Multimedia, 1092-1096, 2016	150	2016
Robust Change Captioning DH Park, T Darrell, A Rohrbach arXiv preprint arXiv:1901.02527, 2019	127	2019
Benchmark for compositional text-to-image synthesis DH Park, S Azadi, X Liu, T Darrell, A Rohrbach Thirty-fifth Conference on Neural Information Processing Systems Datasets …, 2021	79	2021
Learning a unified embedding for visual search at pinterest A Zhai, HY Wu, E Tzeng, DH Park, C Rosenberg Proceedings of the 25th ACM SIGKDD International Conference on Knowledge …, 2019	50	2019
Diffusion hyperfeatures: Searching through time and space for semantic correspondence G Luo, L Dunlap, DH Park, A Holynski, T Darrell Advances in Neural Information Processing Systems 36, 2024	26	2024
Billion-scale pretraining with vision transformers for multi-task visual representations J Beal, HY Wu, DH Park, A Zhai, D Kislyuk Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2022	25	2022
Shape-guided diffusion with inside-outside attention DH Park, G Luo, C Toste, S Azadi, X Liu, M Karalashvili, A Rohrbach, ... Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024	4	2024
Discovering non-monotonic autoregressive orderings with variational inference X Li, B Trabucco, DH Park, M Luo, S Shen, T Darrell, Y Gao arXiv preprint arXiv:2110.15797, 2021	4	2021
Vision and Language Understanding Through Generative Modeling DHS Park University of California, Berkeley, 2023		2023
Skin tone determination and filtering A Burdin, A Guo, CJ Rosenberg, CX Zhang, DDJ Xue, DO Kislyuk, ... US Patent App. 17/564,004, 2022		2022

The system can't perform the operation now. Try again later.

Articles 1–14

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors