Seth Dong Huk Park
Seth Dong Huk Park
Verified email at
Cited by
Cited by
Multimodal compact bilinear pooling for visual question answering and visual grounding
A Fukui, DH Park, D Yang, A Rohrbach, T Darrell, M Rohrbach
arXiv preprint arXiv:1606.01847, 2016
Multimodal explanations: Justifying decisions and pointing to the evidence
DH Park, L Anne Hendricks, Z Akata, A Rohrbach, B Schiele, T Darrell, ...
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018
Toward transformer-based object detection
J Beal, E Kim, E Tzeng, DH Park, A Zhai, D Kislyuk
arXiv preprint arXiv:2012.09958, 2020
More control for free! image synthesis with semantic diffusion guidance
X Liu, DH Park, S Azadi, G Zhang, A Chopikyan, Y Hu, H Shi, A Rohrbach, ...
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2023
Multimodal video description
V Ramanishka, A Das, DH Park, S Venugopalan, LA Hendricks, ...
Proceedings of the 24th ACM international conference on Multimedia, 1092-1096, 2016
Robust Change Captioning
DH Park, T Darrell, A Rohrbach
arXiv preprint arXiv:1901.02527, 2019
Benchmark for compositional text-to-image synthesis
DH Park, S Azadi, X Liu, T Darrell, A Rohrbach
Thirty-fifth Conference on Neural Information Processing Systems Datasets …, 2021
Learning a unified embedding for visual search at pinterest
A Zhai, HY Wu, E Tzeng, DH Park, C Rosenberg
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge …, 2019
Diffusion hyperfeatures: Searching through time and space for semantic correspondence
G Luo, L Dunlap, DH Park, A Holynski, T Darrell
Advances in Neural Information Processing Systems 36, 2024
Billion-scale pretraining with vision transformers for multi-task visual representations
J Beal, HY Wu, DH Park, A Zhai, D Kislyuk
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2022
Shape-guided diffusion with inside-outside attention
DH Park, G Luo, C Toste, S Azadi, X Liu, M Karalashvili, A Rohrbach, ...
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024
Discovering non-monotonic autoregressive orderings with variational inference
X Li, B Trabucco, DH Park, M Luo, S Shen, T Darrell, Y Gao
arXiv preprint arXiv:2110.15797, 2021
Vision and Language Understanding Through Generative Modeling
DHS Park
University of California, Berkeley, 2023
Skin tone determination and filtering
A Burdin, A Guo, CJ Rosenberg, CX Zhang, DDJ Xue, DO Kislyuk, ...
US Patent App. 17/564,004, 2022
The system can't perform the operation now. Try again later.
Articles 1–14