Pano-AVQA: Grounded audio-visual question answering on 360° videos H Yun, Y Yu, W Yang, K Lee, G Kim Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 56 | 2021 |
Transitional adaptation of pretrained models for visual storytelling Y Yu, J Chung, H Yun, J Kim, G Kim Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 28 | 2021 |
Multimodal knowledge alignment with reinforcement learning Y Yu, J Chung, H Yun, J Hessel, JS Park, X Lu, P Ammanabrolu, R Zellers, ... arXiv preprint arXiv:2205.12630, 2022 | 27 | 2022 |
Panoramic Vision Transformer for Saliency Detection in 360° Videos H Yun, S Lee, G Kim European Conference on Computer Vision, 422-439, 2022 | 15 | 2022 |
Fusing Pre-Trained Language Models With Multimodal Prompts Through Reinforcement Learning Y Yu, J Chung, H Yun, J Hessel, JS Park, X Lu, R Zellers, P Ammanabrolu, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 6 | 2023 |
Character grounding and re-identification in story of videos and text descriptions Y Yu, J Kim, H Yun, J Chung, G Kim Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020 | 6 | 2020 |
Dense 2D-3D Indoor Prediction with Sound via Aligned Cross-Modal Distillation H Yun, J Na, G Kim Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 2 | 2023 |
A Mobile Robot Generating Video Summaries of Seniors' Indoor Activities CY Yang, H Yun, S Varadaraj, JY Hsu Proceedings of the 21st International Conference on Human-Computer …, 2019 | | 2019 |