Top-down framework for weakly-supervised grounded image captioning C Cai, S Wang, KH Yap, Y Wang Knowledge-Based Systems 287, 111433, 2024 | 17* | 2024 |
Attribute Conditioned Fashion Image Captioning C Cai, KH Yap, S Wang 2022 IEEE International Conference on Image Processing (ICIP), 1921-1925, 2022 | 16 | 2022 |
Interactive Change-Aware Transformer Network for Remote Sensing Image Change Captioning C Cai, Y Wang, KH Yap Remote Sensing 15 (23), 5611, 2023 | 12 | 2023 |
Towards Attribute-Controlled Fashion Image Captioning C Cai, KH Yap, S Wang ACM Transactions on Multimedia Computing, Communications and Applications, 2024 | 4 | 2024 |
CM2-Net: Continual Cross-Modal Mapping Network for Driver Action Recognition R Wang, C Cai, W Wang, J Gao, D Lin, W Liu, KH Yap 2024 IEEE International Conference on Image Processing (ICIP), 2024 | 2 | 2024 |
Multi-scale Attentive Fusion Network for Remote Sensing Image Change Captioning C Cai, Y Wang, KH Yap 2024 IEEE International Symposium on Circuits and Systems (ISCAS), 1-5, 2024 | 2 | 2024 |
Temporal Sentence Grounding with Temporally Global Textual Knowledge C Cai, R Zhang, J Gao, K Wu, KH Yap, Y Wang 2024 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2024 | 1 | 2024 |
Hdplifter: Hierarchical Dynamics Perception For 2D-to-3D Human Pose Lifting Y Lu, J Gao, C Cai, R Wang, DT Phan, KH Yap 2024 IEEE International Conference on Image Processing (ICIP), 2055-2061, 2024 | | 2024 |
CL-HOI: Cross-Level Human-Object Interaction Distillation from Vision Large Language Models J Gao, C Cai, R Wang, W Liu, KH Yap, K Garg, BS Han arXiv preprint arXiv:2410.15657, 2024 | | 2024 |
Empowering Large Language Model for Continual Video Question Answering with Collaborative Prompting C Cai, Z Wang, J Gao, W Liu, Y Lu, R Zhang, KH Yap 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024 | | 2024 |