Traj-MAE: Masked Autoencoders for Trajectory Prediction H Chen, J Wang, K Shao, F Liu, J Hao, C Guan, G Chen, PA Heng ICCV 2023, 8351-8362, 2023 | 60 | 2023 |
SciVerse: Unveiling the Knowledge Comprehension and Visual Reasoning of LMMs on Multi-modal Scientific Problems Z Guo*, R Zhang*, H Chen*, J Gao*, D Jiang, J Wang, PA Heng ACL 2025, 2025 | 10 | 2025 |
HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model J Liu*, H Chen*, P An, Z Liu, R Zhang, C Gu, X Li, Z Guo, S Chen, M Liu, ... arXiv preprint arXiv:2503.10631, 2025 | 9 | 2025 |
SignVTCL: Multi-Modal Continuous Sign Language Recognition Enhanced by Visual-Textual Contrastive Learning H Chen, J Wang, Z Guo, J Li, D Zhou, B Wu, C Guan, G Chen, PA Heng BMVC 2024, 2024 | 7 | 2024 |
MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models D Zhou, J Huang, J Bai, J Wang, H Chen, G Chen, X Hu, PA Heng IJCAI 2025, 2024 | 5 | 2024 |
SFANet: Spatial-Frequency Attention Network for Weather Forecasting J Wang, H Chen, H Xu, J Li, B Wang, K Shao, F Liu, H Chen, G Chen, ... arXiv preprint arXiv:2405.18849, 2024 | | 2024 |