Action Quality Assessment with Temporal Parsing Transformer Y Bai, D Zhou, S Zhang, J Wang, E Ding, Y Guan, Y Long, J Wang European Conference on Computer Vision (ECCV 2022), 2022 | 52 | 2022 |
Fatigue Assessment using ECG and Actigraphy Sensors Y Bai, Y Guan, WF Ng International Symposium on Wearable Computers (Ubicomp-ISWC 2020), 12-16, 2020 | 36 | 2020 |
Discriminative Latent Semantic Graph for Video Captioning Y Bai, J Wang, Y Long, B Hu, Y Song, M Pagnucco, Y Guan ACM International Conference on Multimedia (ACM MM 2021), 2021 | 35 | 2021 |
Query Twice: Dual Mixture Attention Meta Learning for Video Summarization J Wang, Y Bai, Y Long, B Hu, Z Chai, Y Guan, X Wei ACM International Conference on Multimedia (ACM MM 2020), 2020 | 21 | 2020 |
Sentence-level Prompts Benefit Composed Image Retrieval Y Bai, X Xu, Y Liu, S Khan, F Khan, W Zuo, RSM Goh, CM Feng International Conference on Learning Representations (ICLR 2024 Spotlight), 2024 | 20* | 2024 |
DS-Depth: Dynamic and static depth estimation via a fusion cost volume X Miao, Y Bai, H Duan, Y Huang, F Wan, X Xu, Y Long, Y Zheng IEEE Transactions on Circuits and Systems for Video Technology, 2023 | 13 | 2023 |
The effects of noninvasive vagus nerve stimulation on fatigue in participants with primary Sjögren’s syndrome J Tarn, E Evans, E Traianos, A Collins, M Stylianou, J Parikh, Y Bai, ... Neuromodulation: Technology at the Neural Interface 26 (3), 681-689, 2023 | 13 | 2023 |
Towards Automated Fatigue Assessment using Wearable Sensing and Mixed-Effects Models Y Bai, Y Guan, JQ Shi, WF Ng International Symposium on Wearable Computers (Ubicomp-ISWC 2021), 2021 | 8 | 2021 |
CTNeRF: Cross-time transformer for dynamic neural radiance field from monocular video X Miao, Y Bai, H Duan, F Wan, Y Huang, Y Long, Y Zheng Pattern Recognition 156, 110729, 2024 | 5 | 2024 |
Temporal segment transformer for action segmentation Z Liu, L Wang, D Zhou, J Wang, S Zhang, Y Bai, E Ding, R Fan arXiv preprint arXiv:2302.13074, 2023 | 5 | 2023 |
MedRG: Medical Report Grounding with Multi-modal Large Language Model K Zou, Y Bai, Z Chen, Y Zhou, Y Chen, K Ren, M Wang, X Yuan, X Shen, ... arXiv preprint arXiv:2404.06798, 2024 | 4 | 2024 |
VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering CM Feng, Y Bai*, T Luo, Z Li, S Khan, W Zuo, X Xu, RSM Goh, Y Liu Proceedings of the 39th AAAI Conference on Artificial Intelligence (AAAI), 2025 | 3 | 2025 |
ConRF: Zero-shot Stylization of 3D Scenes with Conditioned Radiation Fields X Miao, Y Bai, H Duan, F Wan, Y Huang, Y Long, Y Zheng arXiv preprint arXiv:2402.01950, 2024 | 2 | 2024 |
Towards Few-shot Image Captioning with Cycle-based Compositional Semantic Enhancement Framework P Zhang, Y Bai, J Su, Y Huang, Y Long IEEE International Joint Conference on Neural Networks (IJCNN 2023), 2023 | 1 | 2023 |
Enhancing Community Vision Screening: AI-Driven Retinal Photography for Early Disease Detection and Patient Trust X Lei, YC Tham, JHL Goh, Y Feng, Y Bai, ZD Soh, RSM Goh, X Xu, Y Liu, ... International Workshop on Ophthalmic Medical Image Analysis, 146-156, 2024 | | 2024 |
From Generalist to Specialist: Adapting Vision Language Models via Task-Specific Visual Instruction Tuning Y Bai, Y Zhou, J Zhou, RSM Goh, DSW Ting, Y Liu arXiv preprint arXiv:2410.06456, 2024 | | 2024 |
UrFound: Towards Universal Retinal Foundation Models via Knowledge-Guided Masked Modeling K Yu, Y Zhou, Y Bai, Z Da Soh, X Xu, RSM Goh, CY Cheng, Y Liu MICCAI 2024, 2024 | | 2024 |