HOP: History-and-order aware pre-training for vision-and-language navigation Y Qiao, Y Qi, Y Hong, Z Yu, P Wang, Q Wu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 56 | 2022 |
HOP+: History-enhanced and order-aware pre-training for vision-and-language navigation Y Qiao, Y Qi, Y Hong, Z Yu, P Wang, Q Wu IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023 | 24 | 2023 |
March in chat: Interactive prompting for remote embodied referring expression Y Qiao, Y Qi, Z Yu, J Liu, Q Wu Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 9 | 2023 |
RankVQA: Answer re-ranking for visual question answering Y Qiao, Z Yu, J Liu 2020 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2020 | 9 | 2020 |
VC-VQA: visual calibration mechanism for visual question answering Y Qiao, Z Yu, J Liu 2020 IEEE International Conference on Image Processing (ICIP), 1481-1485, 2020 | 7 | 2020 |
VL-Mamba: Exploring state space models for multimodal learning Y Qiao, Z Yu, L Guo, S Chen, Z Zhao, M Sun, Q Wu, J Liu arXiv preprint arXiv:2403.13600, 2024 | 6 | 2024 |
VLN-PETL: Parameter-Efficient Transfer Learning for Vision-and-Language Navigation Y Qiao, Z Yu, Q Wu Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 3 | 2023 |
Multi-modal Adapter for Medical Vision-and-Language Learning Z Yu, Y Qiao, Y Xie, Q Wu International Workshop on Machine Learning in Medical Imaging, 393-402, 2023 | 1 | 2023 |
PLMVQA: Applying Pseudo Labels for Medical Visual Question Answering with Limited Data Z Yu, Y Xie, Y Xia, Q Wu International Conference on Medical Image Computing and Computer-Assisted …, 2023 | | 2023 |