Follow
Xuri Ge 葛旭日
Title
Cited by
Cited by
Year
Structured multi-modal feature embedding and alignment for image-sentence retrieval
X Ge, F Chen, JM Jose, Z Ji, Z Wu, X Liu
Proceedings of the 29th ACM international conference on multimedia, 5185-5193, 2021
602021
Cross-modal semantic enhanced interaction for image-sentence retrieval
X Ge, F Chen, S Xu, F Tao, JM Jose
Proceedings of the IEEE/CVF winter conference on applications of computer …, 2023
392023
Variational structured semantic inference for diverse image captioning
F Chen, R Ji, J Ji, X Sun, B Zhang, X Ge, Y Wu, F Huang, Y Wang
Advances in Neural Information Processing Systems (NeurIPS), 2019
342019
IISAN: Efficiently Adapting Multimodal Representation for Sequential Recommendation with Decoupled PEFT
J Fu, X Ge*, X Xin, A Karatzoglou, I Arapakis, J Wang, JM Jose
Proceedings of the 47th International ACM SIGIR Conference on Research and …, 2024
182024
Local global relational network for facial action units recognition
X Ge, P Wan, H Han, JM Jose, Z Ji, Z Wu, X Liu
2021 16th IEEE International Conference on Automatic Face and Gesture …, 2021
182021
ALGRNet: Multi-Relational Adaptive Facial Action Unit Modelling for Face Representation and Relevant Recognitions
X Ge, JM Jose, P Wang, A Iyer, X Liu, H Han
IEEE Transactions on Biometrics, Behavior, and Identity Science (TBIOM), 2023
17*2023
3SHNet: Boosting image–sentence retrieval via visual semantic–spatial self-highlighting
X Ge, S Xu, F Chen, J Wang, G Wang, S An, JM Jose
Information Processing & Management 61 (4), 103716, 2024
122024
Multi-Local Attention for Speech-based Depression Detection
F Tao, X Ge*, W Ma, A Esposito, A Vinciarelli
2023 IEEE International Conference on Acoustics, Speech and Signal Processing, 2023
102023
MGRR-Net: Multi-level graph relational reasoning network for facial action units detection
X Ge, JM Jose, S Xu, X Liu, H Han
ACM Transactions on Intelligent Systems and Technology (TIST), 2024
82024
Colloquial image captioning
X Ge, F Chen, C Shen, R Ji
2019 IEEE International Conference on Multimedia and Expo (ICME), 356-361, 2019
82019
CFIR: Fast and Effective Long-Text To Image Retrieval for Large Corpora
Z Long, X Ge*, R McCreadie, JM Jose
Proceedings of the 47th International ACM SIGIR Conference on Research and …, 2024
72024
Efficient and Effective Adaptation of Multimodal Foundation Models in Sequential Recommendation
J Fu, X Ge*, X Xin, A Karatzoglou, I Arapakis, K Zheng, Y Ni, JM Jose
arXiv preprint arXiv:2411.02992, 2024
32024
Sparks of Surprise: Multi-objective Recommendations with Hierarchical Decision Transformers for Diversity, Novelty, and Serendipity
J Wang, A Karatzoglou, I Arapakis, X Xin, X Ge, JM Joemon
33rd ACM International Conference on Information and Knowledge Management …, 2024
32024
The relationship between speech features changes when you get depressed: Feature correlations for improving speed and performance of depression detection
F Tao, W Ma, X Ge, A Esposito, A Vinciarelli
arXiv preprint arXiv:2307.02892, 2023
32023
Continuous Interaction with a Smart Speaker via Low-dimensional Embeddings of Dynamic Hand Pose
S Xu, C Kaul, X Ge, R Murray-Smith
2023 IEEE International Conference on Acoustics, Speech and Signal Processing, 2023
32023
ZhongqinWu, and Xiao Liu. 2021. Structured multi-modal feature embedding and alignment for image-sentence retrieval
X Ge, F Chen, ZJ JoemonMJose
Proceedings of the 29th ACM International Conference on Multimedia, 0
3
Hire: Hybrid-modal interaction with multiple relational enhancements for image-text matching
X Ge, F Chen, S Xu, F Tao, J Wang, JM Jose
ACM Transactions on Intelligent Systems and Technology, 2025
22025
Detail-Enhanced Intra-and Inter-modal Interaction for Audio-Visual Emotion Recognition
T Shi, X Ge, JM Jose, N Pugeault, P Henderson
International Conference on Pattern Recognition, 451-465, 2025
22025
Towards End-to-End Explainable Facial Action Unit Recognition via Vision-Language Joint Learning
X Ge, J Fu, F Chen, S An, N Sebe, JM Jose
Proceedings of the 32nd ACM International Conference on Multimedia, 8189-8198, 2024
12024
Factored Attention and Embedding for Unstructured-view Topic-related Ultrasound Report Generation
F Chen, F Chen, X Ma, X Ge*
Journal of Electrical Systems (JES), 2024
12024
The system can't perform the operation now. Try again later.
Articles 1–20