Follow
Yunxin Li
Title
Cited by
Cited by
Year
A Comprehensive Evaluation of GPT-4V on Knowledge-Intensive Visual Question Answering
Y Li, L Wang, B Hu, X Chen, W Zhong, C Lyu, M Zhang
arXiv preprint arXiv:2311.07536, 2023
142023
LMEye: An Interactive Perception Network for Large Language Models
Y Li, B Hu, X Chen, L Ma, Y Xu, M Zhang
arXiv preprint arXiv:2305.03701, 2023
142023
Fast and Robust Online Handwritten Chinese Character Recognition with Deep Spatial & Contextual Information Fusion Network
Y Li, Q Yang, Q Chen, B Hu, X Wang, Y Ding, L Ma
IEEE Transactions on Multimedia, 2022
132022
Chunk-aware Alignment and Lexical Constraint for Visual Entailment with Natural Language Explanations
Q Yang*, Y Li*, B Hu, L Ma, Y Ding, M Zhang
ACM MM 2022, 2022
82022
A Multi-Modal Context Reasoning Approach for Conditional Inference on Joint Textual and Visual Clues
Y Li, B Hu, X Chen, Y Ding, L Ma, M Zhang
ACL 2023 Main Conference, 2023
72023
Medical Dialogue Response Generation with Pivotal Information Recalling
Y Zhao*, Y Li*, Y Wu, B Hu, Q Chen, X Wang, Y Ding, M Zhang
KDD 2022, 2022
72022
Training Multimedia Event Extraction With Generated Images and Captions
Z Du, Y Li, X Guo, Y Sun, B Li
ACM MM 2023, 2023
52023
GlyphCRM: Bidirectional Encoder Representation for Chinese Character with its Glyph
Y Li, Y Zhao, B Hu, Q Chen, Y Xiang, X Wang, Y Ding, L Ma
arXiv preprint arXiv:2107.00395, 2021
42021
Towards Vision Enhancing LLMs: Empowering Multimodal Knowledge Storage and Sharing in LLMs
Y Li, B Hu, W Wang, X Cao, M Zhang
arXiv preprint arXiv:2311.15759, 2023
32023
A Neural Divide-and-Conquer Reasoning Framework for Image Retrieval from Linguistically Complex Text
Y Li, B Hu, Y Ding, L Ma, M Zhang
ACL 2023 Main Conference, 2023
32023
LLMs Meet Long Video: Advancing Long Video Comprehension with An Interactive Visual Adapter in LLMs
Y Li, X Chen, B Hu, M Zhang
arXiv preprint arXiv:2402.13546, 2024
12024
Cognitive Visual-Language Mapper: Advancing Multimodal Comprehension with Enhanced Visual Knowledge Alignment
Y Li, X Chen, B Hu, H Shi, M Zhang
arXiv preprint arXiv:2402.13561, 2024
2024
A Multimodal In-Context Tuning Approach for E-Commerce Product Description Generation
Y Li, B Hu, W Luo, L Ma, Y Ding, M Zhang
LREC-COLING 2024, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–13