Wenmeng Yu
Title
Cited by
Year
Learning modality-specific representations with self-supervised multi-task learning for multimodal sentiment analysis
W Yu, H Xu, Z Yuan, J Wu
Proceedings of the AAAI conference on artificial intelligence 35 (12), 10790 …, 2021
Cited by: 473 | Year: 2021
CogVLM: Visual expert for pretrained language models
W Wang, Q Lv, W Yu, W Hong, J Qi, Y Wang, J Ji, Z Yang, L Zhao, X Song, ...
arXiv preprint arXiv:2311.03079, 2023
Cited by: 459 | Year: 2023
CH-SIMS: A Chinese multimodal sentiment analysis dataset with fine-grained annotation of modality
W Yu, H Xu, F Meng, Y Zhu, Y Ma, J Wu, J Zou, K Yang
Proceedings of the 58th annual meeting of the association for computational …, 2020
Cited by: 309 | Year: 2020
CogAgent: A visual language model for GUI agents
W Hong, W Wang, Q Lv, J Xu, W Yu, J Ji, Y Wang, Z Wang, Y Dong, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
Cited by: 191 | Year: 2024
Transformer-based feature reconstruction network for robust multimodal sentiment analysis
Z Yuan, W Li, H Xu, W Yu
Proceedings of the 29th ACM International Conference on Multimedia, 4400-4407, 2021
Cited by: 95 | Year: 2021
Co-attentive multi-task convolutional neural network for facial expression recognition
W Yu, H Xu
Pattern Recognition 123, 108401, 2022
Cited by: 72 | Year: 2022
M-SENA: An integrated platform for multimodal sentiment analysis
H Mao, Z Yuan, H Xu, W Yu, Y Liu, K Gao
arXiv preprint arXiv:2203.12441, 2022
Cited by: 52 | Year: 2022
CogVLM2: Visual language models for image and video understanding
W Hong, W Wang, M Ding, W Yu, Q Lv, Y Wang, Y Cheng, S Huang, J Ji, ...
arXiv preprint arXiv:2408.16500, 2024
Cited by: 28 | Year: 2024
CogAgent: A visual language model for GUI agents, 2023
W Hong, W Wang, Q Lv, J Xu, W Yu, J Ji, Y Wang, Z Wang, Y Zhang, J Li, ...
URL https://arxiv.org/abs/2312.08914
Cited by: 7
CogVLM: Visual expert for large language models
W Wang, Q Lv, W Yu, W Hong, J Qi, Y Wang, J Ji, Z Yang, L Zhao, ...
Cited by: 5 | Year: 2023
AlignMMBench: Evaluating Chinese Multimodal Alignment in Large Vision-Language Models
Y Wu, W Yu, Y Cheng, Y Wang, X Zhang, J Xu, M Ding, Y Dong
arXiv preprint arXiv:2406.09295, 2024
Cited by: 2 | Year: 2024
MathGLM-Vision: Solving Mathematical Problems with Multi-Modal Large Language Model
Z Yang, J Chen, Z Du, W Yu, W Wang, W Hong, Z Jiang, B Xu, Y Dong, ...
arXiv preprint arXiv:2409.13729, 2024
Year: 2024