Sihan Chen

Citations

	All	Since 2019
Citations	338	338
h-index	7	7
i10-index	6	6

Year	2021	2022	2023	2024
Citations	16	59	156	104

Public access

3 articles

0 articles

Available

Not available

Based on funding mandates

Co-authors

Jing Liu 刘静	Professor in Institute of Automation of the Chinese Academy of Sciences (CASIA)	Verified email at nlpr.ia.ac.cn
Xinxin Zhu 朱欣鑫	Institute of Automation of the Chinese Academy of Sciences (CASIA)	Verified email at nlpr.ia.ac.cn
Longteng Guo	Associate Professor, Institute of Automation of the Chinese Academy of Sciences (CASIA)	Verified email at nlpr.ia.ac.cn
Xingjian He	Institute of Automation of the Chinese Academy of Sciences (CASIA)	Verified email at nlpr.ia.ac.cn
Zijia Zhao	Institute of Automation, Chinese Academy of Sciences (CASIA)	Verified email at ia.ac.cn
Handong Li	Institute of Automation, Chinese Academy of Sciences	Verified email at ia.ac.cn
Xiaojie Jin 靳潇杰	Bytedance Research, USA	Verified email at bytedance.com
Jiashi Feng	ByteDance Inc.	Verified email at bytedance.com
Zikang Liu	Institute of Automation, Chinese Academy of Sciences	Verified email at ia.ac.cn
Weining Wang	Institute of Automation, Chinese Academy of Sciences	Verified email at nlpr.ia.ac.cn
Jiawei Liu	ByteDance	Verified email at bytedance.com


Sihan Chen

Institute of Automation, Chinese Academy of Sciences

Verified email at nlpr.ia.ac.cn

Vision-Language Pretraining, Multimodal Understanding


Title	Authors	Venue	Cited by	Year
Cptr: Full transformer network for image captioning	W Liu, S Chen, L Guo, X Zhu, J Liu	arXiv preprint arXiv:2101.10804, 2021	162	2021
VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset	S Chen, X He, L Guo, X Zhu, W Wang, J Tang, J Liu	arXiv preprint arXiv:2304.08345, 2023	55	2023
Vast: A vision-audio-subtitle-text omni-modality foundation model and dataset	S Chen, H Li, Q Wang, Z Zhao, M Sun, X Zhu, J Liu	Advances in Neural Information Processing Systems 36, 2024	34	2024
ChatBridge: Bridging Modalities with Large Language Model as a Language Catalyst	Z Zhao, L Guo, T Yue, S Chen, S Shao, X Zhu, Z Yuan, J Liu	arXiv preprint arXiv:2305.16103, 2023	29	2023
Global-local propagation network for RGB-D semantic segmentation	S Chen, X Zhu, W Liu, X He, J Liu	arXiv preprint arXiv:2101.10801, 2021	19	2021
VLAB: Enhancing Video Language Pre-training by Feature Adapting and Blending	X He, S Chen, F Ma, Z Huang, X Jin, Z Liu, D Fu, Y Yang, J Liu, J Feng	arXiv preprint arXiv:2305.13167, 2023	18	2023
VL-Mamba: Exploring State Space Models for Multimodal Learning	Y Qiao, Z Yu, L Guo, S Chen, Z Zhao, M Sun, Q Wu, J Liu	arXiv preprint arXiv:2403.13600, 2024	8	2024
MM21 Pre-training for Video Understanding Challenge: Video Captioning with Pretraining Techniques	S Chen, X Zhu, D Hao, W Liu, J Liu, Z Zhao, L Guo, J Liu	Proceedings of the 29th ACM International Conference on Multimedia, 4853-4857, 2021	5	2021
COSA: Concatenated Sample Pretrained Vision-Language Foundation Model	S Chen, X He, H Li, X Jin, J Feng, J Liu	arXiv preprint arXiv:2306.09085, 2023	3	2023
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation	J Liu, W Wang, S Chen, X Zhu, J Liu	IEEE Transactions on Multimedia, 2023	3	2023
GLOBER: Coherent Non-autoregressive Video Generation via GLOBal Guided Video DecodER	M Sun, W Wang, Z Qin, J Sun, S Chen, J Liu	Advances in Neural Information Processing Systems 36, 2024	2	2024
EAVL: Explicitly Align Vision and Language for Referring Image Segmentation	Y Yan, X He, W Wang, S Chen, J Liu	arXiv preprint arXiv:2308.09779, 2023		2023
Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner	Z Liu, S Chen, L Guo, H Li, X He, J Liu	arXiv preprint arXiv:2305.11769, 2023		2023
