Follow
Guo Chen
Guo Chen
Verified email at smail.nju.edu.cn
Title
Cited by
Cited by
Year
Internvideo: General video foundation models via generative and discriminative learning
Y Wang, K Li, Y Li, Y He, B Huang, Z Zhao, H Zhang, J Xu, Y Liu, Z Wang, ...
arXiv preprint arXiv:2212.03191, 2022
1582022
Dcan: improving temporal action detection via dual context aggregation
G Chen, YD Zheng, L Wang, T Lu
Proceedings of the AAAI conference on artificial intelligence 36 (1), 248-257, 2022
492022
Internvid: A large-scale video-text dataset for multimodal understanding and generation
Y Wang, Y He, Y Li, K Li, J Yu, X Ma, X Li, G Chen, X Chen, Y Wang, C He, ...
ICLR2023, 2023
482023
Videollm: Modeling video sequence with large language models
G Chen, YD Zheng, J Wang, J Xu, Y Huang, J Pan, Y Wang, Y Wang, ...
arXiv preprint arXiv:2305.13292, 2023
372023
Internvideo-ego4d: A pack of champion solutions to ego4d challenges
G Chen, S Xing, Z Chen, Y Wang, K Li, Y Li, Y Liu, J Wang, YD Zheng, ...
arXiv preprint arXiv:2211.09529, 2022
292022
Basictad: an astounding rgb-only baseline for temporal action detection
M Yang, G Chen, YD Zheng, T Lu, L Wang
Computer Vision and Image Understanding 232, 103692, 2023
242023
FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation
Z Chen, J Wang, W Wang, G Chen, E Xie, P Luo, T Lu
arXiv preprint arXiv:2111.02394, 2021
122021
Internvl: Scaling up vision foundation models and aligning for generic visual-linguistic tasks
Z Chen, J Wu, W Wang, W Su, G Chen, S Xing, Z Muyan, Q Zhang, X Zhu, ...
arXiv preprint arXiv:2312.14238, 2023
112023
Mvbench: A comprehensive multi-modal video understanding benchmark
K Li, Y Wang, Y He, Y Li, Y Wang, Y Liu, Z Wang, J Xu, G Chen, P Luo, ...
arXiv preprint arXiv:2311.17005, 2023
112023
Avsegformer: Audio-visual segmentation with transformer
S Gao, Z Chen, G Chen, W Wang, T Lu
Proceedings of the AAAI Conference on Artificial Intelligence 38 (11), 12155 …, 2024
92024
Video mamba suite: State space model as a versatile alternative for video understanding
G Chen, Y Huang, J Xu, B Pei, Z Chen, Z Li, J Wang, K Li, T Lu, L Wang
arXiv preprint arXiv:2403.09626, 2024
52024
Memory-and-Anticipation Transformer for Online Action Understanding
J Wang, G Chen, Y Huang, L Wang, T Lu
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
52023
MRSN: Multi-Relation Support Network for Video Action Detection
YD Zheng, G Chen, M Yuan, T Lu
2023 IEEE International Conference on Multimedia and Expo (ICME), 1026-1031, 2023
32023
InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding
Y Wang, K Li, X Li, J Yu, Y He, G Chen, B Pei, R Zheng, J Xu, Z Wang, ...
arXiv preprint arXiv:2403.15377, 2024
22024
Retrieval-augmented egocentric video captioning
J Xu, Y Huang, J Hou, G Chen, Y Zhang, R Feng, W Xie
arXiv preprint arXiv:2401.00789, 2024
22024
Champion Solution for the WSDM2023 Toloka VQA Challenge
S Gao, Z Chen, G Chen, W Wang, T Lu
arXiv preprint arXiv:2301.09045, 2023
12023
EgoExoLearn: A Dataset for Bridging Asynchronous Ego-and Exo-centric View of Procedural Activities in Real World
Y Huang, G Chen, J Xu, M Zhang, L Yang, B Pei, H Zhang, L Dong, ...
arXiv preprint arXiv:2403.16182, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–17