Follow
Zhe Chen
Zhe Chen
PhD candidate, Nanjing University
Verified email at smail.nju.edu.cn - Homepage
Title
Cited by
Cited by
Year
Vision Transformer Adapter for Dense Predictions
Z Chen, Y Duan, W Wang, J He, T Lu, J Dai, Y Qiao
International Conference on Learning Representation (ICLR), 2022
2882022
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions
W Wang, J Dai, Z Chen, Z Huang, Z Li, X Zhu, X Hu, T Lu, L Lu, H Li, ...
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 14408 …, 2023
2862023
VisionLLM: Large Language Model is Also an Open-Ended Decoder for Vision-Centric Tasks
W Wang, Z Chen, X Chen, J Wu, X Zhu, G Zeng, P Luo, T Lu, J Zhou, ...
Advances in Neural Information Processing Systems (NeurIPS), 2023
1002023
InternGPT: Solving vision-centric tasks by interacting with chatbots beyond language
Z Liu, Y He, W Wang, W Wang, Y Wang, S Chen, Q Zhang, Y Yang, Q Li, ...
arXiv preprint arXiv:2305.05662, 2023
402023
DDP: Diffusion Model for Dense Visual Prediction
Y Ji, Z Chen, E Xie, L Hong, X Liu, Z Liu, T Lu, Z Li, P Luo
IEEE/CVF International Conference on Computer Vision (ICCV), 2023
302023
InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges
G Chen, S Xing, Z Chen, Y Wang, K Li, Y Li, Y Liu, J Wang, YD Zheng, ...
Technical Report of Ego4D Challenge 2022 @ ECCV, 2022
232022
Towards Ultra-Resolution Neural Style Transfer via Thumbnail Instance Normalization
Z Chen, W Wang, E Xie, T Lu, P Luo
Proceedings of the AAAI Conference on Artificial Intelligence 36 (1), 393-400, 2022
152022
The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World
W Wang, M Shi, Q Li, W Wang, Z Huang, L Xing, Z Chen, H Li, X Zhu, ...
International Conference on Learning Representation (ICLR), 2023
142023
FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation
Z Chen, J Wang, W Wang, G Chen, E Xie, P Luo, T Lu
arXiv preprint arXiv:2111.02394, 2021
122021
Integrating Geometric Control into Text-to-Image Diffusion Models for High-Quality Detection Data Generation via Text Prompt
K Chen, E Xie, Z Chen, L Hong, Z Li, DY Yeung
International Conference on Learning Representation (ICLR), 2023
102023
AVSegFormer: Audio-Visual Segmentation with Transformer
S Gao, Z Chen, G Chen, W Wang, T Lu
AAAI Conference on Artificial Intelligence (AAAI), 2023
52023
SiameseCCR: A Novel Method for One‐Shot and Few‐Shot Chinese CAPTCHA Recognition using Deep Siamese Network
Z Chen, W Ma, N Xu, C Ji, Y Zhang
IET Image Processing 14 (12), 2855-2859, 2020
52020
Block Shuffle: A Method for High-Resolution Fast Style Transfer with Limited Memory
W Ma, Z Chen, C Ji
IEEE Access 8, 158056-158066, 2020
42020
Graph Propagation Transformer for Graph Representation Learning
Z Chen, H Tan, T Wang, T Shen, T Lu, Q Peng, C Cheng, Y Qi
The 32nd International Joint Conference on Artificial Intellgence (IJCAI), 2023
32023
InternVL: Scaling up vision foundation models and aligning for generic visual-linguistic tasks
Z Chen, J Wu, W Wang, W Su, G Chen, S Xing, Z Muyan, Q Zhang, X Zhu, ...
arXiv preprint arXiv:2312.14238, 2023
22023
Champion Solution for the WSDM2023 Toloka VQA Challenge
S Gao, Z Chen, G Chen, W Wang, T Lu
Technical Report of WSDM Cup 2023 @ WSDM, 2023
12023
MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer
C Tian, X Zhu, Y Xiong, W Wang, Z Chen, W Wang, Y Chen, L Lu, T Lu, ...
arXiv preprint arXiv:2401.10208, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–17