Wenhai Wang (王文海)
CUHK | Shanghai AI Laboratory | NJU
Verified email at cuhk.edu.hk
Title
Cited by
Year
SegFormer: Simple and efficient design for semantic segmentation with transformers
E Xie, W Wang, Z Yu, A Anandkumar, JM Alvarez, P Luo
Advances in Neural Information Processing Systems (NeurIPS) 34, 12077-12090, 2021
Cited by: 4929; Year: 2021
Pyramid vision transformer: A versatile backbone for dense prediction without convolutions
W Wang, E Xie, X Li, DP Fan, K Song, D Liang, T Lu, P Luo, L Shao
IEEE/CVF International Conference on Computer Vision (ICCV), 568-578, 2021
Cited by: 4338; Year: 2021
Selective Kernel Networks
X Li, W Wang, X Hu, J Yang
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019
Cited by: 2913; Year: 2019
PVT v2: Improved Baselines with Pyramid Vision Transformer
W Wang, E Xie, X Li, DP Fan, K Song, D Liang, T Lu, P Luo, L Shao
Computational Visual Media Journal (CVMJ), 2022
Cited by: 1514; Year: 2022
Generalized Focal Loss: Learning Qualified and Distributed Bounding Boxes for Dense Object Detection
X Li, W Wang, L Wu, S Chen, X Hu, J Li, J Tang, J Yang
Advances in Neural Information Processing Systems (NeurIPS), 2020
Cited by: 1238; Year: 2020
BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers
Z Li, W Wang, H Li, E Xie, C Sima, T Lu, Q Yu, J Dai
European Conference on Computer Vision (ECCV), 2022
Cited by: 1167; Year: 2022
Shape Robust Text Detection with Progressive Scale Expansion Network
W Wang, E Xie, X Li, W Hou, T Lu, G Yu, S Shao
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019
Cited by: 833; Year: 2019
PolarMask: Single shot instance segmentation with polar representation
E Xie, P Sun, X Song, W Wang, X Liu, D Liang, C Shen, P Luo
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 12193 …, 2020
Cited by: 713; Year: 2020
InternImage: Exploring large-scale vision foundation models with deformable convolutions
W Wang, J Dai, Z Chen, Z Huang, Z Li, X Zhu, X Hu, T Lu, L Lu, H Li, ...
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023
Cited by: 709; Year: 2023
Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network
W Wang, E Xie, X Song, Y Zang, W Wang, T Lu, G Yu, C Shen
IEEE/CVF International Conference on Computer Vision (ICCV), 2019
Cited by: 603; Year: 2019
Vision transformer adapter for dense predictions
Z Chen, Y Duan, W Wang, J He, T Lu, J Dai, Y Qiao
International Conference on Learning Representations (ICLR), 2023
Cited by: 557; Year: 2023
Planning-oriented autonomous driving
Y Hu, J Yang, L Chen, K Li, C Sima, X Zhu, S Chai, S Du, T Lin, W Wang, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 542*; Year: 2023
VideoChat: Chat-centric video understanding
KC Li, Y He, Y Wang, Y Li, W Wang, P Luo, Y Wang, L Wang, Y Qiao
arXiv preprint arXiv:2305.06355, 2023
Cited by: 485; Year: 2023
Polyp-PVT: Polyp Segmentation with Pyramid Vision Transformers
B Dong, W Wang, DP Fan, J Li, H Fu, L Shao
CAAI Artificial Intelligence Research (CAAI AIR), 2023
Cited by: 401; Year: 2023
InternVL: Scaling up vision foundation models and aligning for generic visual-linguistic tasks
Z Chen, J Wu, W Wang, W Su, G Chen, S Xing, Z Muyan, Q Zhang, X Zhu, ...
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024
Cited by: 395*; Year: 2024
VisionLLM: Large language model is also an open-ended decoder for vision-centric tasks
W Wang, Z Chen, X Chen, J Wu, X Zhu, G Zeng, P Luo, T Lu, J Zhou, ...
Advances in Neural Information Processing Systems (NeurIPS), 2023
Cited by: 392; Year: 2023
DetCo: Unsupervised Contrastive Learning for Object Detection
E Xie, J Ding, W Wang, X Zhan, H Xu, Z Li, P Luo
IEEE/CVF International Conference on Computer Vision (ICCV), 2021
Cited by: 374; Year: 2021
Generalized Focal Loss V2: Learning Reliable Localization Quality Estimation for Dense Object Detection
X Li, W Wang, X Hu, J Li, J Tang, J Yang
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021
Cited by: 294; Year: 2021
How far are we to GPT-4V? Closing the gap to commercial multimodal models with open-source suites
Z Chen, W Wang, H Tian, S Ye, Z Gao, E Cui, W Tong, K Hu, J Luo, Z Ma, ...
arXiv preprint arXiv:2404.16821, 2024
Cited by: 228; Year: 2024
Scene Text Image Super-Resolution in the Wild
W Wang, E Xie, X Liu, W Wang, D Liang, C Shen, X Bai
European Conference on Computer Vision (ECCV), 650-666, 2020
Cited by: 185*; Year: 2020