Sitong Wu
Title
Cited by
Year
Pale transformer: A general vision transformer backbone with pale-shaped attention
S Wu, T Wu, H Tan, G Guo
Proceedings of the AAAI Conference on Artificial Intelligence 36 (3), 2731-2739, 2022
81 2022
Fully transformer networks for semantic image segmentation
S Wu, T Wu, F Lin, S Tian, G Guo
arXiv preprint arXiv:2106.04108, 2021
56 2021
Structtoken: Rethinking semantic segmentation with structural prior
F Lin, Z Liang, S Wu, J He, K Chen, S Tian
IEEE Transactions on Circuits and Systems for Video Technology 33 (10), 5655 …, 2023
49 2023
Semantic diffusion network for semantic segmentation
H Tan, S Wu, J Pi
Advances in Neural Information Processing Systems 35, 8702-8716, 2022
41 2022
Data pruning via moving-one-sample-out
H Tan, S Wu, F Du, Y Chen, Z Wang, F Wang, X Qi
Advances in Neural Information Processing Systems 36, 18251-18262, 2023
38 2023
Regionblip: A unified multi-modal pre-training framework for holistic and regional comprehension
Q Zhou, C Yu, S Zhang, S Wu, Z Wang, F Wang
arXiv preprint arXiv:2308.02299, 2023
24 2023
Uninext: Exploring a unified architecture for vision recognition
F Lin, J Yuan, S Wu, F Wang, Z Wang
Proceedings of the 31st ACM International Conference on Multimedia, 3200-3208, 2023
22 2023
Catrans: Context and affinity transformer for few-shot segmentation
S Zhang, T Wu, S Wu, G Guo
arXiv preprint arXiv:2204.12817, 2022
22 2022
Demystify transformers & convolutions in modern image deep networks
X Hu, M Shi, W Wang, S Wu, L Xing, W Wang, X Zhu, L Lu, J Zhou, ...
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024
15 2024
PRSeg: A lightweight patch rotate MLP decoder for semantic segmentation
Y Ma, F Lin, S Wu, S Tian, L Yu
IEEE Transactions on Circuits and Systems for Video Technology 33 (11), 6860 …, 2023
12 2023
Full-scale selective transformer for semantic segmentation
F Lin, S Wu, Y Ma, S Tian
Proceedings of the Asian Conference on Computer Vision, 2663-2679, 2022
9 2022
Proxy graph matching with proximal matching networks
HR Tan, C Wang, ST Wu, TQ Wang, XY Zhang, CL Liu
Proceedings of the AAAI Conference on Artificial Intelligence 35 (11), 9808-9815, 2021
8 2021
Feature selective transformer for semantic image segmentation
F Lin, T Wu, S Wu, S Tian, G Guo
arXiv preprint arXiv:2203.14124, 2022
5 2022
Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition
Z Zhong, C Wang, Y Liu, S Yang, L Tang, Y Zhang, J Li, T Qu, Y Li, ...
arXiv preprint arXiv:2412.09501, 2024
4 2024
Ensemble quadratic assignment network for graph matching
H Tan, C Wang, S Wu, XY Zhang, F Yin, CL Liu
International Journal of Computer Vision 132 (9), 3633-3655, 2024
4 2024
Robocoder: Robotic learning from basic skills to general tasks with large language models
J Li, P Chen, S Wu, C Zheng, H Xu, J Jia
arXiv preprint arXiv:2406.03757, 2024
3 2024
Axwin transformer: A context-aware vision transformer backbone with axial windows
F Lin, Y Ma, S Wu, L Yu, S Tian
arXiv preprint arXiv:2305.01280, 2023
3 2023
Saco loss: Sample-wise affinity consistency for vision-language pre-training
S Wu, H Tan, Z Tian, Y Chen, X Qi, J Jia
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
2 2024
Data Pruning by Information Maximization
H Tan, S Wu, W Huang, S Zhao, X Qi
The Thirteenth International Conference on Learning Representations, 2025
2025
CR2PQ: Continuous Relative Rotary Positional Query for Dense Visual Representation Learning
S Zhang, Q Zhou, S Wu, H Tan, Z Wang, J Huang, J Yan
The Thirteenth International Conference on Learning Representations, 2025