Pale transformer: A general vision transformer backbone with pale-shaped attention S Wu, T Wu, H Tan, G Guo Proceedings of the AAAI Conference on Artificial Intelligence 36 (3), 2731-2739, 2022 | 81 | 2022 |
Fully transformer networks for semantic image segmentation S Wu, T Wu, F Lin, S Tian, G Guo arXiv preprint arXiv:2106.04108, 2021 | 56 | 2021 |
Structtoken: Rethinking semantic segmentation with structural prior F Lin, Z Liang, S Wu, J He, K Chen, S Tian IEEE Transactions on circuits and systems for video technology 33 (10), 5655 …, 2023 | 49 | 2023 |
Semantic diffusion network for semantic segmentation H Tan, S Wu, J Pi Advances in Neural Information Processing Systems 35, 8702-8716, 2022 | 41 | 2022 |
Data pruning via moving-one-sample-out H Tan, S Wu, F Du, Y Chen, Z Wang, F Wang, X Qi Advances in neural information processing systems 36, 18251-18262, 2023 | 38 | 2023 |
Regionblip: A unified multi-modal pre-training framework for holistic and regional comprehension Q Zhou, C Yu, S Zhang, S Wu, Z Wang, F Wang arXiv preprint arXiv:2308.02299, 2023 | 24 | 2023 |
Uninext: Exploring a unified architecture for vision recognition F Lin, J Yuan, S Wu, F Wang, Z Wang Proceedings of the 31st ACM International Conference on Multimedia, 3200-3208, 2023 | 22 | 2023 |
Catrans: Context and affinity transformer for few-shot segmentation S Zhang, T Wu, S Wu, G Guo arXiv preprint arXiv:2204.12817, 2022 | 22 | 2022 |
Demystify transformers & convolutions in modern image deep networks X Hu, M Shi, W Wang, S Wu, L Xing, W Wang, X Zhu, L Lu, J Zhou, ... IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024 | 15 | 2024 |
PRSeg: A lightweight patch rotate MLP decoder for semantic segmentation Y Ma, F Lin, S Wu, S Tian, L Yu IEEE Transactions on Circuits and Systems for Video Technology 33 (11), 6860 …, 2023 | 12 | 2023 |
Full-scale selective transformer for semantic segmentation F Lin, S Wu, Y Ma, S Tian Proceedings of the Asian Conference on Computer Vision, 2663-2679, 2022 | 9 | 2022 |
Proxy graph matching with proximal matching networks HR Tan, C Wang, ST Wu, TQ Wang, XY Zhang, CL Liu Proceedings of the AAAI conference on artificial intelligence 35 (11), 9808-9815, 2021 | 8 | 2021 |
Feature selective transformer for semantic image segmentation F Lin, T Wu, S Wu, S Tian, G Guo arXiv preprint arXiv:2203.14124, 2022 | 5 | 2022 |
Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition Z Zhong, C Wang, Y Liu, S Yang, L Tang, Y Zhang, J Li, T Qu, Y Li, ... arXiv preprint arXiv:2412.09501, 2024 | 4 | 2024 |
Ensemble quadratic assignment network for graph matching H Tan, C Wang, S Wu, XY Zhang, F Yin, CL Liu International Journal of Computer Vision 132 (9), 3633-3655, 2024 | 4 | 2024 |
Robocoder: Robotic learning from basic skills to general tasks with large language models J Li, P Chen, S Wu, C Zheng, H Xu, J Jia arXiv preprint arXiv:2406.03757, 2024 | 3 | 2024 |
Axwin transformer: A context-aware vision transformer backbone with axial windows F Lin, Y Ma, S Wu, L Yu, S Tian arXiv preprint arXiv:2305.01280, 2023 | 3 | 2023 |
Saco loss: Sample-wise affinity consistency for vision-language pre-training S Wu, H Tan, Z Tian, Y Chen, X Qi, J Jia Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 2 | 2024 |
Data Pruning by Information Maximization H Tan, S Wu, W Huang, S Zhao, X Qi The Thirteenth International Conference on Learning Representations, 2025 | | 2025 |
CR2PQ: Continuous Relative Rotary Positional Query for Dense Visual Representation Learning S Zhang, Q Zhou, S Wu, H Tan, Z Wang, J Huang, J Yan The Thirteenth International Conference on Learning Representations, 0 | | |