DSANet: Dual Self-Attention Network for Multivariate Time Series Forecasting S Huang, D Wang, X Wu, A Tang ACM International Conference on Information and Knowledge Management (CIKM …, 2019 | 295 | 2019 |
Pareto Self-Supervised Training for Few-Shot Learning Z Chen, J Ge, H Zhan, S Huang, D Wang IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 13663-13672, 2021 | 149 | 2021 |
VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval S Huang, B Gong, Y Pan, J Jiang, Y Lv, Y Li, D Wang IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 6565-6574, 2023 | 60 | 2023 |
Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference H Zhao, M Zhang, W Zhao, P Ding, S Huang, D Wang AAAI Conference on Artificial Intelligence (AAAI), 2024 | 53 | 2024 |
Attributes-Guided and Pure-Visual Attention Alignment for Few-Shot Recognition S Huang, M Zhang, Y Kang, D Wang AAAI Conference on Artificial Intelligence (AAAI), 7840-7847, 2021 | 43 | 2021 |
Tree Structure-Aware Few-Shot Image Classification via Hierarchical Aggregation M Zhang, S Huang, W Li, D Wang European Conference on Computer Vision (ECCV), 453-470, 2022 | 38 | 2022 |
Prompt-Based Distribution Alignment for Unsupervised Domain Adaptation S Bai, M Zhang, W Zhou, S Huang, Z Luan, D Wang, B Chen AAAI Conference on Artificial Intelligence (AAAI) 38 (2), 729-737, 2024 | 29 | 2024 |
Troika: Multi-Path Cross-Modal Traction for Compositional Zero-Shot Learning S Huang, B Gong, Y Feng, Y Lv, D Wang IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024 | 19 | 2024 |
VGDIFFZERO: Text-To-Image Diffusion Models Can Be Zero-Shot Visual Grounders X Liu*, S Huang*, Y Kang, H Chen, D Wang IEEE International Conference on Acoustics, Speech and Signal Processing …, 2024 | 12 | 2024 |
Domain Generalized Few-Shot Image Classification via Meta Regularization Network M Zhang, S Huang, D Wang IEEE International Conference on Acoustics, Speech and Signal Processing …, 2022 | 11 | 2022 |
QUAR-VLA: Vision-Language-Action Model for Quadruped Robots P Ding, H Zhao, W Zhang, W Song, M Zhang, S Huang, N Yang, D Wang European Conference on Computer Vision (ECCV), 352-367, 2024 | 8 | 2024 |
Check, Locate, Rectify: A Training-Free Layout Calibration System for Text-to-Image Generation B Gong*, S Huang*, Y Feng, S Zhang, Y Li, Y Liu IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024 | 8 | 2024 |
HINFShot: A Challenge Dataset for Few-Shot Node Classification in Heterogeneous Information Network Z Zhuang, X Xiang, S Huang, D Wang International Conference on Multimedia Retrieval (ICMR), 429-436, 2021 | 8 | 2021 |
DARA: Domain-and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding T Liu, X Liu, S Huang, H Chen, Q Yin, L Qin, D Wang, Y Hu IEEE Conference on Multimedia Expo (ICME), 2024 | 7 | 2024 |
Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation S Huang, B Gong, Y Feng, X Chen, Y Fu, Y Liu, D Wang IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024 | 6 | 2024 |
MIST: Multi-Modal Interactive Side-Tuning for Memory-efficient Referring Expression Comprehension X Liu, T Liu, S Huang, Y Hu, Q Yin, D Wang, H Chen arXiv e-prints, arXiv: 2407.01131, 2024 | 4 | 2024 |
Reference-Limited Compositional Zero-Shot Learning S Huang, Q Wei, D Wang International Conference on Multimedia Retrieval (ICMR), 443-451, 2023 | 4 | 2023 |
Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and Inference T Liu, X Liu, S Huang, L Shi, Z Xu, Y Xin, Q Yin, X Liu arXiv preprint arXiv:2405.14700, 2024 | 3 | 2024 |
Accelerating diffusion transformers with token-wise feature caching C Zou, X Liu, T Liu, S Huang, L Zhang arXiv preprint arXiv:2410.05317, 2024 | 2 | 2024 |
Rethinking Token Reduction in MLLMs: Towards a Unified Paradigm for Training-Free Acceleration Y Han, X Liu, P Ding, D Wang, H Chen, Q Yan, S Huang arXiv preprint arXiv:2411.17686, 2024 | 1 | 2024 |