关注
Siteng Huang
Siteng Huang
其他姓名黄 思腾
Alibaba DAMO Academy | ZJU | Westlake University
在 westlake.edu.cn 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
DSANet: Dual Self-Attention Network for Multivariate Time Series Forecasting
S Huang, D Wang, X Wu, A Tang
ACM International Conference on Information and Knowledge Management (CIKM …, 2019
2952019
Pareto Self-Supervised Training for Few-Shot Learning
Z Chen, J Ge, H Zhan, S Huang, D Wang
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 13663-13672, 2021
1492021
VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval
S Huang, B Gong, Y Pan, J Jiang, Y Lv, Y Li, D Wang
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 6565-6574, 2023
602023
Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference
H Zhao, M Zhang, W Zhao, P Ding, S Huang, D Wang
AAAI Conference on Artificial Intelligence (AAAI), 2024
532024
Attributes-Guided and Pure-Visual Attention Alignment for Few-Shot Recognition
S Huang, M Zhang, Y Kang, D Wang
AAAI Conference on Artificial Intelligence (AAAI), 7840-7847, 2021
432021
Tree Structure-Aware Few-Shot Image Classification via Hierarchical Aggregation
M Zhang, S Huang, W Li, D Wang
European Conference on Computer Vision (ECCV), 453-470, 2022
382022
Prompt-Based Distribution Alignment for Unsupervised Domain Adaptation
S Bai, M Zhang, W Zhou, S Huang, Z Luan, D Wang, B Chen
AAAI Conference on Artificial Intelligence (AAAI) 38 (2), 729-737, 2024
292024
Troika: Multi-Path Cross-Modal Traction for Compositional Zero-Shot Learning
S Huang, B Gong, Y Feng, Y Lv, D Wang
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024
192024
VGDIFFZERO: Text-To-Image Diffusion Models Can Be Zero-Shot Visual Grounders
X Liu*, S Huang*, Y Kang, H Chen, D Wang
IEEE International Conference on Acoustics, Speech and Signal Processing …, 2024
122024
Domain Generalized Few-Shot Image Classification via Meta Regularization Network
M Zhang, S Huang, D Wang
IEEE International Conference on Acoustics, Speech and Signal Processing …, 2022
112022
QUAR-VLA: Vision-Language-Action Model for Quadruped Robots
P Ding, H Zhao, W Zhang, W Song, M Zhang, S Huang, N Yang, D Wang
European Conference on Computer Vision (ECCV), 352-367, 2024
82024
Check, Locate, Rectify: A Training-Free Layout Calibration System for Text-to-Image Generation
B Gong*, S Huang*, Y Feng, S Zhang, Y Li, Y Liu
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024
82024
HINFShot: A Challenge Dataset for Few-Shot Node Classification in Heterogeneous Information Network
Z Zhuang, X Xiang, S Huang, D Wang
International Conference on Multimedia Retrieval (ICMR), 429-436, 2021
82021
DARA: Domain-and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding
T Liu, X Liu, S Huang, H Chen, Q Yin, L Qin, D Wang, Y Hu
IEEE Conference on Multimedia Expo (ICME), 2024
72024
Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation
S Huang, B Gong, Y Feng, X Chen, Y Fu, Y Liu, D Wang
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024
62024
MIST: Multi-Modal Interactive Side-Tuning for Memory-efficient Referring Expression Comprehension
X Liu, T Liu, S Huang, Y Hu, Q Yin, D Wang, H Chen
arXiv e-prints, arXiv: 2407.01131, 2024
42024
Reference-Limited Compositional Zero-Shot Learning
S Huang, Q Wei, D Wang
International Conference on Multimedia Retrieval (ICMR), 443-451, 2023
42023
Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and Inference
T Liu, X Liu, S Huang, L Shi, Z Xu, Y Xin, Q Yin, X Liu
arXiv preprint arXiv:2405.14700, 2024
32024
Accelerating diffusion transformers with token-wise feature caching
C Zou, X Liu, T Liu, S Huang, L Zhang
arXiv preprint arXiv:2410.05317, 2024
22024
Rethinking Token Reduction in MLLMs: Towards a Unified Paradigm for Training-Free Acceleration
Y Han, X Liu, P Ding, D Wang, H Chen, Q Yan, S Huang
arXiv preprint arXiv:2411.17686, 2024
12024
系统目前无法执行此操作,请稍后再试。
文章 1–20