Shiwei Zhang
Alibaba Group
Verified email at alibaba-inc.com
Title
Cited by
Year
ModelScope text-to-video technical report
J Wang, H Yuan, D Chen, Y Zhang, X Wang, S Zhang
arXiv preprint arXiv:2308.06571, 2023
Cited by: 281, Year: 2023
VideoComposer: Compositional video synthesis with motion controllability
X Wang, H Yuan, S Zhang, D Chen, J Wang, Y Zhang, Y Shen, D Zhao, ...
Advances in Neural Information Processing Systems 36, 2024
Cited by: 254, Year: 2024
End-to-end temporal action detection with transformer
X Liu, Q Wang, Y Hu, X Tang, S Zhang, S Bai, X Bai
IEEE Transactions on Image Processing 31, 5427-5441, 2022
Cited by: 245, Year: 2022
TCTrack: Temporal contexts for aerial tracking
Z Cao, Z Huang, L Pan, S Zhang, Z Liu, C Fu
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022
Cited by: 171, Year: 2022
I2VGen-XL: High-quality image-to-video synthesis via cascaded diffusion models
S Zhang, J Wang, Y Zhang, K Zhao, H Yuan, Z Qin, X Wang, D Zhao, ...
arXiv preprint arXiv:2311.04145, 2023
Cited by: 137, Year: 2023
OadTR: Online action detection with transformers
X Wang, S Zhang, Z Qing, Y Shao, Z Zuo, C Gao, N Sang
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
Cited by: 131, Year: 2021
TACNet: Transition-aware context network for spatio-temporal action detection
L Song, S Zhang, G Yu, H Sun
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019
Cited by: 104, Year: 2019
Hybrid relation guided set matching for few-shot action recognition
X Wang, S Zhang, Z Qing, M Tang, Z Zuo, C Gao, R Jin, N Sang
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022
Cited by: 101, Year: 2022
Self-supervised learning for semi-supervised temporal action proposal
X Wang, S Zhang, Z Qing, Y Shao, C Gao, N Sang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
Cited by: 76, Year: 2021
DreamVideo: Composing your dream videos with customized subject and motion
Y Wei, S Zhang, Z Qing, H Yuan, Z Liu, Y Liu, Y Zhang, J Zhou, H Shan
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
Cited by: 64, Year: 2024
MoLo: Motion-augmented Long-short Contrastive Learning for Few-shot Action Recognition
X Wang, S Zhang, Z Qing, C Gao, Y Zhang, D Zhao, N Sang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 61, Year: 2023
TAda! Temporally-Adaptive Convolutions for Video Understanding
Z Huang, S Zhang, L Pan, Z Qing, M Tang, Z Liu, MH Ang Jr
International Conference on Learning Representations, 2022
Cited by: 61, Year: 2022
MAR: Masked Autoencoders for Efficient Action Recognition
Z Qing, S Zhang, Z Huang, X Wang, Y Wang, Y Lv, C Gao, N Sang
IEEE Transactions on Multimedia 26, 218-233, 2023
Cited by: 55, Year: 2023
CLIP-guided prototype modulating for few-shot action recognition
X Wang, S Zhang, J Cen, C Gao, Y Zhang, D Zhao, N Sang
International Journal of Computer Vision, 2023
Cited by: 50, Year: 2023
Support-set based cross-supervision for video grounding
X Ding, N Wang, S Zhang, D Cheng, X Li, Z Huang, M Tang, X Gao
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
Cited by: 49, Year: 2021
DreamTalk: When expressive talking head generation meets diffusion probabilistic models
Y Ma, S Zhang, J Wang, X Wang, Y Zhang, Z Deng
arXiv preprint arXiv:2312.09767, 2023
Cited by: 45, Year: 2023
GLNet: Global local network for weakly supervised action localization
S Zhang, L Song, C Gao, N Sang
IEEE Transactions on Multimedia 22 (10), 2610-2622, 2019
Cited by: 40, Year: 2019
Towards real-world visual tracking with temporal contexts
Z Cao, Z Huang, L Pan, S Zhang, Z Liu, C Fu
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023
Cited by: 36, Year: 2023
VideoLCM: Video latent consistency model
X Wang, S Zhang, H Zhang, Y Liu, Y Zhang, C Gao, N Sang
arXiv preprint arXiv:2312.09109, 2023
Cited by: 35, Year: 2023
RLIPv2: Fast scaling of relational language-image pre-training
H Yuan, S Zhang, X Wang, S Albanie, Y Pan, T Feng, J Jiang, D Ni, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
Cited by: 29, Year: 2023