Follow
Songyang Zhang
Songyang Zhang
Other names张宋扬
Amazon Web Services
Verified email at amazon.com - Homepage
Title
Cited by
Cited by
Year
Make-a-video: Text-to-video generation without text-video data
U Singer, A Polyak, T Hayes, X Yin, J An, S Zhang, Q Hu, H Yang, ...
arXiv preprint arXiv:2209.14792, 2022
6032022
Learning 2D Temporal Adjacent Networks for Moment Localization with Natural Language
S Zhang, H Peng, J Fu, J Luo
AAAI 2020, 2019
3702019
On geometric features for skeleton-based action recognition using multilayer lstm networks
S Zhang, X Liu, J Xiao
2017 IEEE Winter Conference on Applications of Computer Vision (WACV), 148-157, 2017
3362017
Fusing geometric features for skeleton-based action recognition using multilayer LSTM networks
S Zhang, Y Yang, J Xiao, X Liu, Y Yang, D Xie, Y Zhuang
IEEE Transactions on Multimedia 20 (9), 2330-2343, 2018
1882018
Expanding language-image pretrained models for general video recognition
B Ni, H Peng, M Chen, S Zhang, G Meng, J Fu, S Xiang, H Ling
European Conference on Computer Vision, 1-18, 2022
1632022
Boundary proposal network for two-stage natural language video localization
S Xiao, L Chen, S Zhang, W Ji, J Shao, L Ye, J Xiao
Proceedings of the AAAI Conference on Artificial Intelligence 35 (4), 2986-2994, 2021
1302021
SAT: 2D Semantics Assisted Training for 3D Visual Grounding
Z Yang, S Zhang, L Wang, J Luo
ICCV 2021, 2021
652021
The devil is in the labels: Noisy label correction for robust scene graph generation
L Li, L Chen, Y Huang, Z Zhang, S Zhang, J Xiao
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
642022
Exploiting temporal relationships in video moment localization with natural language
S Zhang, J Su, J Luo
Proceedings of the 27th ACM International Conference on Multimedia, 1230-1238, 2019
632019
Multi-scale 2d temporal adjacency networks for moment localization with natural language
S Zhang, H Peng, J Fu, Y Lu, J Luo
IEEE Transactions on Pattern Analysis and Machine Intelligence 44 (12), 9073 …, 2021
492021
Latent-Shift: Latent Diffusion with Temporal Shift for Efficient Text-to-Video Generation
J An, S Zhang, H Yang, S Gupta, JB Huang, J Luo, X Yin
arXiv preprint arXiv:2304.08477, 2023
482023
Content-based Analysis of the Cultural Differences between TikTok and Douyin
L Sun, H Zhang, S Zhang, J Luo
IEEE Big Data 2020, 2020
272020
Video-aided unsupervised grammar induction
S Zhang, L Song, L Jin, K Xu, D Yu, J Luo
NAACL 2021, 2021
262021
Mugen: A playground for video-audio-text multimodal understanding and generation
T Hayes, S Zhang, X Yin, G Pang, S Sheng, H Yang, S Ge, Q Hu, D Parikh
European Conference on Computer Vision, 431-449, 2022
202022
Explorations of skeleton features for LSTM-based action recognition
J Feng, S Zhang, J Xiao
Multimedia Tools and Applications 78, 591-603, 2019
162019
Instance-wise or class-wise? A tale of neighbor Shapley for concept-based explanation
J Li, K Kuang, L Li, L Chen, S Zhang, J Shao, J Xiao
Proceedings of the 29th ACM International Conference on Multimedia, 3664-3672, 2021
122021
Mi YouTube es Su YouTube? Analyzing the Cultures using YouTube Thumbnails of Popular Videos
S Zhang, T Aktas, J Luo
IEEE Big Data 2021, 2020
112020
Rethinking the Evaluation of Unbiased Scene Graph Generation
X Li, L Chen, J Shao, S Xiao, S Zhang, J Xiao
arXiv preprint arXiv:2208.01909, 2022
102022
Make-a-video: Text-to-video generation without text-video data. arXiv 2022
U Singer, A Polyak, T Hayes, X Yin, J An, S Zhang, Q Hu, H Yang, ...
arXiv preprint arXiv:2209.14792, 2022
102022
Learning Sparse 2D Temporal Adjacent Networks for Temporal Action Localization
S Zhang, H Peng, L Yang, J Fu, J Luo
HACS Challenge at ICCV 2020, 2019
72019
The system can't perform the operation now. Try again later.
Articles 1–20