关注
Zhengyuan Yang
Zhengyuan Yang
Researcher, Microsoft
在 microsoft.com 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
End-to-end multi-modal multi-task vehicle control for self-driving cars with visual perceptions
Z Yang, Y Zhang, J Yu, J Cai, J Luo
2018 24th International Conference on Pattern Recognition (ICPR), 2289-2294, 2018
1202018
A fast and accurate one-stage approach to visual grounding
Z Yang, B Gong, L Wang, W Huang, D Yu, J Luo
IEEE International Conference on Computer Vision (ICCV), 4683-4693, 2019
1182019
Action recognition with spatio–temporal visual attention on skeleton image sequences
Z Yang, Y Li, J Yang, J Luo
IEEE Transactions on Circuits and Systems for Video Technology 29 (8), 2405-2415, 2018
1152018
Attentive relational networks for mapping images to scene graphs
M Qi, W Li, Z Yang, Y Wang, J Luo
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 3957-3966, 2019
992019
Improving One-stage Visual Grounding by Recursive Sub-query Construction
Z Yang, T Chen, L Wang, J Luo
European Conference on Computer Vision (ECCV), 2020
542020
A Novel Graph-based Multi-modal Fusion Encoder for Neural Machine Translation
Y Yin, F Meng, J Su, C Zhou, Z Yang, J Zhou, J Luo
Annual Meeting of the Association for Computational Linguistics (ACL), 2020
452020
TAP: Text-Aware Pre-training for Text-VQA and Text-Caption
Z Yang, Y Lu, J Wang, X Yin, D Florencio, L Wang, C Zhang, L Zhang, ...
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021
412021
TransVG: End-to-End Visual Grounding with Transformers
J Deng, Z Yang, T Chen, W Zhou, H Li
IEEE International Conference on Computer Vision (ICCV), 2021
372021
Dynamic context-guided capsule network for multimodal machine translation
H Lin, F Meng, J Su, Y Yin, Z Yang, Y Ge, J Zhou, J Luo
Proceedings of the 28th ACM International Conference on Multimedia, 1320-1329, 2020
222020
An empirical study of gpt-3 for few-shot knowledge-based vqa
Z Yang, Z Gan, J Wang, X Hu, Y Lu, Z Liu, L Wang
arXiv preprint arXiv:2109.05014 3 (6), 7, 2021
202021
Improving Weakly Supervised Visual Grounding by Contrastive Knowledge Distillation
L Wang, J Huang, Y Li, K Xu, Z Yang, D Yu
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021
192021
Human-centered emotion recognition in animated gifs
Z Yang, Y Zhang, J Luo
2019 IEEE International Conference on Multimedia and Expo (ICME), 1090-1095, 2019
162019
Action recognition with visual attention on skeleton images
Z Yang, Y Li, J Yang, J Luo
2018 24th International Conference on Pattern Recognition (ICPR), 3309-3314, 2018
132018
Scaling up vision-language pre-training for image captioning
X Hu, Z Gan, J Wang, Z Yang, Z Liu, Y Lu, L Wang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
112022
SAT: 2D Semantics Assisted Training for 3D Visual Grounding
Z Yang, S Zhang, L Wang, J Luo
IEEE International Conference on Computer Vision (ICCV), 2021
102021
UFO: A unified transformer for vision-language representation learning
J Wang, X Hu, Z Gan, Z Yang, X Dai, Z Liu, Y Lu, L Wang
arXiv preprint arXiv:2111.10023, 2021
92021
Grounding-tracking-integration
Z Yang, T Kumar, T Chen, J Su, J Luo
IEEE Transactions on Circuits and Systems for Video Technology 31 (9), 3433-3443, 2020
92020
Crossing the format boundary of text and boxes: Towards unified vision-language modeling
Z Yang, Z Gan, J Wang, X Hu, F Ahmed, Z Liu, Y Lu, L Wang
arXiv preprint arXiv:2111.12085, 2021
62021
Weakly Supervised Body Part Segmentation with Pose based Part Priors
Z Yang, Y Li, L Yang, N Zhang, J Luo
2020 25th International Conference on Pattern Recognition (ICPR), 286-293, 2021
5*2021
Personalized pose estimation for body language understanding
Z Yang, J Luo
2017 IEEE International Conference on Image Processing (ICIP), 126-130, 2017
42017
系统目前无法执行此操作,请稍后再试。
文章 1–20