A real-time hand posture recognition system using deep neural networks A Tang, K Lu, Y Wang, J Huang, H Li ACM Transactions on Intelligent Systems and Technology (TIST) 6 (2), 1-23, 2015 | 191 | 2015 |
Skeleton key: Image captioning by skeleton-attribute decomposition Y Wang, Z Lin, X Shen, S Cohen, GW Cottrell Proceedings of the IEEE conference on computer vision and pattern …, 2017 | 140 | 2017 |
Event-specific image importance Y Wang, Z Lin, X Shen, R Mech, G Miller, GW Cottrell Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2016 | 50 | 2016 |
Text with knowledge graph augmented transformer for video captioning X Gu, G Chen, Y Wang, L Zhang, T Luo, L Wen Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023 | 47 | 2023 |
Scenes-objects-actions: A multi-task, multi-label video dataset J Ray, H Wang, D Tran, Y Wang, M Feiszli, L Torresani, M Paluri Proceedings of the European conference on computer vision (ECCV), 635-651, 2018 | 38 | 2018 |
Convolutional neural network and convex optimization S Chen, Y Wang Dept. of Elect. and Comput. Eng., Univ. of California at San Diego, San …, 2014 | 26 | 2014 |
A Deep Siamese Neural Network Learns the Human-Perceived Similarity Structure of Facial Expressions Without Explicit Categories SJ Rao, Y Wang, GW Cottrell Proceedings of the Annual Meeting of the Cognitive Science Society 38, 2016 | 21 | 2016 |
Bikers are like tobacco shops, formal dressers are like suits: Recognizing urban tribes with Caffe Y Wang, GW Cottrell 2015 IEEE Winter Conference on Applications of Computer Vision, 876-883, 2015 | 20 | 2015 |
Concept mask: Large-scale segmentation from semantic concepts Y Wang, Z Lin, X Shen, J Zhang, S Cohen Proceedings of the European Conference on Computer Vision (ECCV), 530-546, 2018 | 18 | 2018 |
Recognizing and curating photo albums via event-specific image importance Y Wang, Z Lin, X Shen, R Mech, G Miller, GW Cottrell arXiv preprint arXiv:1707.05911, 2017 | 17 | 2017 |
Structured context transformer for generic event boundary detection C Li, X Wang, D Hong, Y Wang, L Zhang, T Luo, L Wen arXiv preprint arXiv:2206.02985, 2022 | 8 | 2022 |
Dual-stream transformer for generic event boundary captioning X Gu, H Ye, G Chen, Y Wang, L Zhang, L Wen arXiv preprint arXiv:2207.03038, 2022 | 2 | 2022 |
Unidual: A unified model for image and video understanding Y Wang, D Tran, L Torresani arXiv preprint arXiv:1906.03857, 2019 | 2 | 2019 |
Skeleton Key: Image Captioning by Skeleton-Attribute Decomposition (Supplementary Material) Y Wang, Z Lin, X Shen, S Cohen, GW Cottrell | | |
Event-specific Image Importance (Supplementary Material) Y Wang, Z Lin, X Shen, R Mech, G Miller, GW Cottrell | | |