Title | Authors | Venue | Cited by | Year |
Geolocalized modeling for dish recognition | R Xu, L Herranz, S Jiang, S Wang, X Song, R Jain | IEEE Transactions on Multimedia 17 (8), 1187-1199, 2015 | 86 | 2015 |
Depth CNNs for RGB-D scene recognition: Learning from scratch better than transferring from RGB-CNNs | X Song, L Herranz, S Jiang | Thirty-First AAAI Conference on Artificial Intelligence, 2017 | 66 | 2017 |
Multi-scale multi-feature context modeling for scene recognition in the semantic manifold | X Song, S Jiang, L Herranz | IEEE Transactions on Image Processing 26 (6), 2721-2735, 2017 | 54 | 2017 |
Learning effective RGB-D representations for scene recognition | X Song, S Jiang, L Herranz, C Chen | IEEE Transactions on Image Processing 28 (2), 980-993, 2018 | 38 | 2018 |
Joint multi-feature spatial context for scene recognition on the semantic manifold | X Song, S Jiang, L Herranz | Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2015 | 34 | 2015 |
Spatio-temporal memory attention for image captioning | J Ji, C Xu, X Zhang, B Wang, X Song | IEEE Transactions on Image Processing 29, 7615-7628, 2020 | 27 | 2020 |
Image representations with spatial object-to-object relations for RGB-D scene recognition | X Song, S Jiang, B Wang, C Chen, G Chen | IEEE Transactions on Image Processing 29, 525-537, 2019 | 25 | 2019 |
Scene recognition with prototype-agnostic scene layout | G Chen, X Song, H Zeng, S Jiang | IEEE Transactions on Image Processing 29, 5877-5888, 2020 | 24 | 2020 |
Combining Models from Multiple Sources for RGB-D Scene Recognition | X Song, S Jiang, L Herranz | IJCAI, 4523-4529, 2017 | 23 | 2017 |
Image captioning with both object and scene information | X Li, X Song, L Herranz, Y Zhu, S Jiang | Proceedings of the 24th ACM International Conference on Multimedia, 1107-1110, 2016 | 21 | 2016 |
Image captioning via semantic element embedding | X Zhang, S He, X Song, RWH Lau, J Jiao, Q Ye | Neurocomputing 395, 212-221, 2020 | 17 | 2020 |
Deep patch representations with shared codebook for scene classification | S Jiang, G Chen, X Song, L Liu | ACM Transactions on Multimedia Computing, Communications, and Applications …, 2019 | 17 | 2019 |
Category co-occurrence modeling for large scale scene recognition | X Song, S Jiang, L Herranz, Y Kong, K Zheng | Pattern Recognition 59, 98-111, 2016 | 17 | 2016 |
RGB-D scene recognition with object-to-object relation | X Song, C Chen, S Jiang | Proceedings of the 25th ACM International Conference on Multimedia, 600-608, 2017 | 15 | 2017 |
Relative image similarity learning with contextual information for Internet cross-media retrieval | S Jiang, X Song, Q Huang | Multimedia Systems 20 (6), 645-657, 2014 | 14 | 2014 |
Learning scene attribute for scene recognition | H Zeng, X Song, G Chen, S Jiang | IEEE Transactions on Multimedia 22 (6), 1519-1530, 2019 | 8 | 2019 |
Keyword-driven image captioning via Context-dependent Bilateral LSTM | X Zhang, S He, X Song, P Wei, S Jiang, Q Ye, J Jiao, RWH Lau | 2017 IEEE International Conference on Multimedia and Expo (ICME), 781-786, 2017 | 7 | 2017 |
Generalized zero-shot learning with multi-source semantic embeddings for scene recognition | X Song, H Zeng, S Zhang, L Herranz, S Jiang | Proceedings of the 28th ACM International Conference on Multimedia, 3976-3985, 2020 | 6 | 2020 |
Aberrance-aware gradient-sensitive attentions for scene recognition with RGB-D videos | X Song, S Zhang, Y Hua, S Jiang | Proceedings of the 27th ACM International Conference on Multimedia, 1286-1294, 2019 | 6 | 2019 |
Rich image description based on regions | X Zhang, X Song, X Lv, S Jiang, Q Ye, J Jiao | Proceedings of the 23rd ACM International Conference on Multimedia, 1315-1318, 2015 | 6 | 2015 |