Geolocalized modeling for dish recognition R Xu, L Herranz, S Jiang, S Wang, X Song, R Jain IEEE transactions on multimedia 17 (8), 1187-1199, 2015 | 50 | 2015 |
Depth CNNs for RGB-D scene recognition: learning from scratch better than transferring from RGB-CNNs X Song, L Herranz, S Jiang Thirty-First AAAI Conference on Artificial Intelligence, 2017 | 33 | 2017 |
Multi-scale multi-feature context modeling for scene recognition in the semantic manifold X Song, S Jiang, L Herranz IEEE Transactions on Image Processing 26 (6), 2721-2735, 2017 | 29 | 2017 |
Joint multi-feature spatial context for scene recognition on the semantic manifold X Song, S Jiang, L Herranz Proceedings of the IEEE conference on computer vision and pattern …, 2015 | 28 | 2015 |
Combining Models from Multiple Sources for RGB-D Scene Recognition. X Song, S Jiang, L Herranz IJCAI, 4523-4529, 2017 | 14 | 2017 |
Image captioning with both object and scene information X Li, X Song, L Herranz, Y Zhu, S Jiang Proceedings of the 24th ACM international conference on Multimedia, 1107-1110, 2016 | 12 | 2016 |
Category co-occurrence modeling for large scale scene recognition X Song, S Jiang, L Herranz, Y Kong, K Zheng Pattern Recognition 59, 98-111, 2016 | 11 | 2016 |
Relative image similarity learning with contextual information for Internet cross-media retrieval S Jiang, X Song, Q Huang Multimedia systems 20 (6), 645-657, 2014 | 10 | 2014 |
Learning effective RGB-D representations for scene recognition X Song, S Jiang, L Herranz, C Chen IEEE Transactions on Image Processing 28 (2), 980-993, 2018 | 7 | 2018 |
RGB-D scene recognition with object-to-object relation X Song, C Chen, S Jiang Proceedings of the 25th ACM international conference on Multimedia, 600-608, 2017 | 7 | 2017 |
Rich image description based on regions X Zhang, X Song, X Lv, S Jiang, Q Ye, J Jiao Proceedings of the 23rd ACM international conference on Multimedia, 1315-1318, 2015 | 4 | 2015 |
MIAR ICT Participation at Robot Vision 2013. R Xu, S Jiang, X Song, S Wang, Y Xie, F Wang, X Lv CLEF (Working Notes), 2013 | 4 | 2013 |
Keyword-driven image captioning via Context-dependent Bilateral LSTM X Zhang, S He, X Song, P Wei, S Jiang, Q Ye, J Jiao, RWH Lau 2017 IEEE International Conference on Multimedia and Expo (ICME), 781-786, 2017 | 3 | 2017 |
Joint Learning of CNN and LSTM for Image Captioning. Y Zhu, X Li, X Li, J Sun, X Song, S Jiang CLEF (Working Notes), 421-427, 2016 | 3 | 2016 |
Semantic features for food image recognition with geo-constraints X Song, S Jiang, R Xu, L Herranz 2014 IEEE International Conference on Data Mining Workshop, 1020-1025, 2014 | 3 | 2014 |
Deep Patch Representations with Shared Codebook for Scene Classification S Jiang, G Chen, X Song, L Liu ACM Transactions on Multimedia Computing, Communications, and Applications …, 2019 | 2 | 2019 |
Focal Loss for Region Proposal Network C Chen, X Song, S Jiang Chinese Conference on Pattern Recognition and Computer Vision (PRCV), 368-380, 2018 | 1 | 2018 |
Multipath Convolutional-Recursive Neural Networks for Object Recognition X Li, S Jiang, X Song, L Herranz, Z Shi International Conference on Intelligent Information Processing, 269-277, 2014 | 1 | 2014 |
Cross concept local Fisher discriminant analysis for image classification X Song, S Jiang, S Wang, J Tang, Q Huang Advances in Multimedia Modeling, 407-416, 2013 | 1 | 2013 |
Aberrance-aware Gradient-sensitive Attentions for Scene Recognition with RGB-D Videos X Song, S Zhang, Y Hua, S Jiang Proceedings of the 27th ACM International Conference on Multimedia, 1286-1294, 2019 | | 2019 |