Sibo Song

Cited by

	All	Since 2019
Citations	433	351
h-index	9	9
i10-index	9	9

Year	Citations
2015	2
2016	10
2017	24
2018	46
2019	66
2020	57
2021	64
2022	63
2023	83
2024	18

Public access

2 articles

0 articles

available

not available

Based on funding mandates

Sibo Song

Alibaba

Verified email at alibaba-inc.com

computer vision, deep learning, multimodal learning


Title	Authors	Venue	Cited by	Year
On classification of distorted images with deep convolutional neural networks	Y Zhou, S Song, NM Cheung	Acoustics, Speech and Signal Processing (ICASSP), 2017 IEEE International …, 2017	140	2017
Multimodal multi-stream deep learning for egocentric activity recognition	S Song, V Chandrasekhar, B Mandal, L Li, JH Lim, G Sateesh Babu, ...	Proceedings of the IEEE conference on computer vision and pattern …, 2016	93	2016
Egocentric activity recognition with multimodal fisher vector	S Song, NM Cheung, V Chandrasekhar, B Mandal, J Liri	2016 IEEE International conference on acoustics, speech and signal …, 2016	47	2016
Activity recognition in egocentric life-logging videos	S Song, V Chandrasekhar, NM Cheung, S Narayan, L Li, JH Lim	Computer Vision-ACCV 2014 Workshops: Singapore, Singapore, November 1-2 …, 2015	45	2015
Defense against adversarial attacks with saak transform	S Song, Y Chen, NM Cheung, CCJ Kuo	arXiv preprint arXiv:1808.01785, 2018	28	2018
Truly multi-modal youtube-8m video classification with video, audio, and text	Z Wang, K Kuan, M Ravaut, G Manek, S Song, Y Fang, S Kim, N Chen, ...	arXiv preprint arXiv:1706.05461, 2017	26	2017
Vision-language pre-training for boosting scene text detectors	S Song, J Wan, Z Yang, J Tang, W Cheng, X Bai, C Yao	Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022	19	2022
Saak transform-based machine learning for light-sheet imaging of cardiac trabeculation	Y Ding, V Gudapati, R Lin, Y Fei, RRS Packard, S Song, CC Chang, ...	IEEE Transactions on Biomedical Engineering 68 (1), 225-235, 2020	19	2020
Deep Adaptive Temporal Pooling for Activity Recognition	S Song, NM Cheung, V Chandrasekhar, B Mandal	2018 ACM Multimedia Conference on Multimedia Conference, 1829-1837, 2018	11	2018
Modeling entities as semantic points for visual information extraction in the wild	Z Yang, R Long, P Wang, S Song, H Zhong, W Cheng, X Bai, C Yao	Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	5	2023
OmniParser: A Unified Framework for Text Spotting, Key Information Extraction and Table Recognition	J Wan, S Song, W Yu, Y Liu, W Cheng, F Huang, X Bai, C Yao, Z Yang	arXiv preprint arXiv:2403.19128, 2024		2024
ICDAR 2023 Competition on Born Digital Video Text Question Answering	Z Yang, X Song, S Song, T Lu, X Bai, CL Liu, F Huang, C Yao	International Conference on Document Analysis and Recognition, 508-521, 2023		2023
Towards Multimodal and Secure Deep Learning for Human Activity Recognition from Multiple Views	S Song	Singapore University of Technology and Design, 2018		2018

Articles 1–13
