Wencong Xiao

Cited by

	All	Since 2019
Citations	2338	2205
h-index	15	15
i10-index	17	17

600

300

150

450

2016201720182019202020212022202320249 21 92 182 312 416 515 592 185

Public access

View all

5 articles

3 articles

available

not available

Based on funding mandates

Co-authors

Fan YangMicrosoft ResearchVerified email at microsoft.com
Lidong ZhouMicrosoft ResearchVerified email at microsoft.com
Wei LinAlibabaVerified email at alibaba-inc.com
Ming WuMicrosoft ResearchVerified email at microsoft.com
Hanyu ZhaoAlibaba GroupVerified email at alibaba-inc.com
Jilong XueMicrosoft ResearchVerified email at microsoft.com
Lintao ZhangMicrosoft Research AsiaVerified email at microsoft.com
Chen ZhangShanghai Jiao Tong UniversityVerified email at sjtu.edu.cn
Shijie CaoMicrosoft Research AsiaVerified email at microsoft.com
Bojie LiStealth StartupVerified email at os.ai
Zhenhua HANMicrosoft Research AsiaVerified email at microsoft.com
Haoxiang LinMicrosoft Research AsiaVerified email at microsoft.com
Zain Zhenyuan RuanMIT CSAILVerified email at csail.mit.edu
Xianyan JiaAlibabaVerified email at alibaba-inc.com
Yangqing JiaFounder, Lepton AIVerified email at daggerfs.com
Yuanwei LuAI Stealth Startup

Wencong Xiao

Alibaba Group

Verified email at alibaba-inc.com - Homepage

Distributed system Machine learning system Resource management


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Gandiva: Introspective cluster scheduling for deep learning W Xiao, R Bhardwaj, R Ramjee, M Sivathanu, N Kwatra, Z Han, P Patel, ... 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2018	494	2018
Analysis of {Large-Scale}{Multi-Tenant}{GPU} clusters for {DNN} training workloads M Jeon, S Venkataraman, A Phanishayee, J Qian, W Xiao, F Yang 2019 USENIX Annual Technical Conference (USENIX ATC 19), 947-960, 2019	331	2019
Kv-direct: High-performance in-memory key-value store with programmable nic B Li, Z Ruan, W Xiao, Y Lu, Y Xiong, A Putnam, E Chen, L Zhang Proceedings of the 26th Symposium on Operating Systems Principles, 137-152, 2017	267	2017
Efficient and effective sparse LSTM on FPGA with bank-balanced sparsity S Cao, C Zhang, Z Yao, W Xiao, L Nie, D Zhan, Y Liu, M Wu, L Zhang Proceedings of the 2019 ACM/SIGDA International Symposium on Field …, 2019	190	2019
{MLaaS} in the wild: Workload analysis and scheduling in {Large-Scale} heterogeneous {GPU} clusters Q Weng, W Xiao, Y Yu, W Wang, C Wang, J He, Y Li, L Zhang, W Lin, ... 19th USENIX Symposium on Networked Systems Design and Implementation (NSDI …, 2022	165	2022
GraM: scaling graph computation to the trillions M Wu, F Yang, J Xue, W Xiao, Y Miao, L Wei, H Lin, Y Dai, L Zhou Proceedings of the Sixth ACM Symposium on Cloud Computing, 408-421, 2015	162	2015
{AntMan}: Dynamic scaling on {GPU} clusters for deep learning W Xiao, S Ren, Y Li, Y Zhang, P Hou, Z Li, Y Feng, W Lin, Y Jia 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2020	151	2020
Balanced sparsity for efficient dnn inference on gpu Z Yao, S Cao, W Xiao, C Zhang, L Nie Proceedings of the AAAI conference on artificial intelligence 33 (01), 5676-5683, 2019	118	2019
An empirical study on program failures of deep learning jobs R Zhang, W Xiao, H Zhang, Y Liu, H Lin, M Yang Proceedings of the ACM/IEEE 42nd international conference on software …, 2020	83	2020
Seernet: Predicting convolutional neural network feature-map sparsity through low-bit quantization S Cao, L Ma, W Xiao, C Zhang, Y Liu, L Zhang, L Nie, Z Yang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019	80	2019
{Tux²}: Distributed Graph Computation for Machine Learning W Xiao, J Xue, Y Miao, Z Li, C Chen, M Wu, W Li, L Zhou 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI …, 2017	77	2017
Multi-tenant GPU Clusters for Deep Learning Workloads: Analysis and Implications M Jeon, S Venkataraman, A Phanishayee, J Qian, W Xiao, F Yang MSR-TR-2018-13, 2018	74	2018
Zico: Efficient {GPU} memory sharing for concurrent {DNN} training G Lim, J Ahn, W Xiao, Y Kwon, M Jeon 2021 USENIX Annual Technical Conference (USENIX ATC 21), 161-175, 2021	34	2021
Whale: Efficient giant model training over heterogeneous {GPUs} X Jia, L Jiang, A Wang, W Xiao, Z Shi, J Zhang, X Li, L Chen, Y Li, ... 2022 USENIX Annual Technical Conference (USENIX ATC 22), 673-688, 2022	30	2022
Memory efficient loss recovery for hardware-based transport in datacenter Y Lu, G Chen, Z Ruan, W Xiao, B Li, J Zhang, Y Xiong, P Cheng, E Chen Proceedings of the First Asia-Pacific Workshop on Networking, 22-28, 2017	25	2017
Scheduling CPU for GPU-based deep learning jobs W Xiao, Z Han, H Zhao, X Peng, Q Zhang, F Yang, L Zhou Proceedings of the ACM Symposium on Cloud Computing, 503-503, 2018	11	2018
BeamRaster: a practical fast massive MU-MIMO system with pre-computed precoders M Meng, W Xiao, T He, Y Tao, K Tan, J Zhang, W Wang IEEE Transactions on Mobile Computing 18 (5), 1014-1027, 2018	11	2018
Distributed graph computation meets machine learning W Xiao, J Xue, Y Miao, Z Li, C Chen, M Wu, W Li, L Zhou IEEE Transactions on Parallel and Distributed Systems 31 (7), 1588-1604, 2020	9	2020
Cognn: efficient scheduling for concurrent gnn training on gpus Q Sun, Y Liu, H Yang, R Zhang, M Dun, M Li, X Liu, W Xiao, Y Li, Z Luan, ... SC22: International Conference for High Performance Computing, Networking …, 2022	8	2022
Easyscale: Accuracy-consistent elastic training for deep learning M Li, W Xiao, B Sun, H Zhao, H Yang, S Ren, Z Luan, X Jia, Y Liu, Y Li, ... arXiv preprint arXiv:2208.14228, 2022	4	2022

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors