Gandiva: Introspective cluster scheduling for deep learning W Xiao, R Bhardwaj, R Ramjee, M Sivathanu, N Kwatra, Z Han, P Patel, ... 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2018 | 494 | 2018 |
Analysis of Large-Scale Multi-Tenant GPU clusters for DNN training workloads M Jeon, S Venkataraman, A Phanishayee, J Qian, W Xiao, F Yang 2019 USENIX Annual Technical Conference (USENIX ATC 19), 947-960, 2019 | 331 | 2019 |
KV-Direct: High-performance in-memory key-value store with programmable NIC B Li, Z Ruan, W Xiao, Y Lu, Y Xiong, A Putnam, E Chen, L Zhang Proceedings of the 26th Symposium on Operating Systems Principles, 137-152, 2017 | 267 | 2017 |
Efficient and effective sparse LSTM on FPGA with bank-balanced sparsity S Cao, C Zhang, Z Yao, W Xiao, L Nie, D Zhan, Y Liu, M Wu, L Zhang Proceedings of the 2019 ACM/SIGDA International Symposium on Field …, 2019 | 190 | 2019 |
MLaaS in the wild: Workload analysis and scheduling in Large-Scale heterogeneous GPU clusters Q Weng, W Xiao, Y Yu, W Wang, C Wang, J He, Y Li, L Zhang, W Lin, ... 19th USENIX Symposium on Networked Systems Design and Implementation (NSDI …, 2022 | 165 | 2022 |
GraM: scaling graph computation to the trillions M Wu, F Yang, J Xue, W Xiao, Y Miao, L Wei, H Lin, Y Dai, L Zhou Proceedings of the Sixth ACM Symposium on Cloud Computing, 408-421, 2015 | 162 | 2015 |
AntMan: Dynamic scaling on GPU clusters for deep learning W Xiao, S Ren, Y Li, Y Zhang, P Hou, Z Li, Y Feng, W Lin, Y Jia 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2020 | 151 | 2020 |
Balanced sparsity for efficient DNN inference on GPU Z Yao, S Cao, W Xiao, C Zhang, L Nie Proceedings of the AAAI conference on artificial intelligence 33 (01), 5676-5683, 2019 | 118 | 2019 |
An empirical study on program failures of deep learning jobs R Zhang, W Xiao, H Zhang, Y Liu, H Lin, M Yang Proceedings of the ACM/IEEE 42nd international conference on software …, 2020 | 83 | 2020 |
SeerNet: Predicting convolutional neural network feature-map sparsity through low-bit quantization S Cao, L Ma, W Xiao, C Zhang, Y Liu, L Zhang, L Nie, Z Yang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019 | 80 | 2019 |
Tux²: Distributed Graph Computation for Machine Learning W Xiao, J Xue, Y Miao, Z Li, C Chen, M Wu, W Li, L Zhou 14th USENIX Symposium on Networked Systems Design and Implementation (NSDI …, 2017 | 77 | 2017 |
Multi-tenant GPU Clusters for Deep Learning Workloads: Analysis and Implications M Jeon, S Venkataraman, A Phanishayee, J Qian, W Xiao, F Yang MSR-TR-2018-13, 2018 | 74 | 2018 |
Zico: Efficient GPU memory sharing for concurrent DNN training G Lim, J Ahn, W Xiao, Y Kwon, M Jeon 2021 USENIX Annual Technical Conference (USENIX ATC 21), 161-175, 2021 | 34 | 2021 |
Whale: Efficient giant model training over heterogeneous GPUs X Jia, L Jiang, A Wang, W Xiao, Z Shi, J Zhang, X Li, L Chen, Y Li, ... 2022 USENIX Annual Technical Conference (USENIX ATC 22), 673-688, 2022 | 30 | 2022 |
Memory efficient loss recovery for hardware-based transport in datacenter Y Lu, G Chen, Z Ruan, W Xiao, B Li, J Zhang, Y Xiong, P Cheng, E Chen Proceedings of the First Asia-Pacific Workshop on Networking, 22-28, 2017 | 25 | 2017 |
Scheduling CPU for GPU-based deep learning jobs W Xiao, Z Han, H Zhao, X Peng, Q Zhang, F Yang, L Zhou Proceedings of the ACM Symposium on Cloud Computing, 503-503, 2018 | 11 | 2018 |
BeamRaster: a practical fast massive MU-MIMO system with pre-computed precoders M Meng, W Xiao, T He, Y Tao, K Tan, J Zhang, W Wang IEEE Transactions on Mobile Computing 18 (5), 1014-1027, 2018 | 11 | 2018 |
Distributed graph computation meets machine learning W Xiao, J Xue, Y Miao, Z Li, C Chen, M Wu, W Li, L Zhou IEEE Transactions on Parallel and Distributed Systems 31 (7), 1588-1604, 2020 | 9 | 2020 |
CoGNN: efficient scheduling for concurrent GNN training on GPUs Q Sun, Y Liu, H Yang, R Zhang, M Dun, M Li, X Liu, W Xiao, Y Li, Z Luan, ... SC22: International Conference for High Performance Computing, Networking …, 2022 | 8 | 2022 |
EasyScale: Accuracy-consistent elastic training for deep learning M Li, W Xiao, B Sun, H Zhao, H Yang, S Ren, Z Luan, X Jia, Y Liu, Y Li, ... arXiv preprint arXiv:2208.14228, 2022 | 4 | 2022 |