vPipe: A Virtualized Acceleration System for Achieving Efficient and Scalable Pipeline Parallel DNN Training; S Zhao, F Li, X Chen, X Guan, J Jiang, D Huang, Y Qing, S Wang, P Wang, ...; IEEE Transactions on Parallel and Distributed Systems 33 (3), 489-506, 2021 | 24 | 2021 |
NASPipe: High Performance and Reproducible Pipeline Parallel Supernet Training via Causal Synchronous Parallelism; S Zhao, F Li, X Chen, T Shen, L Chen, S Wang, N Zhang, C Li, H Cui; Proceedings of the 27th ACM International Conference on Architectural …, 2022 | 7 | 2022 |
Fold3D: Rethinking and Parallelizing Computational and Communicational Tasks in the Training of Large DNN Models; F Li, S Zhao, Y Qing, X Chen, X Guan, S Wang, G Zhang, H Cui; IEEE Transactions on Parallel and Distributed Systems 34 (5), 1432-1449, 2023 | 2 | 2023 |
HAMS: High Availability for Distributed Machine Learning Service Graphs; S Zhao, X Chen, C Wang, F Li, Q Ji, H Cui, C Li, S Wang; 2020 50th Annual IEEE/IFIP International Conference on Dependable Systems …, 2020 | 2 | 2020 |
AMPipe: Accelerating MoE Model Training with Intra-Block Pipelining; Y Fu, Q Yuhao, S Zhao, F Li, B Xiao, D Huang, H Cui | | 2023 |
Neural Architecture Search via Ensemble-based Knowledge Distillation; F Li, S Zhao, H Pi, Q Yuhao, Y Fu, S Wang, H Cui | | 2021 |