Chunan Shi
Verified email at pku.edu.cn
Title
Cited by
Year
Galvatron: Efficient transformer training over multiple GPUs using automatic parallelism
X Miao, Y Wang, Y Jiang, C Shi, X Nie, H Zhang, B Cui
arXiv preprint arXiv:2211.13878, 2022
Cited by 31, 2022
SpotServe: Serving generative large language models on preemptible instances
X Miao, C Shi, J Duan, X Xi, D Lin, B Cui, Z Jia
Proceedings of the 29th ACM International Conference on Architectural …, 2024
Cited by 14, 2024
Clover: Regressive Lightweight Speculative Decoding with Sequential Knowledge
B Xiao, C Shi, X Nie, F Yang, X Deng, L Su, W Chen, B Cui
arXiv preprint arXiv:2405.00263, 2024
2024
SpecInfer: Accelerating Large Language Model Serving with Tree-based Speculative Inference and Verification
X Miao, G Oliaro, Z Zhang, X Cheng, Z Wang, Z Zhang, RYY Wong, A Zhu, ...
Proceedings of the 29th ACM International Conference on Architectural …, 2024
2024