Qizhen Weng
Title · Cited by · Year
MLaaS in the Wild: Workload Analysis and Scheduling in Large-Scale Heterogeneous GPU Clusters
Q Weng, W Xiao, Y Yu, W Wang, C Wang, J He, Y Li, L Zhang, W Lin, ...
19th USENIX Symposium on Networked Systems Design and Implementation (NSDI …, 2022
Cited by 161 · 2022
Metis: Learning to schedule long-running applications in shared container clusters at scale
L Wang, Q Weng, W Wang, C Chen, B Li
SC20: International Conference for High Performance Computing, Networking …, 2020
Cited by 40 · 2020
Fast distributed deep learning via worker-adaptive batch sizing
C Chen, Q Weng, W Wang, B Li, B Li
Proceedings of the ACM Symposium on Cloud Computing, 521-521, 2018
Cited by 28 · 2018
Semi-dynamic load balancing: Efficient distributed learning in non-dedicated environments
C Chen, Q Weng, W Wang, B Li, B Li
Proceedings of the 11th ACM Symposium on Cloud Computing, 431-446, 2020
Cited by 21 · 2020
Opus: Fair and efficient cache sharing for in-memory data analytics
Y Yu, W Wang, J Zhang, Q Weng, KB Letaief
2018 IEEE 38th International Conference on Distributed Computing Systems …, 2018
Cited by 14 · 2018
Beware of Fragmentation: Scheduling GPU-Sharing Workloads with Fragmentation Gradient Descent
Q Weng, L Yang, Y Yu, W Wang, X Tang, G Yang, L Zhang
2023 USENIX Annual Technical Conference (USENIX ATC 23), 995-1008, 2023
Cited by 7 · 2023
Workload consolidation in Alibaba clusters: the good, the bad, and the ugly
Y Zhang, Y Yu, W Wang, Q Chen, J Wu, Z Zhang, J Zhong, T Ding, ...
Proceedings of the 13th Symposium on Cloud Computing, 210-225, 2022
Cited by 6 · 2022
Accelerating distributed learning in non-dedicated environments
C Chen, Q Weng, W Wang, B Li, B Li
IEEE Transactions on Cloud Computing 11 (1), 515-531, 2021
Cited by 6 · 2021
InternLM2 technical report
Z Cai, M Cao, H Chen, K Chen, K Chen, X Chen, X Chen, Z Chen, Z Chen, ...
arXiv preprint arXiv:2403.17297, 2024
Cited by 4 · 2024
Towards framework-independent, non-intrusive performance characterization for dataflow computation
H Tian, Q Weng, W Wang
Proceedings of the 10th ACM SIGOPS Asia-Pacific Workshop on Systems, 54-60, 2019
Cited by 3 · 2019
CaraServe: CPU-Assisted and Rank-Aware LoRA Serving for Generative LLM Inference
S Li, H Lu, T Wu, M Yu, Q Weng, X Chen, Y Shan, B Yuan, W Wang
arXiv preprint arXiv:2401.11240, 2024
Cited by 1 · 2024