Follow
Minchen Yu
Title
Cited by
Cited by
Year
{MArk}: Exploiting Cloud Services for {Cost-Effective},{SLO-Aware} Machine Learning Inference Serving
C Zhang, M Yu, W Wang, F Yan
2019 USENIX Annual Technical Conference (USENIX ATC 19), 1049-1062, 2019
1432019
Continuum: A platform for cost-aware, low-latency continual learning
H Tian, M Yu, W Wang
Proceedings of the ACM Symposium on Cloud Computing, 26-40, 2018
302018
Gillis: Serving Large Neural Networks in Serverless Functions with Automatic Model Partitioning
M Yu, Z Jiang, HC Ng, W Wang, R Chen, B Li
2021 IEEE 41st International Conference on Distributed Computing Systems …, 2021
192021
Enabling cost-effective, slo-aware machine learning inference serving on public cloud
C Zhang, M Yu, F Yan
IEEE Transactions on Cloud Computing, 2020
62020
Restructuring Serverless Computing with Data-Centric Function Orchestration
M Yu, T Cao, W Wang, R Chen
arXiv preprint arXiv:2109.13492, 2021
52021
{CrystalPerf}: Learning to Characterize the Performance of Dataflow Computation through Code Analysis
H Tian, M Yu, W Wang
2021 USENIX Annual Technical Conference (USENIX ATC 21), 253-267, 2021
32021
RepBun: Load-balanced, shuffle-free cluster caching for structured data
M Yu, Y Yu, Y Zheng, B Yang, W Wang
IEEE INFOCOM 2020-IEEE Conference on Computer Communications, 954-963, 2020
22020
Following the data, not the function: Rethinking function orchestration in serverless computing
M Yu, T Cao, W Wang, R Chen
arXiv preprint arXiv:2109.13492, 2021
2021
The system can't perform the operation now. Try again later.
Articles 1–8