Junjie Bai
Alibaba
Verified email at alibaba-inc.com
Title
Cited by
Year
Pytorch: An imperative style, high-performance deep learning library
A Paszke, S Gross, F Massa, A Lerer, J Bradbury, G Chanan, T Killeen, ...
Advances in neural information processing systems 32, 2019
Cited by: 42934, Year: 2019
Onnx: Open neural network exchange
J Bai, F Lu, K Zhang
GitHub repository, 2017
Cited by: 388, Year: 2017
Advances in Neural Information Processing Systems 32, Curran Associates
A Paszke, S Gross, F Massa, A Lerer, J Bradbury, G Chanan, T Killeen, ...
Inc., New York, 8024, 2019
Cited by: 63, Year: 2019
DISC: A dynamic shape compiler for machine learning workloads
K Zhu, WY Zhao, Z Zheng, TY Guo, PZ Zhao, JJ Bai, J Yang, XY Liu, ...
Proceedings of the 1st Workshop on Machine Learning and Systems, 89-95, 2021
Cited by: 22, Year: 2021
Parameter-efficient sparsity for large language models fine-tuning
Y Li, F Luo, C Tan, M Wang, S Huang, S Li, J Bai
arXiv preprint arXiv:2205.11005, 2022
Cited by: 12, Year: 2022
DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
M Li, T Cai, J Cao, Q Zhang, H Cai, J Bai, Y Jia, MY Liu, K Li, S Han
arXiv preprint arXiv:2402.19481, 2024
Cited by: 4, Year: 2024
Bladedisc: Optimizing dynamic shape machine learning workloads via compiler approach
Z Zheng, Z Pan, D Wang, K Zhu, W Zhao, T Guo, X Qiu, M Sun, J Bai, ...
Proceedings of the ACM on Management of Data 1 (3), 1-29, 2023
Cited by: 3, Year: 2023
RECom: A Compiler Approach to Accelerating Recommendation Model Inference with Massive Embedding Columns
Z Pan, Z Zheng, F Zhang, R Wu, H Liang, D Wang, X Qiu, J Bai, W Lin, ...
Proceedings of the 28th ACM International Conference on Architectural …, 2023
Cited by: 1, Year: 2023
MonoInfer: Enabling a New Monolithic Optimization Space for Neural Network Inference Tasks on Modern GPU-Centric Architectures
D Zhuang, Z Zheng, H Xia, X Qiu, J Bai, W Lin, SL Song