Follow
Cody (Hao) Yu
Cody (Hao) Yu
Software Engineer @ Anyscale | ex-Amazonian | UCLA PhD ‘19
Verified email at anyscale.com - Homepage
Title
Cited by
Cited by
Year
Efficient memory management for large language model serving with pagedattention
W Kwon, Z Li, S Zhuang, Y Sheng, L Zheng, CH Yu, J Gonzalez, H Zhang, ...
Proceedings of the 29th Symposium on Operating Systems Principles, 611-626, 2023
9912023
Automated systolic array architecture synthesis for high throughput CNN inference on FPGAs
X Wei, CH Yu, P Zhang, Y Chen, Y Wang, H Hu, Y Liang, J Cong
Proceedings of the 54th Annual Design Automation Conference 2017, 1-6, 2017
4832017
Ansor: Generating {High-Performance} tensor programs for deep learning
L Zheng, C Jia, M Sun, Z Wu, CH Yu, A Haj-Ali, Y Wang, J Yang, D Zhuo, ...
14th USENIX symposium on operating systems design and implementation (OSDI …, 2020
4022020
HeteroCL: A multi-paradigm programming infrastructure for software-defined reconfigurable computing
YH Lai, Y Chi, Y Hu, J Wang, CH Yu, Y Zhou, J Cong, Z Zhang
Proceedings of the 2019 ACM/SIGDA International Symposium on Field …, 2019
1162019
Programming and runtime support to blaze FPGA accelerator deployment at datacenter scale
M Huang, D Wu, CH Yu, Z Fang, M Interlandi, T Condie, J Cong
Proceedings of the Seventh ACM Symposium on Cloud Computing, 456-469, 2016
1122016
AutoDSE: Enabling software programmers to design efficient FPGA accelerators
A Sohrabizadeh, CH Yu, M Gao, J Cong
ACM Transactions on Design Automation of Electronic Systems (TODAES) 27 (4 …, 2022
932022
Automated accelerator generation and optimization with composable, parallel and pipeline architecture
J Cong, P Wei, CH Yu, P Zhang
Proceedings of the 55th Annual Design Automation Conference, 1-6, 2018
832018
TGPA: Tile-grained pipeline architecture for low latency CNN inference
X Wei, Y Liang, X Li, CH Yu, P Zhang, J Cong
2018 IEEE/ACM International Conference on Computer-Aided Design (ICCAD), 1-8, 2018
812018
Bandwidth Optimization Through On-Chip Memory Restructuring for HLS
J Cong, P Wei, CH Yu, P Zhou
652017
Tensorir: An abstraction for automatic tensorized program optimization
S Feng, B Hou, H Jin, W Lin, J Shao, R Lai, Z Ye, L Zheng, CH Yu, Y Yu, ...
Proceedings of the 28th ACM International Conference on Architectural …, 2023
642023
The SMEM Seeding Acceleration for DNA Sequence Alignment
MCF Chang, YT Chen, J Cong, PT Huang, CL Kuo, CH Yu
The 24th IEEE International Symposium on Field-Programmable Custom Computing …, 2016
642016
Efficiently programming large language models using sglang
L Zheng, L Yin, Z Xie, J Huang, C Sun, C Hao Yu, S Cao, C Kozyrakis, ...
arXiv e-prints, arXiv: 2312.07104, 2023
622023
Systems and methods for systolic array design from a high-level program
P Zhang, CH Yu, X Wei, P Pan
US Patent 10,838,910, 2020
622020
S2FA: An accelerator automation framework for heterogeneous computing in datacenters
CH Yu, P Wei, M Grossman, P Zhang, V Sarker, J Cong
Proceedings of the 55th Annual Design Automation Conference, 1-6, 2018
482018
On the preconditioner of conjugate gradient method: a power grid simulation perspective
CH Chou, NY Tsai, H Yu, CR Lee, Y Shi, SC Chang
Proceedings of the International Conference on Computer-Aided Design, 494-497, 2010
402010
Best-effort FPGA programming: A few steps can go a long way
J Cong, Z Fang, Y Hao, P Wei, CH Yu, C Zhang, P Zhou
arXiv preprint arXiv:1807.01340, 2018
342018
DietCode: Automatic optimization for dynamic tensor programs
B Zheng, Z Jiang, CH Yu, H Shen, J Fromm, Y Liu, Y Wang, L Ceze, ...
Proceedings of Machine Learning and Systems 4, 848-863, 2022
332022
Heterogeneous datacenters: Options and opportunities
J Cong, M Huang, D Wu, CH Yu
Proceedings of the 53rd Annual Design Automation Conference, 1-6, 2016
302016
Analysis and optimization of the implicit broadcasts in FPGA HLS to improve maximum frequency
L Guo, J Lau, Y Chi, J Wang, CH Yu, Z Chen, Z Zhang, J Cong
2020 57th ACM/IEEE Design Automation Conference (DAC), 1-6, 2020
282020
Latte: Locality Aware Transformation for High-Level Synthesis
J Cong, P Wei, CH Yu, P Zhou
252018
The system can't perform the operation now. Try again later.
Articles 1–20