Ching-Hsiang Chu

Cited by

	All	Since 2019
Citations	826	707
h-index	18	16
i10-index	28	22

Citations per year

Year	Citations
2010	3
2011	6
2012	10
2013	12
2014	4
2015	9
2016	12
2017	19
2018	41
2019	59
2020	105
2021	139
2022	160
2023	197
2024	47

Public access

18 articles

5 articles

available

not available

Based on funding mandates

Co-authors

Dhabaleswar K. Panda: Professor of Computer Science, The Ohio State University (verified email at cse.ohio-state.edu)
Hari Subramoni: The Ohio State University (verified email at cse.ohio-state.edu)
Ammar Ahmad Awan: Microsoft (verified email at osu.edu)
Khaled Hamidouche: AMD Research (verified email at amd.com)
Kawthar Shafie Khorassani: The Ohio State University (verified email at osu.edu)
Akshay Venkatesh: NVIDIA; Ohio State University (verified email at nvidia.com)
Eric Hsiao-kuang Wu: National Central University (verified email at csie.ncu.edu.tw)
Xiaoyi Lu: Assistant Professor, University of California, Merced (verified email at ucmerced.edu)
Jahanzeb Hashmi: Senior Architect, NVIDIA (verified email at nvidia.com)
Pouya Kousha: Research Assistant, The Ohio State University (verified email at osu.edu)
(Altamont) Bracy Hamilton Elton: Penguin Computing (verified email at bracyelton.com)
Mohammadreza Bayatpour (Mamzi): NVIDIA, The Ohio State University (verified email at nvidia.com)
Arpan Jain: The Ohio State University (verified email at osu.edu)
Karthik Vadambacheri Manian: St. Jude Children's Research Hospital (verified email at stjude.org)
Min-Te Sun: Professor of Computer Science and Information Engineering, National Central University (verified email at csie.ncu.edu.tw)
Srinivas Sridharan, Phd: Distinguished Engineer, NVIDIA (verified email at nvidia.com)
Qinghua Zhou: The Ohio State University (verified email at osu.edu)
Mustafa Ozdal: Meta (verified email at meta.com)
Dheevatsa Mudigere: Distinguished Engineer, NVIDIA (verified email at nvidia.com)
Liang Luo: University of Washington (verified email at cs.washington.edu)

Ching-Hsiang Chu

Research Scientist, Meta/Facebook

Verified email at meta.com

Research interests: High-performance Computing, GPU, GPU Communication, Wireless Networks


Title	Authors	Venue	Cited by	Year
Software-hardware co-design for fast and scalable training of deep learning recommendation models	D Mudigere, Y Hao, J Huang, Z Jia, A Tulloch, S Sridharan, X Liu, ...	Proceedings of the 49th Annual International Symposium on Computer …, 2022	75	2022
The MVAPICH project: Transforming research into high-performance MPI library for HPC community	DK Panda, H Subramoni, CH Chu, M Bayatpour	Journal of Computational Science 52, 101208, 2021	61	2021
Scalable distributed DNN training using TensorFlow and CUDA-aware MPI: Characterization, designs, and performance evaluation	AA Awan, J Bédorf, CH Chu, H Subramoni, DK Panda	2019 19th IEEE/ACM International Symposium on Cluster, Cloud and Grid …, 2019	56	2019
Optimized broadcast for deep learning workloads on dense-GPU InfiniBand clusters: MPI or NCCL?	AA Awan, CH Chu, H Subramoni, DK Panda	Proceedings of the 25th European MPI Users' Group Meeting, 1-9, 2018	52	2018
NV-Group: Link-efficient reduction for distributed deep learning on modern dense GPU systems	CH Chu, P Kousha, AA Awan, KS Khorassani, H Subramoni, DK Panda	Proceedings of the 34th ACM International Conference on Supercomputing, 1-12, 2020	39	2020
M. khorashadi, P	D Mudigere, Y Hao, J Huang, Z Jia, A Tulloch, S Sridharan, X Liu, ...	Bhattacharya, P. Lapukhov, M. Naumov, L. Qiao, M. Smelyanskiy, B. Jia, and V …, 2021	38	2021
OC-DNN: Exploiting advanced unified memory capabilities in CUDA 9 and Volta GPUs for out-of-core DNN training	AA Awan, CH Chu, H Subramoni, X Lu, DK Panda	2018 IEEE 25th International Conference on High Performance Computing (HiPC …, 2018	36	2018
High-performance, distributed training of large-scale deep learning recommendation models	D Mudigere, Y Hao, J Huang, A Tulloch, S Sridharan, X Liu, M Ozdal, ...	arXiv preprint arXiv:2104.05158, 2021	31	2021
Improving SCTP performance by jitter-based congestion control over wired-wireless networks	JM Chen, CH Chu, EHK Wu, MF Tsai, JR Wang	EURASIP Journal on Wireless Communications and Networking 2011, 1-13, 2011	27	2011
CUDA kernel based collective reduction operations on large-scale GPU clusters	CH Chu, K Hamidouche, A Venkatesh, AA Awan, DK Panda	2016 16th IEEE/ACM International Symposium on Cluster, Cloud and Grid …, 2016	26	2016
Performance evaluation of MPI libraries on GPU-enabled OpenPOWER architectures: Early experiences	KS Khorassani, CH Chu, H Subramoni, DK Panda	High Performance Computing: ISC High Performance 2019 International …, 2019	25	2019
Designing high-performance MPI libraries with on-the-fly compression for modern GPU clusters	Q Zhou, C Chu, NS Kumar, P Kousha, SM Ghazimirsaeed, H Subramoni, ...	2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2021	24	2021
Efficient and scalable multi-source streaming broadcast on GPU clusters for deep learning	CH Chu, X Lu, AA Awan, H Subramoni, J Hashmi, B Elton, DK Panda	2017 46th International Conference on Parallel Processing (ICPP), 161-170, 2017	24	2017
Exploiting GPUDirect RDMA in designing high performance OpenSHMEM for NVIDIA GPU clusters	K Hamidouche, A Venkatesh, AA Awan, H Subramoni, CH Chu, ...	2015 IEEE International Conference on Cluster Computing, 78-87, 2015	24	2015
Communication profiling and characterization of deep-learning workloads on clusters with high-performance interconnects	AA Awan, A Jain, CH Chu, H Subramoni, DK Panda	IEEE Micro 40 (1), 35-43, 2019	21	2019
Characterizing CUDA unified memory (UM)-aware MPI designs on modern GPU architectures	KV Manian, AA Ammar, A Ruhela, CH Chu, H Subramoni, DK Panda	Proceedings of the 12th Workshop on General Purpose Processing Using GPUs, 43-52, 2019	20	2019
Designing a profiling and visualization tool for scalable and in-depth analysis of high-performance GPU clusters	P Kousha, B Ramesh, KK Suresh, CH Chu, A Jain, N Sarkauskas, ...	2019 IEEE 26th International Conference on High Performance Computing, Data …, 2019	19	2019
IVC: Imperceptible video communication	R Carvalho, CH Chu, LJ Chen	Proc. of HotMobile (poster), 2014	18	2014
Distributed topology control for energy-efficient and reliable wireless communications	MT Sun, CH Chu, EHK Wu, CS Hsiao, AAK Jeng	IEEE Systems Journal 12 (3), 2152-2161, 2017	17	2017
Optimized large-message broadcast for deep learning workloads: MPI, MPI+NCCL, or NCCL2?	AA Awan, KV Manian, CH Chu, H Subramoni, DK Panda	Parallel Computing 85, 141-152, 2019	16	2019
