Follow
Shigang Li
Title
Cited by
Cited by
Year
Deep learning for post-processing ensemble weather forecasts
P Grönquist, C Yao, T Ben-Nun, N Dryden, P Dueben, S Li, T Hoefler
Philosophical Transactions of the Royal Society A 379 (2194), 20200092, 2021
1852021
Data Movement Is All You Need: A Case Study on Optimizing Transformers
A Ivanov, N Dryden, T Ben-Nun, S Li, T Hoefler
Proceedings of Machine Learning and Systems 3, 2021
1502021
Chimera: efficiently training large-scale neural networks with bidirectional pipelines
S Li, T Hoefler
Proceedings of the International Conference for High Performance Computing …, 2021
1242021
NUMA-aware shared-memory collective communication for MPI
S Li, T Hoefler, M Snir
Proceedings of the 22nd international symposium on High-performance parallel …, 2013
1202013
Parallel processing systems for big data: a survey
Y Zhang, T Cao, S Li, X Tian, L Yuan, H Jia, AV Vasilakos
Proceedings of the IEEE 104 (11), 2114-2136, 2016
1162016
CAS‐ESM 2: Description and climate simulation performance of the Chinese Academy of Sciences (CAS) Earth System Model (ESM) version 2
H Zhang, M Zhang, J Jin, K Fei, D Ji, C Wu, J Zhu, J He, Z Chai, J Xie, ...
Journal of Advances in Modeling Earth Systems, e2020MS002210, 2020
822020
Taming unbalanced training workloads in deep learning with partial collective operations
S Li, T Ben-Nun, SD Girolamo, D Alistarh, T Hoefler
Proceedings of the 25th ACM SIGPLAN Symposium on Principles and Practice of …, 2020
652020
Asynchronous Decentralized SGD with Quantized and Local Updates
G Nadiradze, A Sabour, P Davies, S Li, D Alistarh
Advances in Neural Information Processing Systems 34, 2021
572021
Intra-hour Photovoltaic Generation Forecasting based on Multi-source Data and Deep Learning Methods
T Yao, J Wang, H Wu, P Zhang, S Li, K Xu, X Liu, X Chi
IEEE Transactions on Sustainable Energy, 2021
532021
Near-optimal sparse allreduce for distributed deep learning
S Li, T Hoefler
Proceedings of the 27th ACM SIGPLAN Symposium on Principles and Practice of …, 2022
472022
Flare: flexible in-network allreduce
D De Sensi, S Di Girolamo, S Ashkboos, S Li, T Hoefler
Proceedings of the International Conference for High Performance Computing …, 2021
462021
Improved MPI collectives for MPI processes in shared address spaces
S Li, T Hoefler, C Hu, M Snir
Cluster Computing 17 (4), 1139-1155, 2014
342014
Efficient quantized sparse matrix operations on tensor cores
S Li, K Osawa, T Hoefler
SC22: International Conference for High Performance Computing, Networking …, 2022
322022
A photovoltaic power output dataset: Multi-source photovoltaic power output dataset with Python toolkit
T Yao, J Wang, H Wu, P Zhang, S Li, Y Wang, X Chi, M Shi
Solar Energy 230, 122-130, 2021
322021
Cache-oblivious MPI all-to-all communications based on Morton order
S Li, Y Zhang, T Hoefler
IEEE Transactions on Parallel and Distributed Systems, 2018
312018
Hammingmesh: A network topology for large-scale deep learning
T Hoefler, T Bonato, D De Sensi, S Di Girolamo, S Li, M Heddes, J Belk, ...
SC22: International Conference for High Performance Computing, Networking …, 2022
212022
Kernel optimization for short-range molecular dynamics
C Hu, X Wang, J Li, X He, S Li, Y Feng, S Yang, H Bai
Computer Physics Communications, 2016
212016
PipeFisher: Efficient Training of Large Language Models Using Pipelining and Fisher Information Matrices
K Osawa, S Li, T Hoefler
Proceedings of Machine Learning and Systems 5, 2023
202023
Efficient parallel optimizations of a high-performance SIFT on GPUs
Z Li, H Jia, Y Zhang, S Liu, S Li, X Wang, H Zhang
Journal of Parallel and Distributed Computing, 2018
202018
Why Dataset Properties Bound the Scalability of Parallel Machine Learning Training Algorithms
D Cheng, S Li, Z Hanping, F Xia, Y Zhang
IEEE Transactions on Parallel and Distributed Systems, 2021
19*2021
The system can't perform the operation now. Try again later.
Articles 1–20