A survey of deep learning techniques for neural machine translation S Yang, Y Wang, X Chu arXiv preprint arXiv:2002.07526, 2020 | 167 | 2020 |
A distributed synchronous SGD algorithm with global top-k sparsification for low bandwidth networks S Shi, Q Wang, K Zhao, Z Tang, Y Wang, X Huang, X Chu 2019 IEEE 39th International Conference on Distributed Computing Systems …, 2019 | 136 | 2019 |
The impact of GPU DVFS on the energy and performance of deep learning: An empirical study Z Tang, Y Wang, Q Wang, X Chu Proceedings of the Tenth ACM International Conference on Future Energy …, 2019 | 70 | 2019 |
Benchmarking the performance and energy efficiency of AI accelerators for AI training Y Wang, Q Wang, S Shi, X He, Z Tang, K Zhao, X Chu 2020 20th IEEE/ACM International Symposium on Cluster, Cloud and Internet …, 2020 | 47 | 2020 |
Computer-aided clinical skin disease diagnosis using cnn and object detection models X He, S Wang, S Shi, Z Tang, Y Wang, Z Zhao, J Dai, R Ni, X Zhang, X Liu, ... 2019 IEEE International Conference on Big Data (Big Data), 4839-4844, 2019 | 12 | 2019 |
FusionAI: Decentralized Training and Deploying LLMs with Massive Consumer-Level GPUs Z Tang, Y Wang, X He, L Zhang, X Pan, Q Wang, R Zeng, K Zhao, S Shi, ... arXiv preprint arXiv:2309.01172, 2023 | 5 | 2023 |
Energy-efficient Inference Service of Transformer-based Deep Learning Models on GPUs Y Wang, Q Wang, X Chu 2020 International Conferences on IEEE Green Computing and Communications …, 2020 | 5 | 2020 |
FedML Parrot: A scalable federated learning system via heterogeneity-aware scheduling on sequential and hierarchical training Z Tang, X Chu, RY Ran, S Lee, S Shi, Y Zhang, Y Wang, AQ Liang, ... arXiv preprint arXiv:2303.01778, 2023 | 4 | 2023 |
NAS-LID: efficient neural architecture search with local intrinsic dimension X He, J Yao, Y Wang, Z Tang, KC Cheung, S See, B Han, X Chu Proceedings of the AAAI Conference on Artificial Intelligence 37 (6), 7839-7847, 2023 | 2 | 2023 |
Towards Efficient and Reliable LLM Serving: A Real-World Workload Study Y Wang, Y Chen, Z Li, Z Tang, R Guo, X Wang, Q Wang, AC Zhou, X Chu arXiv preprint arXiv:2401.17644, 2024 | 1 | 2024 |
Reliable and Efficient In-Memory Fault Tolerance of Large Language Model Pretraining Y Wang, S Shi, X He, Z Tang, X Pan, Y Zheng, X Wu, AC Zhou, B He, ... arXiv preprint arXiv:2310.12670, 2023 | 1 | 2023 |
Energy-efficient Online Scheduling of Transformer Inference Services on GPU Servers Y Wang, Q Wang, X Chu IEEE Transactions on Green Communications and Networking, 2022 | 1 | 2022 |