{TC-GNN}: Bridging Sparse {GNN} Computation and Dense Tensor Cores on {GPUs} Y Wang, B Feng, Z Wang, G Huang, Y Ding 2023 USENIX Annual Technical Conference (USENIX ATC 23), 149-164, 2023 | 24 | 2023 |
{MGG}: Accelerating Graph Neural Networks with {Fine-Grained}{Intra-Kernel}{Communication-Computation} Pipelining on {Multi-GPU} Platforms Y Wang, B Feng, Z Wang, T Geng, K Barker, A Li, Y Ding 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2023 | 9 | 2023 |
EL-Rec: efficient large-scale recommendation model training via tensor-train embedding table Z Wang, Y Wang, B Feng, D Mudigere, B Muthiah, Y Ding SC22: International Conference for High Performance Computing, Networking …, 2022 | 5 | 2022 |
Uncertainty-aware attention graph neural network for defending adversarial attacks B Feng, Y Wang, Z Wang, Y Ding arXiv preprint arXiv:2009.10235, 2020 | 5 | 2020 |
ECSSD: Hardware/Data Layout Co-Designed In-Storage-Computing Architecture for Extreme Classification S Li, F Tu, L Liu, J Lin, Z Wang, Y Kang, Y Ding, Y Xie Proceedings of the 50th Annual International Symposium on Computer …, 2023 | 2 | 2023 |
Faith: An Efficient Framework for Transformer Verification on {GPUs} B Feng, T Tang, Y Wang, Z Chen, Z Wang, S Yang, Y Xie, Y Ding 2022 USENIX Annual Technical Conference (USENIX ATC 22), 167-182, 2022 | 1 | 2022 |
ZENO: A Type-based Optimization Framework for Zero Knowledge Neural Network Inference B Feng, Z Wang, Y Wang, S Yang, Y Ding Proceedings of the 29th ACM International Conference on Architectural …, 2024 | | 2024 |
RAP: Resource-aware Automated GPU Sharing for Multi-GPU Recommendation Model Training and Input Preprocessing Z Wang, Y Wang, J Deng, D Zheng, A Li, Y Ding Proceedings of the 29th ACM International Conference on Architectural …, 2024 | | 2024 |
GMI-DRL: Empowering Multi-GPU Deep Reinforcement Learning with GPU Spatial Multiplexing Y Wang, B Feng, Z Wang, T Geng, A Li, Y Ding arXiv preprint arXiv:2206.08482, 2022 | | 2022 |