| Title | Authors | Venue | Cited by | Year |
| --- | --- | --- | --- | --- |
| SpecInfer: Accelerating Generative LLM Serving with Speculative Inference and Token Tree Verification | X Miao, G Oliaro, Z Zhang, X Cheng, Z Wang, RYY Wong, Z Chen, ... | arXiv preprint arXiv:2305.09781, 2023 | 50 | 2023 |
| Quantized training of gradient boosting decision trees | Y Shi, G Ke, Z Chen, S Zheng, TY Liu | Advances in Neural Information Processing Systems 35, 18822-18833, 2022 | 10 | 2022 |
| Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding | Z Chen, A May, R Svirschevski, Y Huang, M Ryabinin, Z Jia, B Chen | arXiv preprint arXiv:2402.12374, 2024 | 1 | 2024 |
| GNNPipe: Accelerating Distributed Full-Graph GNN Training with Pipelined Model Parallelism | J Chen, Z Chen, X Qian | arXiv preprint arXiv:2308.10087, 2023 | 1 | 2023 |
| TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding | H Sun, Z Chen, X Yang, Y Tian, B Chen | arXiv preprint arXiv:2404.11912, 2024 | | 2024 |
| Quark: A Gradient-Free Quantum Learning Framework for Classification Tasks | Z Zhang, Z Chen, H Huang, Z Jia | | | 2022 |