FPGA-based accelerator for long short-term memory recurrent neural networks Y Guan, Z Yuan, G Sun, J Cong 2017 22nd Asia and South Pacific Design Automation Conference (ASP-DAC), 629-634, 2017 | 223 | 2017 |
PTQ4ViT: Post-training quantization for vision transformers with twin uniform quantization Z Yuan, C Xue, Y Chen, Q Wu, G Sun European Conference on Computer Vision, 191-207, 2022 | 50* | 2022 |
Reducing overfitting in deep convolutional neural networks using redundancy regularizer B Wu, Z Liu, Z Yuan, G Sun, C Wu Artificial Neural Networks and Machine Learning–ICANN 2017: 26th …, 2017 | 30 | 2017 |
S2DNAS: Transforming static CNN model for dynamic inference via neural architecture search Z Yuan, B Wu, G Sun, Z Liang, S Zhao, W Bi Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020 | 25 | 2020 |
Post-training quantization on diffusion models Y Shang, Z Yuan, B Xie, B Wu, Y Yan Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 19 | 2023 |
NAS4RRAM: neural network architecture search for inference on RRAM-based accelerators Z Yuan, J Liu, X Li, L Yan, H Chen, B Wu, Y Yang, G Sun Science China Information Sciences 64 (6), 160407, 2021 | 16 | 2021 |
Latency-aware Spatial-wise Dynamic Networks Y Han, Z Yuan, Y Pu, C Xue, S Song, G Sun, G Huang Advances in Neural Information Processing Systems 35, 36845-36857, 2022 | 14 | 2022 |
RPTQ: Reorder-based Post-training Quantization for Large Language Models Z Yuan, L Niu, J Liu, W Liu, X Wang, Y Shang, G Sun, Q Wu, J Wu, B Wu arXiv preprint arXiv:2304.01089, 2023 | 12 | 2023 |
PD-Quant: Post-training quantization based on prediction difference metric J Liu, L Niu, Z Yuan, D Yang, X Wang, W Liu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 9 | 2023 |
Using data compression for optimizing FPGA-based convolutional neural network accelerators Y Guan, N Xu, C Zhang, Z Yuan, J Cong International workshop on advanced parallel processing technologies, 14-26, 2017 | 9 | 2017 |
Crane: Mitigating accelerator under-utilization caused by sparsity irregularities in CNNs Y Guan, G Sun, Z Yuan, X Li, N Xu, S Chen, J Cong, Y Xie IEEE Transactions on Computers 69 (7), 931-943, 2020 | 7 | 2020 |
ENAS4D: Efficient multi-stage CNN architecture search for dynamic inference Z Yuan, X Liu, B Wu, G Sun arXiv preprint arXiv:2009.09182, 2020 | 6 | 2020 |
PTQ-SL: Exploring the sub-layerwise post-training quantization Z Yuan, Y Chen, C Xue, C Zhang, Q Wang, G Sun arXiv preprint arXiv:2110.07809, 2021 | 3 | 2021 |
Reconfigurable ASIC implementation of asynchronous recurrent neural networks S Nelson, SY Kim, J Di, Z Zhou, Z Yuan, G Sun 2021 27th IEEE International Symposium on Asynchronous Circuits and Systems …, 2021 | 3 | 2021 |
FD-CNN: A Frequency-Domain FPGA Acceleration Scheme for CNN-Based Image-Processing Applications X Wang, Z Zhou, Z Yuan, J Zhu, Y Cao, Y Zhang, K Sun, G Sun ACM Transactions on Embedded Computing Systems 22 (6), 1-30, 2023 | 2 | 2023 |
Enabling High-Quality Uncertainty Quantification in a PIM Designed for Bayesian Neural Network X Li, B Wu, G Sun, Z Zhang, Z Yuan, R Wang, R Huang, D Niu, H Zheng, ... 2022 IEEE International Symposium on High-Performance Computer Architecture …, 2022 | 2 | 2022 |
METRO: A software-hardware co-design of interconnections for spatial DNN accelerators Z Wang, G Sun, J Zhu, Z Zhou, Y Guo, Z Yuan arXiv preprint arXiv:2108.10570, 2021 | 2 | 2021 |
Latency-aware unified dynamic networks for efficient image recognition Y Han, Z Liu, Z Yuan, Y Pu, C Wang, S Song, G Huang arXiv preprint arXiv:2308.15949, 2023 | 1 | 2023 |
Tailor: removing redundant operations in memristive analog neural network accelerators X Li, Z Yuan, G Sun, L Zhao, Z Lu Proceedings of the 59th ACM/IEEE Design Automation Conference, 1009-1014, 2022 | 1 | 2022 |
Rapid configuration of asynchronous recurrent neural networks for ASIC implementations S Nelson, W Khalil, SY Kim, J Di, Z Zhou, Z Yuan, G Sun 2021 IEEE High Performance Extreme Computing Conference (HPEC), 1-6, 2021 | 1 | 2021 |