Optimizing FPGA-based accelerator design for deep convolutional neural networks C Zhang, P Li, G Sun, Y Guan, B Xiao, J Cong Proceedings of the 2015 ACM/SIGDA international symposium on field …, 2015 | 2551 | 2015 |
FP-DNN: An automated framework for mapping deep neural networks onto FPGAs with RTL-HLS hybrid templates Y Guan, H Liang, N Xu, W Wang, S Shi, X Chen, G Sun, W Zhang, J Cong 2017 IEEE 25th Annual International Symposium on Field-Programmable Custom …, 2017 | 408 | 2017 |
FPGA-based accelerator for long short-term memory recurrent neural networks Y Guan, Z Yuan, G Sun, J Cong 2017 22nd Asia and South Pacific Design Automation Conference (ASP-DAC), 629-634, 2017 | 241 | 2017 |
184QPS/W 64Mb/mm23D Logic-to-DRAM Hybrid Bonding with Process-Near-Memory Engine for Recommendation System D Niu, S Li, Y Wang, W Han, Z Zhang, Y Guan, T Guan, F Sun, F Xue, ... 2022 IEEE International Solid-State Circuits Conference (ISSCC) 65, 1-3, 2022 | 63 | 2022 |
BlockGNN: Towards efficient GNN acceleration using block-circulant weight matrices Z Zhou, B Shi, Z Zhang, Y Guan, G Sun, G Luo 2021 58th ACM/IEEE Design Automation Conference (DAC), 1009-1014, 2021 | 39 | 2021 |
Hyperscale FPGA-as-a-service architecture for large-scale distributed graph neural network S Li, D Niu, Y Wang, W Han, Z Zhang, T Guan, Y Guan, H Liu, L Huang, ... Proceedings of the 49th Annual International Symposium on Computer …, 2022 | 24 | 2022 |
GNN-PIM: A processing-in-memory architecture for graph neural networks Z Wang, Y Guan, G Sun, D Niu, Y Wang, H Zheng, Y Han Advanced Computer Architecture: 13th Conference, ACA 2020, Kunming, China …, 2020 | 23 | 2020 |
Using data compression for optimizing FPGA-based convolutional neural network accelerators Y Guan, N Xu, C Zhang, Z Yuan, J Cong International workshop on advanced parallel processing technologies, 14-26, 2017 | 13 | 2017 |
PIMulator-NN: An event-driven, cross-level simulation framework for processing-in-memory-based neural network accelerators Q Zheng, X Li, Y Guan, Z Wang, Y Cai, Y Chen, G Sun, R Huang IEEE Transactions on Computer-Aided Design of Integrated Circuits and …, 2022 | 8 | 2022 |
Crane: mitigating accelerator under-utilization caused by sparsity irregularities in cnns Y Guan, G Sun, Z Yuan, X Li, N Xu, S Chen, J Cong, Y Xie IEEE Transactions on Computers 69 (7), 931-943, 2020 | 7 | 2020 |
Practical near-data-processing architecture for large-scale distributed graph neural network L Huang, Z Zhang, S Li, D Niu, Y Guan, H Zheng, Y Xie IEEE Access 10, 46796-46807, 2022 | 6 | 2022 |
OpSparse: a highly optimized framework for sparse general matrix multiplication on GPUs Z Du, Y Guan, T Guan, D Niu, L Huang, H Zheng, Y Xie IEEE Access 10, 85960-85974, 2022 | 5 | 2022 |
Computation unit, related apparatus, and method G Yijin, F Sun, LUO Junwen, H Li, W Bangyan, G Tianchan, Y Zhang US Patent App. 17/510,217, 2022 | 5 | 2022 |
Instruction processing apparatus, acceleration unit, and server G Yijin, F Sun, L Liang US Patent 11,789,733, 2023 | 2 | 2023 |
Accelerating cpu-based sparse general matrix multiplication with binary row merging Z Du, Y Guan, T Guan, D Niu, H Zheng, Y Xie IEEE Access 10, 79237-79248, 2022 | 2 | 2022 |
Flatfish: A Reinforcement Learning Approach for Application-Aware Address Mapping X Li, Z Yuan, Y Guan, G Sun, T Zhang, R Wei, D Niu IEEE Transactions on Computer-Aided Design of Integrated Circuits and …, 2022 | 2 | 2022 |
Computing system and memory sharing method for computing system G Tianchan, D Niu, G Yijin, H Zheng US Patent App. 18/322,954, 2024 | 1 | 2024 |
{HydraRPC}:{RPC} in the {CXL} Era T Ma, Z Liu, C Wei, J Huang, Y Zhuo, H Li, N Zhang, Y Guan, D Niu, ... 2024 USENIX Annual Technical Conference (USENIX ATC 24), 387-395, 2024 | 1 | 2024 |
Processing system that increases the memory capacity of a GPGPU Y Wang, D Niu, G Yijin, W Shengcheng, S Li, H Zheng US Patent 11,847,049, 2023 | 1 | 2023 |
Predicting the output structure of sparse matrix multiplication with sampled compression ratio Z Du, Y Guan, T Guan, D Niu, N Tan, X Yu, H Zheng, J Meng, X Yan, Y Xie 2022 IEEE 28th International Conference on Parallel and Distributed Systems …, 2023 | 1 | 2023 |