Yijin Guan
Unknown affiliation
Verified email at pku.edu.cn
Title
Cited by
Year
Optimizing FPGA-based accelerator design for deep convolutional neural networks
C Zhang, P Li, G Sun, Y Guan, B Xiao, J Cong
Proceedings of the 2015 ACM/SIGDA international symposium on field …, 2015
Cited by: 2551; Year: 2015
FP-DNN: An automated framework for mapping deep neural networks onto FPGAs with RTL-HLS hybrid templates
Y Guan, H Liang, N Xu, W Wang, S Shi, X Chen, G Sun, W Zhang, J Cong
2017 IEEE 25th Annual International Symposium on Field-Programmable Custom …, 2017
Cited by: 408; Year: 2017
FPGA-based accelerator for long short-term memory recurrent neural networks
Y Guan, Z Yuan, G Sun, J Cong
2017 22nd Asia and South Pacific Design Automation Conference (ASP-DAC), 629-634, 2017
Cited by: 241; Year: 2017
184QPS/W 64Mb/mm² 3D Logic-to-DRAM Hybrid Bonding with Process-Near-Memory Engine for Recommendation System
D Niu, S Li, Y Wang, W Han, Z Zhang, Y Guan, T Guan, F Sun, F Xue, ...
2022 IEEE International Solid-State Circuits Conference (ISSCC) 65, 1-3, 2022
Cited by: 63; Year: 2022
BlockGNN: Towards efficient GNN acceleration using block-circulant weight matrices
Z Zhou, B Shi, Z Zhang, Y Guan, G Sun, G Luo
2021 58th ACM/IEEE Design Automation Conference (DAC), 1009-1014, 2021
Cited by: 39; Year: 2021
Hyperscale FPGA-as-a-service architecture for large-scale distributed graph neural network
S Li, D Niu, Y Wang, W Han, Z Zhang, T Guan, Y Guan, H Liu, L Huang, ...
Proceedings of the 49th Annual International Symposium on Computer …, 2022
Cited by: 24; Year: 2022
GNN-PIM: A processing-in-memory architecture for graph neural networks
Z Wang, Y Guan, G Sun, D Niu, Y Wang, H Zheng, Y Han
Advanced Computer Architecture: 13th Conference, ACA 2020, Kunming, China …, 2020
Cited by: 23; Year: 2020
Using data compression for optimizing FPGA-based convolutional neural network accelerators
Y Guan, N Xu, C Zhang, Z Yuan, J Cong
International workshop on advanced parallel processing technologies, 14-26, 2017
Cited by: 13; Year: 2017
PIMulator-NN: An event-driven, cross-level simulation framework for processing-in-memory-based neural network accelerators
Q Zheng, X Li, Y Guan, Z Wang, Y Cai, Y Chen, G Sun, R Huang
IEEE Transactions on Computer-Aided Design of Integrated Circuits and …, 2022
Cited by: 8; Year: 2022
Crane: mitigating accelerator under-utilization caused by sparsity irregularities in CNNs
Y Guan, G Sun, Z Yuan, X Li, N Xu, S Chen, J Cong, Y Xie
IEEE Transactions on Computers 69 (7), 931-943, 2020
Cited by: 7; Year: 2020
Practical near-data-processing architecture for large-scale distributed graph neural network
L Huang, Z Zhang, S Li, D Niu, Y Guan, H Zheng, Y Xie
IEEE Access 10, 46796-46807, 2022
Cited by: 6; Year: 2022
OpSparse: a highly optimized framework for sparse general matrix multiplication on GPUs
Z Du, Y Guan, T Guan, D Niu, L Huang, H Zheng, Y Xie
IEEE Access 10, 85960-85974, 2022
Cited by: 5; Year: 2022
Computation unit, related apparatus, and method
G Yijin, F Sun, LUO Junwen, H Li, W Bangyan, G Tianchan, Y Zhang
US Patent App. 17/510,217, 2022
Cited by: 5; Year: 2022
Instruction processing apparatus, acceleration unit, and server
G Yijin, F Sun, L Liang
US Patent 11,789,733, 2023
Cited by: 2; Year: 2023
Accelerating CPU-based sparse general matrix multiplication with binary row merging
Z Du, Y Guan, T Guan, D Niu, H Zheng, Y Xie
IEEE Access 10, 79237-79248, 2022
Cited by: 2; Year: 2022
Flatfish: A Reinforcement Learning Approach for Application-Aware Address Mapping
X Li, Z Yuan, Y Guan, G Sun, T Zhang, R Wei, D Niu
IEEE Transactions on Computer-Aided Design of Integrated Circuits and …, 2022
Cited by: 2; Year: 2022
Computing system and memory sharing method for computing system
G Tianchan, D Niu, G Yijin, H Zheng
US Patent App. 18/322,954, 2024
Cited by: 1; Year: 2024
HydraRPC: RPC in the CXL Era
T Ma, Z Liu, C Wei, J Huang, Y Zhuo, H Li, N Zhang, Y Guan, D Niu, ...
2024 USENIX Annual Technical Conference (USENIX ATC 24), 387-395, 2024
Cited by: 1; Year: 2024
Processing system that increases the memory capacity of a GPGPU
Y Wang, D Niu, G Yijin, W Shengcheng, S Li, H Zheng
US Patent 11,847,049, 2023
Cited by: 1; Year: 2023
Predicting the output structure of sparse matrix multiplication with sampled compression ratio
Z Du, Y Guan, T Guan, D Niu, N Tan, X Yu, H Zheng, J Meng, X Yan, Y Xie
2022 IEEE 28th International Conference on Parallel and Distributed Systems …, 2023
Cited by: 1; Year: 2023