关注
Jee W. Choi
Jee W. Choi
在 uoregon.edu 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Model-driven autotuning of sparse matrix-vector multiply on GPUs
JW Choi, A Singh, RW Vuduc
ACM sigplan notices 45 (5), 115-126, 2010
5392010
A roofline model of energy
JW Choi, D Bedard, R Fowler, R Vuduc
2013 IEEE 27th International Symposium on Parallel and Distributed …, 2013
1812013
On the limits of GPU acceleration
R Vuduc, A Chandramowlishwaran, J Choi, M Guney, A Shringarpure
Proceedings of the 2nd USENIX conference on Hot topics in parallelism 13 (0), 2010
1782010
FROSTT: The formidable repository of open sparse tensors and tools
S Smith, JW Choi, J Li, R Vuduc, J Park, X Liu, G Karypis
1172017
Algorithmic time, energy, and power on candidate HPC compute building blocks
J Choi, M Dukhan, X Liu, R Vuduc
2014 IEEE 28th international parallel and distributed processing symposium …, 2014
982014
Performance analysis and tuning for general purpose graphics processing units (GPGPU)
H Kim, RW Vuduc, S Baghsorkhi, J Choi, WH Wen-mei
Morgan & Claypool, 2012
632012
Model-driven sparse CP decomposition for higher-order tensors
J Li, J Choi, I Perros, J Sun, R Vuduc
2017 IEEE international parallel and distributed processing symposium (IPDPS …, 2017
602017
Sparse Matrix-Vector Multiplication on Multicore and Accelerators.
S Williams, N Bell, JW Choi, M Garland, L Oliker, R Vu
Scientific Computing with Multicore and Accelerators, 83-109, 2010
352010
On optimizing distributed tucker decomposition for dense tensors
VT Chakaravarthy, JW Choi, DJ Joseph, X Liu, P Murali, Y Sabharwal, ...
2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2017
322017
A CPU: GPU hybrid implementation and model-driven scheduling of the fast multipole method
J Choi, A Chandramowlishwaran, K Madduri, R Vuduc
Proceedings of Workshop on General Purpose Processing Using GPUs, 64-71, 2014
292014
High-performance dense tucker decomposition on GPU clusters
J Choi, X Liu, V Chakaravarthy
SC18: International Conference for High Performance Computing, Networking …, 2018
262018
Blocking optimization techniques for sparse tensor computation
J Choi, X Liu, S Smith, T Simon
2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2018
252018
Brief announcement: Towards a communication optimal fast multipole method and its implications at exascale
A Chandramowlishwaran, JW Choi, K Madduri, R Vuduc
Proceedings of the twenty-fourth annual ACM symposium on Parallelism in …, 2012
202012
On optimizing distributed tucker decomposition for sparse tensors
VT Chakaravarthy, JW Choi, DJ Joseph, P Murali, SS Pandian, ...
Proceedings of the 2018 International Conference on Supercomputing, 374-384, 2018
192018
High-performance lattice QCD for multi-core based parallel systems using a cache-friendly hybrid threaded-MPI approach
M Smelyanskiy, K Vaidyanathan, J Choi, B Joó, J Chhugani, MA Clark, ...
Proceedings of 2011 International Conference for High Performance Computing …, 2011
182011
A brief history and introduction to GPGPU
R Vuduc, J Choi
Modern Accelerator Technologies for Geographic Information Science, 9-23, 2013
162013
Alto: Adaptive linearized storage of sparse tensors
AE Helal, J Laukemann, F Checconi, JJ Tithi, T Ranadive, F Petrini, ...
Proceedings of the ACM International Conference on Supercomputing, 404-416, 2021
132021
How much (execution) time and energy does my algorithm cost?
JW Choi, RW Vuduc
XRDS: Crossroads, The ACM Magazine for Students 19 (3), 49-51, 2013
122013
Data analytics with nvlink: An spmv case study
D Buono, F Artico, F Checconi, JW Choi, X Que, L Schneidenbach
Proceedings of the Computing Frontiers Conference, 89-96, 2017
102017
Analyzing the energy efficiency of the fast multipole method using a DVFS-aware energy model
JW Choi, RW Vuduc
2016 IEEE International Parallel and Distributed Processing Symposium …, 2016
72016
系统目前无法执行此操作,请稍后再试。
文章 1–20