Accelerating numerical dense linear algebra calculations with GPUs J Dongarra, M Gates, A Haidar, J Kurzak, P Luszczek, S Tomov, ... Numerical computations with GPUs, 3-28, 2014 | 131 | 2014 |
Towards high performance digital volume correlation M Gates, J Lambros, MT Heath Experimental Mechanics 51 (4), 491-507, 2011 | 114 | 2011 |
A survey of numerical linear algebra methods utilizing mixed-precision arithmetic A Abdelfattah, H Anzt, EG Boman, E Carson, T Cojean, J Dongarra, A Fox, ... The International Journal of High Performance Computing Applications 35 (4 …, 2021 | 111 | 2021 |
Blockchain: Ultimate guide to understanding blockchain, bitcoin, cryptocurrencies, smart contracts and the future of money. M Gates CreateSpace Independent Publishing Platform, 2017 | 92 | 2017 |
The singular value decomposition: Anatomy of optimizing an algorithm for extreme scale J Dongarra, M Gates, A Haidar, J Kurzak, P Luszczek, S Tomov, ... SIAM review 60 (4), 808-865, 2018 | 89 | 2018 |
SLATE: Design of a modern distributed and accelerated linear algebra library M Gates, J Kurzak, A Charara, A YarKhan, J Dongarra Proceedings of the International Conference for High Performance Computing …, 2019 | 88 | 2019 |
Parallel programming models for dense linear algebra on heterogeneous systems J Dongarra, M Abalenkovs, A Abdelfattah, M Gates, A Haidar, J Kurzak, ... Supercomputing frontiers and innovations 2 (4), 67-86, 2015 | 62 | 2015 |
PLASMA: Parallel linear algebra software for multicore using OpenMP J Dongarra, M Gates, A Haidar, J Kurzak, P Luszczek, P Wu, I Yamazaki, ... ACM Transactions on Mathematical Software (TOMS) 45 (2), 1-35, 2019 | 60 | 2019 |
Preconditioned krylov solvers on GPUs H Anzt, M Gates, J Dongarra, M Kreutzer, G Wellein, M Köhler Parallel Computing 68, 32-44, 2017 | 50 | 2017 |
Implementation and tuning of batched Cholesky factorization and solve for NVIDIA GPUs J Kurzak, H Anzt, M Gates, J Dongarra IEEE Transactions on Parallel and Distributed Systems 27 (7), 2036-2048, 2015 | 47 | 2015 |
A survey of numerical methods utilizing mixed precision arithmetic A Abdelfattah, H Anzt, EG Boman, E Carson, T Cojean, J Dongarra, ... arXiv preprint arXiv:2007.06674, 2020 | 46 | 2020 |
With extreme computing, the rules have changed J Dongarra, S Tomov, P Luszczek, J Kurzak, M Gates, I Yamazaki, H Anzt, ... Computing in Science & Engineering 19 (3), 52-62, 2017 | 45 | 2017 |
High-performance hybrid CPU and GPU parallel algorithm for digital volume correlation M Gates, MT Heath, J Lambros The International Journal of High Performance Computing Applications 29 (1 …, 2015 | 43 | 2015 |
Hpc programming on intel many-integrated-core hardware with magma port to xeon phi J Dongarra, M Gates, A Haidar, Y Jia, K Kabir, P Luszczek, S Tomov Scientific Programming 2015, 9-9, 2015 | 43 | 2015 |
Accelerating collaborative filtering using concepts from high performance computing M Gates, H Anzt, J Kurzak, J Dongarra 2015 IEEE International Conference on Big Data (Big Data), 667-676, 2015 | 41 | 2015 |
A survey of recent developments in parallel implementations of Gaussian elimination S Donfack, J Dongarra, M Faverge, M Gates, J Kurzak, P Luszczek, ... Concurrency and Computation: Practice and Experience 27 (5), 1292-1309, 2015 | 40 | 2015 |
clMAGMA: High performance dense linear algebra with OpenCL C Cao, J Dongarra, P Du, M Gates, P Luszczek, S Tomov Proceedings of the International Workshop on OpenCL 2013 & 2014, 1-9, 2014 | 39 | 2014 |
Subset refinement for digital volume correlation: numerical and experimental applications M Gates, J Gonzalez, J Lambros, MT Heath Experimental Mechanics 55, 245-259, 2015 | 35 | 2015 |
A proposed API for batched basic linear algebra subprograms J Dongarra, I Duff, M Gates, A Haidar, S Hammarling, NJ Higham, J Hogg, ... Manchester Institute for Mathematical Sciences, University of Manchester, 2016 | 32 | 2016 |
Block-asynchronous multigrid smoothers for GPU-accelerated systems H Anzt, S Tomov, M Gates, J Dongarra, V Heuveline Procedia Computer Science 9, 7-16, 2012 | 31 | 2012 |