Google’s neural machine translation system: Bridging the gap between human and machine translation Y Wu arXiv preprint arXiv:1609.08144, 2016 | 8957 | 2016 |
In-datacenter performance analysis of a tensor processing unit NP Jouppi, C Young, N Patil, D Patterson, G Agrawal, R Bajwa, S Bates, ... Proceedings of the 44th annual international symposium on computer …, 2017 | 5613 | 2017 |
Anton, a special-purpose machine for molecular dynamics simulation DE Shaw, MM Deneroff, RO Dror, JS Kuskin, RH Larson, JK Salmon, ... Communications of the ACM 51 (7), 91-97, 2008 | 983 | 2008 |
Anton 2: raising the bar for performance and programmability in a special-purpose molecular dynamics supercomputer DE Shaw, JP Grossman, JA Bank, B Batson, JA Butts, JC Chao, ... SC'14: Proceedings of the International Conference for High Performance …, 2014 | 731 | 2014 |
Millisecond-scale molecular dynamics simulations on Anton DE Shaw, RO Dror, JK Salmon, JP Grossman, KM Mackenzie, JA Bank, ... Proceedings of the conference on high performance computing networking …, 2009 | 690 | 2009 |
Embedded computing: a VLIW approach to architecture, compilers and tools JA Fisher, P Faraboschi, C Young Elsevier, 2005 | 526 | 2005 |
Mesh-tensorflow: Deep learning for supercomputers N Shazeer, Y Cheng, N Parmar, D Tran, A Vaswani, P Koanantakool, ... Advances in neural information processing systems 31, 2018 | 409 | 2018 |
Anton, a special-purpose machine for molecular dynamics simulation DE Shaw, MM Deneroff, RO Dror, JS Kuskin, RH Larson, JK Salmon, ... ACM SIGARCH Computer Architecture News 35 (2), 1-12, 2007 | 366 | 2007 |
Ten lessons from three generations shaped google’s tpuv4i: Industrial product NP Jouppi, DH Yoon, M Ashcraft, M Gottscho, TB Jablin, G Kurian, ... 2021 ACM/IEEE 48th Annual International Symposium on Computer Architecture …, 2021 | 357 | 2021 |
Mlperf training benchmark P Mattson, C Cheng, G Diamos, C Coleman, P Micikevicius, D Patterson, ... Proceedings of Machine Learning and Systems 2, 336-349, 2020 | 331 | 2020 |
A domain-specific supercomputer for training deep neural networks NP Jouppi, DH Yoon, G Kurian, S Li, N Patil, J Laudon, C Young, ... Communications of the ACM 63 (7), 67-78, 2020 | 303 | 2020 |
Motivation for and evaluation of the first tensor processing unit N Jouppi, C Young, N Patil, D Patterson ieee Micro 38 (3), 10-19, 2018 | 292 | 2018 |
Measurements of differential cross-sections of highly boosted top quarks decaying to all-hadronic final states in collisions at using the ATLAS … M Aaboud, G Aad, B Abbott, O Abdinov, B Abeloos, SH Abidi, ... Physical Review D 98 (1), 012003, 2018 | 271 | 2018 |
Sparse gpu kernels for deep learning T Gale, M Zaharia, C Young, E Elsen SC20: International Conference for High Performance Computing, Networking …, 2020 | 243 | 2020 |
A new golden age in computer architecture: Empowering the machine-learning revolution J Dean, D Patterson, C Young IEEE Micro 38 (2), 21-29, 2018 | 236 | 2018 |
A comparative analysis of schemes for correlated branch prediction C Young, N Gloy, MD Smith ACM SIGARCH Computer Architecture News 23 (2), 276-286, 1995 | 220 | 1995 |
Tpu v4: An optically reconfigurable supercomputer for machine learning with hardware support for embeddings N Jouppi, G Kurian, S Li, P Ma, R Nagarajan, L Nai, N Patil, ... Proceedings of the 50th Annual International Symposium on Computer …, 2023 | 218 | 2023 |
A domain-specific architecture for deep neural networks NP Jouppi, C Young, N Patil, D Patterson Communications of the ACM 61 (9), 50-59, 2018 | 205 | 2018 |
Search for a heavy charged boson in events with a charged lepton and missing transverse momentum from collisions at with the ATLAS detector G Aad, B Abbott, DC Abbott, O Abdinov, A Abed Abud, K Abeling, ... Physical review D 100 (5), 052013, 2019 | 192 | 2019 |
Improving the accuracy of static branch prediction using branch correlation C Young, MD Smith ACM SIGOPS Operating Systems Review 28 (5), 232-241, 1994 | 168 | 1994 |