Exploring the limits of transfer learning with a unified text-to-text transformer C Raffel, N Shazeer, A Roberts, K Lee, S Narang, M Matena, Y Zhou, W Li, ... The Journal of Machine Learning Research 21 (1), 5485-5551, 2020 | 8034 | 2020 |
Deep Voice 2: Multi-Speaker Neural Text-to-Speech YZ Sercan Arik, Gregory Diamos, Andrew Gibiansky, John Miller, Kainan Peng ... Neural Information Processing Systems (NIPS), 2017 | 534* | 2017 |
Deep learning scaling is predictable, empirically J Hestness, S Narang, N Ardalani, G Diamos, H Jun, H Kianinejad, ... arXiv preprint arXiv:1712.00409, 2017 | 426 | 2017 |
LaMDA: Language models for dialog applications R Thoppilan, D De Freitas, J Hall, N Shazeer, A Kulshreshtha, HT Cheng, ... arXiv preprint arXiv:2201.08239, 2022 | 353 | 2022 |
Neural voice cloning with a few samples S Arik, J Chen, K Peng, W Ping, Y Zhou Advances in neural information processing systems 31, 2018 | 328 | 2018 |
OpenPiton: An open source manycore research framework J Balkind, M McKeown, Y Fu, T Nguyen, Y Zhou, A Lavrov, M Shahrad, ... ACM SIGPLAN Notices 51 (4), 217-232, 2016 | 210 | 2016 |
GLaM: Efficient scaling of language models with mixture-of-experts N Du, Y Huang, AM Dai, S Tong, D Lepikhin, Y Xu, M Krikun, Y Zhou, ... International Conference on Machine Learning, 5547-5569, 2022 | 121 | 2022 |
Atomic In-place Updates for Non-volatile Main Memories with Kamino-Tx A Memaripour, A Badam, A Phanishayee, Y Zhou, R Alagappan, ... EuroSys '17 Proceedings of the Twelfth European Conference on Computer …, 2017 | 113 | 2017 |
Exploring the limits of transfer learning with a unified text-to-text transformer (2019) C Raffel, N Shazeer, A Roberts, K Lee, S Narang, M Matena, Y Zhou, W Li, ... arXiv preprint arXiv:1910.10683, 2021 | 82 | 2021 |
Resource-efficient neural architect Y Zhou, S Ebrahimi, SÖ Arık, H Yu, H Liu, G Diamos arXiv preprint arXiv:1806.07912, 2018 | 65 | 2018 |
MITTS: Memory inter-arrival time traffic shaping Y Zhou, D Wentzlaff ACM SIGARCH Computer Architecture News 44 (3), 532-544, 2016 | 57 | 2016 |
Renelito Delos Santos R Thoppilan, D De Freitas, J Hall, N Shazeer, A Kulshreshtha, HT Cheng, ... | 56 | 2022 |
Do transformer modifications transfer across implementations and applications? S Narang, HW Chung, Y Tay, W Fedus, T Fevry, M Matena, K Malkan, ... arXiv preprint arXiv:2102.11972, 2021 | 55 | 2021 |
A learned performance model for tensor processing units S Kaufman, P Phothilimthana, Y Zhou, C Mendis, S Roy, A Sabne, ... Proceedings of Machine Learning and Systems 3, 387-400, 2021 | 44 | 2021 |
Power and Energy Characterization of an Open Source 25-Core Manycore Processor M McKeown, A Lavrov, M Shahrad, PJ Jackson, Y Fu, J Balkind, ... HPCA, 762-775, 2018 | 44 | 2018 |
Transferable graph optimizers for ml compilers Y Zhou, S Roy, A Abdolrashidi, D Wong, P Ma, Q Xu, H Liu, ... Advances in Neural Information Processing Systems 33, 13844-13855, 2020 | 43 | 2020 |
GLaM: Efficient scaling of … N Du, Y Huang, AM Dai, S Tong, D Lepikhin, Y Xu, M Krikun, Y Zhou, ..., Toju Duke, Lucas Dixon, Kun Zhang, Quoc V Le, Yonghui Wu, Zhifeng Chen, and Claire Cui 2021 | 42 | 2021 |
CASH: Supporting IaaS customers with a sub-core configurable architecture Y Zhou, H Hoffmann, D Wentzlaff ACM SIGARCH Computer Architecture News 44 (3), 682-694, 2016 | 42 | 2016 |
The sharing architecture: sub-core configurability for IaaS clouds Y Zhou, D Wentzlaff ACM SIGPLAN Notices 49 (4), 559-574, 2014 | 34 | 2014 |
Camouflage: Memory traffic shaping to mitigate timing attacks Y Zhou, S Wagh, P Mittal, D Wentzlaff 2017 IEEE International Symposium on High Performance Computer Architecture …, 2017 | 33 | 2017 |