Jakob Uszkoreit
Inceptive
Verified email at uszkoreit.net
Title
Cited by
Year
Attention is all you need
A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ...
Advances in neural information processing systems 30, 2017
Cited by: 64611; Year: 2017
An image is worth 16x16 words: Transformers for image recognition at scale
A Dosovitskiy, L Beyer, A Kolesnikov, D Weissenborn, X Zhai, ...
arXiv preprint arXiv:2010.11929, 2020
Cited by: 11573; Year: 2020
Self-attention with relative position representations
P Shaw, J Uszkoreit, A Vaswani
arXiv preprint arXiv:1803.02155, 2018
Cited by: 1433; Year: 2018
A decomposable attention model for natural language inference
AP Parikh, O Täckström, D Das, J Uszkoreit
arXiv preprint arXiv:1606.01933, 2016
Cited by: 1374; Year: 2016
Image transformer
N Parmar, A Vaswani, J Uszkoreit, L Kaiser, N Shazeer, A Ku, D Tran
International conference on machine learning, 4055-4064, 2018
Cited by: 1210; Year: 2018
Natural questions: a benchmark for question answering research
T Kwiatkowski, J Palomaki, O Redfield, M Collins, A Parikh, C Alberti, ...
Transactions of the Association for Computational Linguistics 7, 453-466, 2019
Cited by: 1177; Year: 2019
MLP-Mixer: An all-MLP architecture for vision
IO Tolstikhin, N Houlsby, A Kolesnikov, L Beyer, X Zhai, T Unterthiner, ...
Advances in neural information processing systems 34, 24261-24272, 2021
Cited by: 851; Year: 2021
Universal transformers
M Dehghani, S Gouws, O Vinyals, J Uszkoreit, Ł Kaiser
arXiv preprint arXiv:1807.03819, 2018
Cited by: 618; Year: 2018
An image is worth 16x16 words: Transformers for image recognition at scale. arXiv 2020
A Dosovitskiy, L Beyer, A Kolesnikov, D Weissenborn, X Zhai, ...
arXiv preprint arXiv:2010.11929, 2020
Cited by: 573; Year: 2020
Music transformer
CZA Huang, A Vaswani, J Uszkoreit, N Shazeer, I Simon, C Hawthorne, ...
arXiv preprint arXiv:1809.04281, 2018
Cited by: 523; Year: 2018
Tensor2tensor for neural machine translation
A Vaswani, S Bengio, E Brevdo, F Chollet, AN Gomez, S Gouws, L Jones, ...
arXiv preprint arXiv:1803.07416, 2018
Cited by: 518; Year: 2018
Attention is all you need. arXiv 2017
A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ...
arXiv preprint arXiv:1706.03762, 2017
Cited by: 463; Year: 2017
One model to learn them all
L Kaiser, AN Gomez, N Shazeer, A Vaswani, N Parmar, L Jones, ...
arXiv preprint arXiv:1706.05137, 2017
Cited by: 331; Year: 2017
Object-centric learning with slot attention
F Locatello, D Weissenborn, T Unterthiner, A Mahendran, G Heigold, ...
Advances in Neural Information Processing Systems 33, 11525-11538, 2020
Cited by: 322; Year: 2020
Gomez Aidan N., Kaiser Łukasz, and Polosukhin Illia. 2017
V Ashish, S Noam, P Niki, U Jakob, J Llion
Attention is all you need. In Advances in neural information processing …, 2017
Cited by: 274; Year: 2017
Cross-lingual word clusters for direct transfer of linguistic structure
O Täckström, R McDonald, J Uszkoreit
The 2012 conference of the north american chapter of the association for …, 2012
Cited by: 253; Year: 2012
Attention is all you need (2017)
A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ...
arXiv preprint arXiv:1706.03762, 2019
Cited by: 251; Year: 2019
How to train your ViT? Data, augmentation, and regularization in vision transformers
A Steiner, A Kolesnikov, X Zhai, R Wightman, J Uszkoreit, L Beyer
arXiv preprint arXiv:2106.10270, 2021
Cited by: 187; Year: 2021
Insertion transformer: Flexible sequence generation via insertion operations
M Stern, W Chan, J Kiros, J Uszkoreit
International Conference on Machine Learning, 5976-5985, 2019
Cited by: 185; Year: 2019
Transforming machine translation: a deep learning system reaches news translation quality comparable to human professionals
M Popel, M Tomkova, J Tomek, Ł Kaiser, J Uszkoreit, O Bojar, ...
Nature communications 11 (1), 4381, 2020
Cited by: 162; Year: 2020