Follow
Denis Yarats
Denis Yarats
Cofounder and CTO, Perplexity AI
Verified email at perplexity.ai - Homepage
Title
Cited by
Cited by
Year
Convolutional sequence to sequence learning
J Gehring, M Auli, D Grangier, D Yarats, YN Dauphin
ICML 2017, 2017
33652017
Image augmentation is all you need: Regularizing deep reinforcement learning from pixels
D Yarats, I Kostrikov, R Fergus
ICLR 2021, 2020
413*2020
Deal or no deal? end-to-end learning for negotiation dialogues
M Lewis, D Yarats, YN Dauphin, D Parikh, D Batra
EMNLP 2017, 2017
3932017
Improving sample efficiency in model-free reinforcement learning from images
D Yarats, A Zhang, I Kostrikov, B Amos, J Pineau, R Fergus
AAAI 2021, 2019
2272019
Automatic data augmentation for generalization in deep reinforcement learning
R Raileanu, M Goldstein, D Yarats, I Kostrikov, R Fergus
NeurIPS 2021, 2020
113*2020
Quasi-hyperbolic momentum and adam for deep learning
J Ma, D Yarats
ICLR 2019, 2018
1072018
Reinforcement learning with prototypical representations
D Yarats, R Fergus, A Lazaric, L Pinto
ICML 2021, 2021
1052021
Generalized inner loop meta-learning
E Grefenstette, B Amos, D Yarats, PM Htut, A Molchanov, F Meier, D Kiela, ...
arXiv 2019, 2019
1052019
Mastering visual continuous control: Improved data-augmented reinforcement learning
D Yarats, R Fergus, A Lazaric, L Pinto
ICLR 2022, 2021
922021
Hierarchical text generation and planning for strategic dialogue
D Yarats, M Lewis
ICML 2018, 2018
502018
URLB: Unsupervised Reinforcement Learning Benchmark
M Laskin, D Yarats, H Liu, K Lee, A Zhan, K Lu, C Cang, L Pinto, P Abbeel
NeurIPS 2021, 2021
482021
Hierarchical decision making by generating and following natural language instructions
H Hu, D Yarats, Q Gong, Y Tian, M Lewis
NeurIPS 2019, 2019
472019
The differentiable cross-entropy method
B Amos, D Yarats
ICML 2020, 2020
452020
On the model-based stochastic value gradient for continuous reinforcement learning
B Amos, S Stanton, D Yarats, AG Wilson
L4DC 2021, 2020
402020
On the adequacy of untuned warmup for adaptive optimization
J Ma, D Yarats
AAAI 2021, 2019
392019
Soft actor-critic (sac) implementation in pytorch
D Yarats, I Kostrikov
https://github.com/denisyarats/pytorch_sac, 2020
382020
Don't change the algorithm, change the data: Exploratory data for offline reinforcement learning
D Yarats, D Brandfonbrener, H Liu, M Laskin, P Abbeel, A Lazaric, L Pinto
arXiv preprint arXiv:2201.13425, 2022
242022
Cic: Contrastive intrinsic control for unsupervised skill discovery
M Laskin, H Liu, XB Peng, D Yarats, A Rajeswaran, P Abbeel
arXiv preprint arXiv:2202.00161, 2022
192022
Learning navigation skills for legged robots with learned robot embeddings
J Truong, D Yarats, T Li, F Meier, S Chernova, D Batra, A Rai
IROS 2021, 2020
82020
A robot cluster for reproducible research in dexterous manipulation
S Bauer, F Widmaier, M WŁthrich, N Funk, JU De Jesus, J Peters, ...
arXiv preprint arXiv:2109.10957, 2021
5*2021
The system can't perform the operation now. Try again later.
Articles 1–20