Bernardo Ávila Pires
Bernardo Ávila Pires
Research Scientist, DeepMind
Verified email at google.com
Title
Cited by
Cited by
Year
Bootstrap your own latent: A new approach to self-supervised learning
JB Grill, F Strub, F Altché, C Tallec, PH Richemond, E Buchatskaya, ...
arXiv preprint arXiv:2006.07733, 2020
6232020
Cost-sensitive multiclass classification risk bounds
BA Pires, C Szepesvari, M Ghavamzadeh
International Conference on Machine Learning, 1391-1399, 2013
452013
Neural predictive belief representations
ZD Guo, MG Azar, B Piot, BA Pires, R Munos
arXiv preprint arXiv:1811.06407, 2018
362018
Bootstrap latent-predictive representations for multitask reinforcement learning
ZD Guo, BA Pires, B Piot, JB Grill, F Altché, R Munos, MG Azar
International Conference on Machine Learning, 3875-3886, 2020
352020
Statistical linear estimation with penalized estimators: an application to reinforcement learning
BA Pires, C Szepesvári
arXiv preprint arXiv:1206.6444, 2012
272012
World discovery models
MG Azar, B Piot, BA Pires, JB Grill, F Altché, R Munos
arXiv preprint arXiv:1902.07685, 2019
182019
Policy error bounds for model-based reinforcement learning with factored linear models
BÁ Pires, C Szepesvári
Conference on Learning Theory, 121-151, 2016
182016
Multiclass classification calibration functions
BÁ Pires, C Szepesvári
arXiv preprint arXiv:1609.06385, 2016
132016
Pseudo-MDPs and factored linear action models
H Yao, C Szepesvári, BA Pires, X Zhang
2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement …, 2014
122014
Neural belief states for partially observed domains
P Moreno, J Humplik, G Papamakarios, BA Pires, L Buesing, N Heess, ...
NeurIPS 2018 workshop on Reinforcement Learning under Partial Observability, 2018
102018
k. kavukcuoglu, R
JB Grill, F Strub, F Altché, C Tallec, P Richemond, E Buchatskaya, ...
Munos, and M. Valko,“Bootstrap your own latent-a new approach to self …, 2020
72020
Statistical analysis of l1-penalized linear estimation with applications
B Ávila Pires
62012
Geometric entropic exploration
ZD Guo, MG Azar, A Saade, S Thakoor, B Piot, BA Pires, M Valko, ...
arXiv preprint arXiv:2101.02055, 2021
52021
Clause Identification Using Entropy Guided Transformation Learning
ER Fernandes, B Pires, CN dos Santos, RL Milidiú
Information and Human Language Technology (STIL), 2009 Seventh Brazilian …, 2009
52009
Neural Recursive Belief States in Multi-Agent Reinforcement Learning
P Moreno, E Hughes, KR McKee, BA Pires, T Weber
arXiv preprint arXiv:2102.02274, 2021
22021
Pathological effects of variance on classification-based policy iteration
BÁ Pires, C Szepesvári
Workshops at the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015
12015
Toward Practical Reinforcement Learning Algorithms: Classification Based Policy Iteration and Model-Based Learning
B Ávila Pires
2017
Using random projections to estimate condition numbers and solve linear systems CMPUT 501 Project
BA Pires
2012
CLASSIFICATION CALIBRATION
BÁ Pires
2012
The system can't perform the operation now. Try again later.
Articles 1–19