Prabhat Nagarajan
Title
Cited by
Year
Extrapolating beyond suboptimal demonstrations via inverse reinforcement learning from observations
D Brown, W Goo, P Nagarajan, S Niekum
International Conference on Machine Learning, 783-792, 2019
Cited by: 424, Year: 2019
ChainerRL: A Deep Reinforcement Learning Library
Y Fujita, P Nagarajan, T Kataoka, T Ishikawa
Journal of Machine Learning Research 22 (77), 1-14, 2021
Cited by: 151, Year: 2021
Deterministic Implementations for Reproducibility in Deep Reinforcement Learning
P Nagarajan, G Warnell, P Stone
AAAI 2019 Workshop on Reproducible AI, 2019
Cited by: 68, Year: 2019
The Impact of Nondeterminism on Reproducibility in Deep Reinforcement Learning
P Nagarajan, G Warnell, P Stone
2nd Reproducibility in Machine Learning Workshop at ICML 2018, Stockholm, Sweden, 2018
Cited by: 36, Year: 2018
Distributed Reinforcement Learning of Targeted Grasping with Active Vision for Mobile Manipulators
Y Fujita, K Uenishi, A Ummadisingu, P Nagarajan, S Masuda, MY Castro
2020 IEEE/RSJ International Conference on Intelligent Robots and Systems, 2020
Cited by: 25, Year: 2020
Learning Latent State Spaces for Planning through Reward Prediction
A Havens, Y Ouyang, P Nagarajan, Y Fujita
Workshop on Deep Reinforcement Learning at the 33rd Conference on Neural …, 2019
Cited by: 7, Year: 2019
Periodic Intra-Ensemble Knowledge Distillation for Reinforcement Learning
ZW Hong, P Nagarajan, G Maeda
European Conference on Machine Learning and Principles and Practice of …, 2021
Cited by: 4, Year: 2021
Reconnaissance for Reinforcement Learning with Safety Constraints
S Maeda, H Watahiki, Y Ouyang, S Okada, M Koyama, P Nagarajan
European Conference on Machine Learning and Principles and Practice of …, 2021
Cited by: 3, Year: 2021
When is Offline Policy Selection Sample Efficient for Reinforcement Learning?
V Liu, P Nagarajan, A Patterson, M White
arXiv preprint arXiv:2312.02355, 2023
Cited by: 1, Year: 2023
Swarm-inspired Reinforcement Learning via Collaborative Inter-agent Knowledge Distillation
ZW Hong, P Nagarajan, G Maeda
Workshop on Deep Reinforcement Learning at the 33rd Conference on Neural …, 2019
Year: 2019
Nondeterminism as a Reproducibility Challenge for Deep Reinforcement Learning
PM Nagarajan
The University of Texas at Austin, 2018
Year: 2018