Reactive reinforcement learning in asynchronous environments JB Travnik, KW Mathewson, RS Sutton, PM Pilarski Frontiers in Robotics and AI 5, 79, 2018 | 33 | 2018 |
Representing high-dimensional data to intelligent prostheses and other wearable assistive robots: A first comparison of tile coding and selective Kanerva coding JB Travnik, PM Pilarski 2017 International Conference on Rehabilitation Robotics (ICORR), 1443-1450, 2017 | 24 | 2017 |
Tidbd: Adapting temporal-difference step-sizes through stochastic meta-descent A Kearney, V Veeriah, JB Travnik, RS Sutton, PM Pilarski arXiv preprint arXiv:1804.03334, 2018 | 17 | 2018 |
Learning feature relevance through step size adaptation in temporal-difference learning A Kearney, V Veeriah, J Travnik, PM Pilarski, RS Sutton arXiv preprint arXiv:1903.03252, 2019 | 11 | 2019 |
Reinforcement learning on resource bounded systems J Travnik | 2 | 2018 |