The utility of sparse representations for control in reinforcement learning V Liu, R Kumaraswamy, L Le, M White Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 4384-4391, 2019 | 44 | 2019 |
Attribute-aware recommender system based on collaborative filtering: Survey and classification WH Chen, CC Hsu, YA Lai, V Liu, MY Yeh, SD Lin Frontiers in Big Data 2, 49, 2020 | 14 | 2020 |
Investigating the properties of neural network representations in reinforcement learning H Wang, E Miahi, M White, MC Machado, Z Abbas, R Kumaraswamy, ... arXiv preprint arXiv:2203.15955, 2022 | 9 | 2022 |
Towards a practical measure of interference for reinforcement learning V Liu, A White, H Yao, M White arXiv preprint arXiv:2007.03807, 2020 | 7 | 2020 |
Attribute-aware collaborative filtering: survey and classification WH Chen, CC Hsu, YA Lai, V Liu, MY Yeh, SD Lin arXiv preprint arXiv:1810.08765, 2018 | 4 | 2018 |
Training recurrent neural networks online by learning explicit state variables S Nath, V Liu, A Chan, X Li, A White, M White International Conference on Learning Representations, 2020 | 3 | 2020 |
Measuring and mitigating interference in reinforcement learning V Liu, AM White, H Yao, M White | 2 | 2020 |
Sparse Representation Neural Networks for Online Reinforcement Learning V Liu | 2 | 2019 |
Exploiting Action Impact Regularity and Exogenous State Variables for Offline Reinforcement Learning V Liu, J Wright, M White arXiv preprint arXiv:2111.08066, 2021 | 1 | 2021 |
Incrementally Learning Functions of the Return B Bennett, W Chung, M Zaheer, V Liu arXiv preprint arXiv:1907.04651, 2019 | 1 | 2019 |
Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments V Liu, Y Chandak, P Thomas, M White arXiv preprint arXiv:2302.11725, 2023 | | 2023 |
No More Pesky Hyperparameters: Offline Hyperparameter Tuning for RL H Wang, A Sakhadeo, A White, J Bell, V Liu, X Zhao, P Liu, T Kozuno, ... arXiv preprint arXiv:2205.08716, 2022 | | 2022 |
A Value Function Basis for Nexting and Multi-step Prediction A Jacobsen, V Liu, R Shariff, A White, M White | | |