Title | Authors | Venue | Cited by | Year |
Delayed feedback in episodic reinforcement learning | B Howson, C Pike-Burke, S Filippi | arXiv preprint arXiv:2111.07615, 2021 | 11 | 2021 |
Delayed feedback in generalised linear bandits revisited | B Howson, C Pike-Burke, S Filippi | International Conference on Artificial Intelligence and Statistics, 6095-6119, 2023 | 8 | 2023 |
Optimism and delays in episodic reinforcement learning | B Howson, C Pike-Burke, S Filippi | International Conference on Artificial Intelligence and Statistics, 6061-6094, 2023 | 3 | 2023 |
DISCO: An End-to-End Bandit Framework for Personalised Discount Allocation | JS Zhang, B Howson, P Savva, E Loh | arXiv preprint arXiv:2406.06433, 2024 | | 2024 |
DISCO: An End-to-End Bandit Framework for Personalised Discount Allocation | J Shuo Zhang, B Howson, P Savva, E Loh | arXiv e-prints, arXiv: 2406.06433, 2024 | | 2024 |
Delayed Feedback in Generalised Linear Bandits | B Howson, C Pike-Burke, SL Filippi | Sixteenth European Workshop on Reinforcement Learning, 2023 | | 2023 |