Toshinori Kitamura

Cited by

	All	Since 2019
Citations	12	12
h-index	2	2
i10-index	0	0

Public access

1 article available
0 articles not available

Based on funding mandates

Co-authors

Tadashi Kozuno; Omron Sinic X; Verified email at sinicx.com
Michal Valko; Llama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMind; Verified email at meta.com
Matthieu Geist; Cohere (ex Google, on leave of Professor, Université de Lorraine); Verified email at univ-lorraine.fr
Mohammad Gheshlaghi Azar; Cohere; Verified email at google.com
Rémi Munos; DeepMind; Verified email at inria.fr
Csaba Szepesvari; DeepMind & University of Alberta; Verified email at cs.ualberta.ca
Pierre Ménard; OvGU Magdeburg; Verified email at inria.fr
Wenhao Yang; Stanford University; Verified email at stanford.edu
Yunhao Tang; Research Scientist, DeepMind; Verified email at columbia.edu
Nino Vieillard; Google DeepMind; Verified email at google.com
Olivier Pietquin; Cohere | ex Google DeepMind (On leave - Professor at University of Lille); Verified email at univ-lille.fr
Jincheng Mei; Research Scientist, Google Brain; Verified email at google.com
Lingwei Zhu; University of Alberta; Verified email at ualberta.ca
Takamitsu Matsubara; Professor, Nara Institute of Science and Technology; Verified email at is.naist.jp
Wataru Kumagai; Omron Sinic X; Verified email at sinicx.com
Ryo Yonetani; Research Scientist at CyberAgent; Verified email at cyberagent.co.jp

Toshinori Kitamura

The University of Tokyo

Verified email at weblab.t.u-tokyo.ac.jp

Reinforcement Learning


Title	Cited by	Year
KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal. T Kozuno, W Yang, N Vieillard, T Kitamura, Y Tang, J Mei, P Ménard, ... arXiv preprint arXiv:2205.14211, 2022	5	2022
ShinRL: A Library for Evaluating RL Algorithms from Theoretical and Practical Perspectives. T Kitamura, R Yonetani. arXiv preprint arXiv:2112.04123, 2021	3	2021
Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice. T Kitamura, T Kozuno, Y Tang, N Vieillard, M Valko, W Yang, J Mei, ... International Conference on Machine Learning, 17135-17175, 2023	2	2023
Cautious Actor-Critic. L Zhu, T Kitamura, M Takamitsu. Asian Conference on Machine Learning, 220-235, 2021	1	2021
Geometric Value Iteration: Dynamic Error-Aware KL Regularization for Reinforcement Learning. T Kitamura, L Zhu, T Matsubara. Asian Conference on Machine Learning, 918-931, 2021	1	2021
A Policy Gradient Primal-Dual Algorithm for Constrained MDPs with Uniform PAC Guarantees. T Kitamura, T Kozuno, M Kato, Y Ichihara, S Nishimori, A Sannai, ... arXiv preprint arXiv:2401.17780, 2024		2024
Cautious policy programming: exploiting KL regularization for monotonic policy improvement in reinforcement learning. L Zhu, T Matsubara. Machine Learning 112 (11), 4527-4562, 2023		2023
Dynamic KL Regularization in Reinforcement Learning: Theoretical Error Propagation Analysis and an Algorithm. T Kitamura. Nara Institute of Science and Technology, 2022		2022
