Follow
Qingfeng Lan
Title
Cited by
Cited by
Year
Maxmin Q-learning: Controlling the estimation bias of Q-learning
Q Lan, Y Pan, A Fyshe, M White
International Conference on Learning Representations, 2020
1632020
A deep top-k relevance matching model for ad-hoc retrieval
Z Yang, Q Lan, J Guo, Y Fan, X Zhu, Y Lan, Y Wang, X Cheng
Information Retrieval: 24th China Conference, CCIR 2018, Guilin, China …, 2018
162018
Variational quantum soft actor-critic
Q Lan
arXiv preprint arXiv:2112.11921, 2021
152021
Model-free Policy Learning with Reward Gradients
Q Lan, S Tosatto, H Farrahi, AR Mahmood
The 25th International Conference on Artificial Intelligence and Statistics …, 2022
82022
Reducing selection bias in counterfactual reasoning for individual treatment effects estimation
Z Zhang, Q Lan, L Ding, Y Wang, N Hassanpour, R Greiner
NeurIPS 2019 CausalML Workshop, 2019
82019
Memory-efficient reinforcement learning with value-based knowledge consolidation
Q Lan, Y Pan, J Luo, AR Mahmood
Transactions on Machine Learning Research, 2023
7*2023
Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo
H Ishfaq*, Q Lan*, P Xu, AR Mahmood, D Precup, A Anandkumar, ...
International Conference on Learning Representations, 2024
52024
A PyTorch Reinforcement Learning Framework for Exploring New Ideas
Q Lan
https://github.com/qlan3/Explorer, 2019
52019
Learning to Optimize for Reinforcement Learning
Q Lan, AR Mahmood, S Yan, Z Xu
arXiv preprint arXiv:2302.01470, 2023
42023
Overcoming policy collapse in deep reinforcement learning
S Dohare, Q Lan, AR Mahmood
Sixteenth European Workshop on Reinforcement Learning, 2023
32023
Elephant Neural Networks: Born to Be a Continual Learner
Q Lan, AR Mahmood
ICML Workshop on High-dimensional Learning Dynamics, 2023
12023
Predictive Representation Learning for Language Modeling
Q Lan, L Kumar, M White, A Fyshe
arXiv preprint arXiv:2105.14214, 2021
2021
The system can't perform the operation now. Try again later.
Articles 1–12