Environment Reconstruction with Hidden Confounders for Reinforcement Learning based Recommendation | W Shang, Y Yu, Q Li, Z Qin, Y Meng, J Ye | Proceedings of the 25th ACM SIGKDD International Conference on Knowledge …, 2019 | 41 | 2019 |
Offline model-based adaptable policy learning | XH Chen, Y Yu, Q Li, FM Luo, Z Qin, W Shang, J Ye | Advances in Neural Information Processing Systems 34, 8432-8443, 2021 | 14 | 2021 |
Partially observable environment estimation with uplift inference for reinforcement learning based recommendation | W Shang, Q Li, Z Qin, Y Yu, Y Meng, J Ye | Machine Learning 110 (9), 2603-2640, 2021 | 10 | 2021 |
Reinforcement Learning Method For Driver Incentives: Generative Adversarial Network For Driver-System Interactions | W Shang, Q Li, Z Qin, M Yiping, Y Yu, J Ye | US Patent App. 17/618,864, 2022 | 1 | 2022 |
Method and system for constructing virtual environment for ride-hailing platforms | W Shang, Q Li, Z Qin, J Ye, Y Yu, M Yiping | US Patent App. 17/058,407, 2022 | 1 | 2022 |
Sim2Rec: A Simulator-based Decision-making Approach to Optimize Real-World Long-term User Engagement in Sequential Recommender Systems | XH Chen, B He, Y Yu, Q Li, Z Qin, W Shang, J Ye, C Ma | arXiv preprint arXiv:2305.04832, 2023 | | 2023 |
Method and system for deep reinforcement learning and application at ride-hailing platform | Q Li, T Huang, W Shang, Z Qin | US Patent App. 17/242,089, 2022 | | 2022 |
Transportation bubbling at a ride-hailing platform and machine learning | W Shang, Q Li, Z Qin | US Patent App. 17/220,798, 2022 | | 2022 |
Systems and methods for simulating transportation order bubbling behavior | W Shang, Q Li, Z Qin | US Patent App. 17/124,704, 2022 | | 2022 |
Method and system for uplift prediction of actions | T Huang, Z Qin, Q Li, W Shang | US Patent App. 17/124,763, 2022 | | 2022 |
A Simulator-based Decision-Making Approach to Sequential Recommender Systems with Application in Ride-hailing Platform | XH Chen, Y Yu, Q Li, B He, Z Qin, W Shang, J Ye | | | 2018 |
Development and application of traffic flow information collecting and analysis system based on multi-type video | M Lu, W Shang, X Ji, M Hua, K Cheng | Sixth International Conference on Electronics and Information Engineering …, 2015 | | 2015 |
Offline Adaptive Policy Leaning in Real-World Sequential Recommendation Systems | XH Chen, Y Yu, Q Li, ZT Qin, W Shang, Y Meng, J Ye | | | |