Follow
Yangyang Zhao
Title
Cited by
Cited by
Year
Dynamic reward-based dueling deep dyna-q: Robust policy learning in noisy environments
Y Zhao, Z Wang, K Yin, R Zhang, Z Huang, P Wang
Proceedings of the AAAI Conference on Artificial Intelligence 34 (05), 9676-9684, 2020
202020
Automatic curriculum learning with over-repetition penalty for dialogue policy learning
Y Zhao, Z Wang, Z Huang
Proceedings of the AAAI Conference on Artificial Intelligence 35 (16), 14540 …, 2021
132021
Visualizing complex networks by leveraging community structures
Z Huang, J Wu, W Zhu, Z Wang, S Mehrotra, Y Zhao
Physica A: Statistical Mechanics and its Applications 565, 125506, 2021
112021
任务型对话系统研究综述
赵阳洋, 王振宇, 王佩, 杨添, 张睿, 尹凯
计算机学报 43 (10), 1862-1896, 2020
112020
Learning bi-directional social influence in information cascades using graph sequence attention networks
Z Huang, Z Wang, R Zhang, Y Zhao, F Zheng
Companion proceedings of the web conference 2020, 19-21, 2020
102020
Efficient dialogue complementary policy learning via deep q-network policy and episodic memory policy
Y Zhao, Z Wang, C Zhu, S Wang
Proceedings of the 2021 Conference on Empirical Methods in Natural Language …, 2021
92021
Emotion-sensitive deep dyna-Q learning for task-completion dialogue policy learning
R Zhang, Z Wang, M Zheng, Y Zhao, Z Huang
Neurocomputing 459, 122-130, 2021
82021
A Versatile Adaptive Curriculum Learning Framework for Task-oriented Dialogue Policy Learning
Y Zhao, H Qin, W Zhenyu, C Zhu, S Wang
Findings of the Association for Computational Linguistics: NAACL 2022, 711-723, 2022
22022
Network2Vec: Learning node representation based on space mapping in networks
Z Huang, Z Wang, R Zhang, Y Zhao, X Xie, S Mehrotra
2019 International Conference on Data Mining Workshops (ICDMW), 727-734, 2019
12019
Decomposed Deep Q-Network for Coherent Task-Oriented Dialogue Policy Learning
Y Zhao, K Yin, Z Wang, M Dastani, S Wang
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
2024
Rescue Conversations from Dead-ends: Efficient Exploration for Task-oriented Dialogue Policy Optimization
Y Zhao, Z Wang, M Dastani, S Wang
arXiv preprint arXiv:2305.03262, 2023
2023
The system can't perform the operation now. Try again later.
Articles 1–11