Metadrive: Composing diverse driving scenarios for generalizable reinforcement learning Q Li, Z Peng, L Feng, Q Zhang, Z Xue, B Zhou
IEEE transactions on pattern analysis and machine intelligence 45 (3), 3461-3475, 2022
143 2022 Regret Minimization Experience Replay in Off-Policy Reinforcement Learning XH Liu, Z Xue, J Pang, S Jiang, F Xu, Y Yu
Advances in Neural Information Processing Systems 34, 17604-17615, 2021
32 2021 Two-Stage Constrained Actor-Critic for Short Video Recommendation Q Cai, Z Xue, C Zhang, W Xue, S Liu, R Zhan, X Wang, T Zuo, W Xie, ...
The Web Conference 2023 Research Track, 2023
19 2023 PrefRec: Preference-based Recommender Systems for Reinforcing Long-term User Engagement W Xue, Q Cai, Z Xue, S Sun, S Liu, D Zheng, P Jiang, B An
Proceedings of the 29th ACM SIGKDD International Conference on Knowledge …, 2023
17 * 2023 A Large Language Model Enhanced Conversational Recommender System Y Feng, S Liu, Z Xue, Q Cai, L Hu, P Jiang, K Gai, F Sun
arXiv preprint arXiv:2308.06212, 2023
12 2023 State Regularized Policy Optimization on Data with Dynamics Shift Z Xue, Q Cai, S Liu, D Zheng, P Jiang, K Gai, B An
Advances in Neural Information Processing Systems 36, 32926--32937, 2023
5 2023 Guarded Policy Optimization with Imperfect Online Demonstrations Z Xue, Z Peng, Q Li, Z Liu, B Zhou
The Eleventh International Conference on Learning Representations, 2023
3 2023 AdaRec: Adaptive Sequential Recommendation for Reinforcing Long-term User Engagement Z Xue, Q Cai, T Zuo, B Yang, L Hu, P Jiang, K Gai, B An
arXiv preprint arXiv:2310.03984, 2023
2 2023 AgentStudio: A Toolkit for Building General Virtual Agents L Zheng, Z Huang, Z Xue, X Wang, B An, S Yan
arXiv preprint arXiv:2403.17918, 2024
2024 : Energy-Based Reinforcement Learning with Stein Soft Actor CritcS Messaoud, B Mokeddem, Z Xue, B An, H Chen, S Chawla
The Twelfth International Conference on Learning Representations, 2024
2024