Qingpeng Cai
Kuaishou Technology
Verified email at mails.tsinghua.edu.cn
Title
Cited by
Year
A deep reinforcement learning framework for rebalancing dockless bike sharing systems
L Pan, Q Cai, Z Fang, P Tang, L Huang
Proceedings of the AAAI conference on artificial intelligence 33 (01), 1393-1400, 2019
Cited by: 187; Year: 2019
Softmax deep double deterministic policy gradients
L Pan, Q Cai, L Huang
Advances in neural information processing systems 33, 11767-11777, 2020
Cited by: 100; Year: 2020
Reinforcement Mechanism Design for e-commerce
Q Cai, A Filos-Ratsikas, P Tang, Y Zhang
Proceedings of the 2018 World Wide Web Conference, 1339-1348, 2018
Cited by: 88; Year: 2018
Reinforcement learning with dynamic boltzmann softmax updates
L Pan, Q Cai, Q Meng, W Chen, L Huang, TY Liu
IJCAI-2020, 2019
Cited by: 48; Year: 2019
Facility location with minimax envy
Q Cai, A Filos-Ratsikas, P Tang
IJCAI 2016, 137-143, 2016
Cited by: 46; Year: 2016
Policy gradients for contextual recommendations
F Pan, Q Cai, P Tang, F Zhuang, Q He
The World Wide Web Conference, 1421-1431, 2019
Cited by: 44; Year: 2019
Reinforcement mechanism design for fraudulent behaviour in e-commerce
Q Cai, A Filos-Ratsikas, P Tang, Y Zhang
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
Cited by: 42; Year: 2018
Two-stage constrained actor-critic for short video recommendation
Q Cai, Z Xue, C Zhang, W Xue, S Liu, R Zhan, X Wang, T Zuo, W Xie, ...
Proceedings of the ACM Web Conference 2023, 865-875, 2023
Cited by: 40*; Year: 2023
Reinforcement Learning Driven Heuristic Optimization
Q Cai, W Hang, A Mirhoseini, G Tucker, J Wang, W Wei
DRL4KDD-2019, 2019
Cited by: 36; Year: 2019
Reinforcing user retention in a billion scale short video recommender system
Q Cai, S Liu, X Wang, T Zuo, W Xie, B Yang, D Zheng, P Jiang, K Gai
Companion Proceedings of the ACM Web Conference 2023, 421-426, 2023
Cited by: 34; Year: 2023
Multi-task recommendations with reinforcement learning
Z Liu, J Tian, Q Cai, X Zhao, J Gao, S Liu, D Chen, T He, D Zheng, P Jiang, ...
Proceedings of the ACM Web Conference 2023, 1273-1282, 2023
Cited by: 33; Year: 2023
A large language model enhanced conversational recommender system
Y Feng, S Liu, Z Xue, Q Cai, L Hu, P Jiang, K Gai, F Sun
arXiv preprint arXiv:2308.06212, 2023
Cited by: 29; Year: 2023
ResAct: Reinforcing long-term engagement in sequential recommendation with residual actor
W Xue, Q Cai, R Zhan, D Zheng, P Jiang, K Gai, B An
ICLR-2023, 2022
Cited by: 25; Year: 2022
PrefRec: recommender systems with human preferences for reinforcing long-term user engagement
W Xue, Q Cai, Z Xue, S Sun, S Liu, D Zheng, P Jiang, K Gai, B An
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and …, 2023
Cited by: 22*; Year: 2023
Exploration and regularization of the latent action space in recommendation
S Liu, Q Cai, B Sun, Y Wang, J Jiang, D Zheng, P Jiang, K Gai, X Zhao, ...
Proceedings of the ACM Web Conference 2023, 833-844, 2023
Cited by: 21; Year: 2023
KuaiSim: A comprehensive simulator for recommender systems
K Zhao, S Liu, Q Cai, X Zhao, Z Liu, D Zheng, P Jiang, K Gai
Advances in Neural Information Processing Systems 36, 44880-44897, 2023
Cited by: 18; Year: 2023
Policy optimization with model-based explorations
F Pan, Q Cai, AX Zeng, CX Pan, Q Da, H He, Q He, P Tang
Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 4675-4682, 2019
Cited by: 16; Year: 2019
Mechanism design for personalized recommender systems
Q Cai, A Filos-Ratsikas, C Liu, P Tang
Proceedings of the 10th ACM Conference on Recommender Systems, 159-166, 2016
Cited by: 15; Year: 2016
Generative flow network for listwise recommendation
S Liu, Q Cai, Z He, B Sun, J McAuley, D Zheng, P Jiang, K Gai
Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and …, 2023
Cited by: 14; Year: 2023
State regularized policy optimization on data with dynamics shift
Z Xue, Q Cai, S Liu, D Zheng, P Jiang, K Gai, B An
Advances in neural information processing systems 36, 2024
Cited by: 10; Year: 2024