Qingpeng Cai

Cited by

	All	Since 2019
Citations	735	699
h-index	14	14
i10-index	17	16

Citations per year: 2017: 6; 2018: 29; 2019: 66; 2020: 81; 2021: 107; 2022: 117; 2023: 209; 2024: 119

Public access

14 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Peng Jiang: Kuaishou Technology (verified email at kuaishou.com)
Kun Gai: Senior Director & Researcher, Alibaba Group (verified email at taobao.com)
Ling Pan: Assistant Professor, Hong Kong University of Science and Technology (verified email at ust.hk)
Longbo Huang: Professor, IIIS @ Tsinghua University, China, ACM Distinguished Scientist (verified email at tsinghua.edu.cn)
Bo An: Nanyang Technological University (verified email at ntu.edu.sg)
Zhixuan Fang: Tsinghua University (verified email at mail.tsinghua.edu.cn)
Feiyang Pan: Institute of Computing Technology, Chinese Academy of Sciences (verified email at ict.ac.cn)
Qi Meng: Principal Researcher, Microsoft Research AI4Science (verified email at pku.edu.cn)
Anxiang Zeng (曾安祥): Nanyang Technological University (verified email at ntu.edu.sg)
Wei Chen (陈薇): Institute of Computing Technology, Chinese Academy of Sciences (verified email at ict.ac.cn)
Qing Da: Alibaba Group (verified email at alibaba-inc.com)
Yongfeng Zhang: Rutgers University, Computer Science (verified email at rutgers.edu)
Xiangyu Zhao: Assistant Professor, City University of Hong Kong (verified email at cityu.edu.hk)
Julian McAuley: Professor, UC San Diego (verified email at eng.ucsd.edu)


Qingpeng Cai

Kuaishou Technology

Verified email at mails.tsinghua.edu.cn

Reinforcement Learning, Mechanism Design, Recommender System, LLM


Title	Cited by	Year
A Deep Reinforcement Learning Framework for Rebalancing Dockless Bike Sharing Systems. L Pan, Q Cai, Z Fang, P Tang, L Huang	173	2020
Softmax Deep Double Deterministic Policy Gradients. L Pan, Q Cai, L Huang	82	2020
Reinforcement Mechanism Design for e-commerce. Q Cai, A Filos-Ratsikas, P Tang, Y Zhang. Proceedings of the 2018 World Wide Web Conference, 1339-1348, 2018	81	2018
Facility location with minimax envy. Q Cai, A Filos-Ratsikas, P Tang. IJCAI 2016, 137-143, 2016	45	2016
Reinforcement learning with dynamic Boltzmann softmax updates. L Pan, Q Cai, Q Meng, W Chen, L Huang	43	2019
Reinforcement mechanism design for fraudulent behaviour in e-commerce. Q Cai, A Filos-Ratsikas, P Tang, Y Zhang. Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018	39	2018
Policy gradients for contextual recommendations. F Pan, Q Cai, P Tang, F Zhuang, Q He. The World Wide Web Conference, 1421-1431, 2019	37	2019
Reinforcement Learning Driven Heuristic Optimization. Q Cai, W Hang, A Mirhoseini, G Tucker, J Wang, W Wei	35	2019
Two-Stage Constrained Actor-Critic for Short Video Recommendation. Q Cai, Z Xue, C Zhang, W Xue, S Liu, R Zhan, X Wang, T Zuo, W Xie, ...	26*	2023
Reinforcing User Retention in a Billion Scale Short Video Recommender System. Q Cai, S Liu, X Wang, T Zuo, W Xie, B Yang, D Zheng, P Jiang, K Gai	23	2023
ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor. W Xue, Q Cai, R Zhan, D Zheng, P Jiang, B An	19	2023
Multi-Task Recommendations with Reinforcement Learning. Z Liu, J Tian, Q Cai, X Zhao, J Gao, S Liu, D Chen, T He, D Zheng, P Jiang, ...	18	2023
PrefRec: Recommender Systems with Human Preferences for Reinforcing Long-term User Engagement. W Xue, Q Cai, Z Xue, S Sun, S Liu, D Zheng, P Jiang, K Gai, B An	17*	2023
Policy optimization with model-based explorations. F Pan, Q Cai, AX Zeng, CX Pan, Q Da, H He, Q He, P Tang. Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 4675-4682, 2019	15	2019
Exploration and Regularization of the Latent Action Space in Recommendation. S Liu, Q Cai, B Sun, Y Wang, J Jiang, D Zheng, K Gai, P Jiang, X Zhao, ...	13	2023
Mechanism design for personalized recommender systems. Q Cai, A Filos-Ratsikas, C Liu, P Tang. Proceedings of the 10th ACM Conference on Recommender Systems, 159-166, 2016	13	2016
A large language model enhanced conversational recommender system. Y Feng, S Liu, Z Xue, Q Cai, L Hu, P Jiang, K Gai, F Sun. arXiv preprint arXiv:2308.06212, 2023	12	2023
Generator and critic: A deep reinforcement learning approach for slate re-ranking in e-commerce. J Wei, A Zeng, Y Wu, P Guo, Q Hua, Q Cai. arXiv preprint arXiv:2005.12206, 2020	9	2020
KuaiSim: A Comprehensive Simulator for Recommender Systems. K Zhao, S Liu, Q Cai, X Zhao, Z Liu, D Zheng, P Jiang, K Gai	7	2023
Generative Flow Network for Listwise Recommendation. S Liu, Q Cai, Z He, B Sun, J McAuley, D Zheng, P Jiang, K Gai	7	2023
