Rui Yang

Cytowane przez

	Wszystkie	Od 2019
Cytowania	244	244
h-indeks	8	8
i10-indeks	7	7

120

202120222023202415 38 113 77

Dostęp publiczny

Wyświetl wszystko

1 artykuł

0 artykułów

dostępne

niedostępne

Objęte finansowaniem

Współautorzy

Lei HanTencent Robotics X & Tencent AI LabZweryfikowany adres z tencent.com
Xiu LITsinghua UniversityZweryfikowany adres z sz.tsinghua.edu.cn
Chongjie zhangWashington University in St. LouisZweryfikowany adres z wustl.edu
Xiaoteng Ma（马骁腾）Center for Intelligent and Networked Systems, Dept. Automation, Tsinghua University, Beijing, ChinaZweryfikowany adres z mails.tsinghua.edu.cn
Lanqing LiZhejiang LabZweryfikowany adres z zhejianglab.com
Hao SunPhD Candidate, DAMTP, University of CambridgeZweryfikowany adres z cam.ac.uk
Meng FangUniversity of LiverpoolZweryfikowany adres z liverpool.ac.uk
Yali DuTuring Fellow, Assistant professor, King's College LondonZweryfikowany adres z kcl.ac.uk
Chenjia BaiShanghai AI LaboratoryZweryfikowany adres z pjlab.org.cn
Zhaoran WangAssistant Professor at Northwestern UniversityZweryfikowany adres z northwestern.edu
Tong ZhangHKUSTZweryfikowany adres z tongzhang-ml.org
Shuang QiuHKUSTZweryfikowany adres z umich.edu
Quanquan GuAssociate Professor of Computer Science, UCLAZweryfikowany adres z cs.ucla.edu
Chenlu YeHong Kong University of Science and TechnologyZweryfikowany adres z connect.ust.hk
Yong Linthe Hong Kong University of Science and TechnologyZweryfikowany adres z connect.ust.hk
Han ZhongPeking UniversityZweryfikowany adres z stu.pku.edu.cn
Jianshu ChenPrincipal Scientist, AmazonZweryfikowany adres z ucla.edu
Xiaoman PanTencent AIZweryfikowany adres z tencent.com

Obserwuj

Rui Yang

Hong Kong University of Science and Technology

Zweryfikowany adres z connect.ust.hk - Strona główna

Reinforcement Learning Deep Learning Machine Learning


Tytuł Sortuj wg cytatów Sortuj wg roku Sortuj wg tytułu	Cytowane przez Cytowane przez	Rok
Rethinking Goal-conditioned Supervised Learning and Its Connection to Offline RL R Yang, Y Lu, W Li, H Sun, M Fang, Y Du, X Li, L Han, C Zhang International Conference on Learning Representations (ICLR) 2022, 2022	55	2022
FOCAL: Efficient Fully-Offline Meta-Reinforcement Learning via Distance Metric Learning and Behavior Regularization L Li, R Yang, D Luo International Conference on Learning Representations (ICLR) 2021, 2020	54	2020
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing R Yang, C Bai, X Ma, Z Wang, C Zhang, L Han Advances in Neural Information Processing Systems (NeurIPS) 2022, 2022	42	2022
Exploiting Reward Shifting in Value-Based Deep RL H Sun, L Han, R Yang, X Ma, J Guo, B Zhou Advances in Neural Information Processing Systems (NeurIPS) 2022, 2022	22*	2022
MHER: Model-based Hindsight Experience Replay R Yang, M Fang, L Han, Y Du, F Luo, X Li NeurIPS 2021 Deep RL Workshop, 2021	21	2021
A survey on sparse reward algorithms in reinforcement learning-theory and experiment 杨瑞，严江鹏，李秀智能系统学报 15 (5), 888-899, 2020	16*	2020
What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL? R Yang, Y Lin, X Ma, H Hu, C Zhang, T Zhang International Conference on Machine Learning (ICML) 2023, 2023	10	2023
Bias-reduced Multi-step Hindsight Experience Replay for Efficient Multi-goal Reinforcement Learning R Yang, J Lyu, Y Yang, J Yan, F Luo, D Luo, L Li, X Li arXiv preprint arXiv:2102.12962, 2021	8*	2021
Corruption-Robust Offline Reinforcement Learning with General Function Approximation C Ye, R Yang, Q Gu, T Zhang Advances in Neural Information Processing Systems (NeurIPS) 2023, 2023	5	2023
GOPlan: Goal-conditioned Offline Reinforcement Learning by Planning with Learned Models M Wang, R Yang, X Chen, M Fang NeurIPS 2023 GCRL workshop, 2023	3	2023
Efficient multi-goal reinforcement learning via value consistency prioritization J Xu, S Li, R Yang, C Yuan, L Han Journal of Artificial Intelligence Research 77, 355-376, 2023	2	2023
Combining hindsight with goal-enhanced prediction for multi-goal reinforcement learning R Yang, F Luo, X Li 2021 IEEE 33rd International Conference on Tools with Artificial …, 2021	2	2021
Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards H Wang, Y Lin, W Xiong, R Yang, S Diao, S Qiu, H Zhao, T Zhang arXiv preprint arXiv:2402.18571, 2024	1	2024
Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment R Yang, X Pan, F Luo, S Qiu, H Zhong, D Yu, J Chen arXiv preprint arXiv:2402.10207, 2024	1	2024
Towards Robust Offline Reinforcement Learning under Diverse Data Corruption R Yang, H Zhong, J Xu*, A Zhang, C Zhang, L Han, T Zhang International Conference on Learning Representations (ICLR) 2024, 2023	1	2023
Towards Robust Offline-to-Online Reinforcement Learning via Uncertainty and Smoothness X Wen, X Yu, R Yang, C Bai, Z Wang arXiv preprint arXiv:2309.16973, 2023	1	2023
Robot control method, apparatus and device, storage medium and program product R Yang, L Li, D Luo US Patent App. 17/957,710, 2023		2023

Nie można teraz wykonać tej operacji. Spróbuj ponownie później.

Prace 1–17

Cytowania rocznie

Powielone cytowania

Scalone cytowania

Dodaj współautorówWspółautorzy

Obserwuj

Cytowane przez

Współautorzy