Jiaming Ji (吉嘉铭)

Cited by

	All	Since 2019
Citations	429	428
h-index	9	9
i10-index	9	9

260

130

195

2022202320243 165 259

Public access

View all

3 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Yaodong YangBOYA (博雅) Assistant Professor at Peking UniversityVerified email at pku.edu.cn
Xuehai PanPeking UniversityVerified email at pku.edu.cn
Boyuan ChenPeking UniversityVerified email at stu.pku.edu.cn
Hantao LouPeking UniversityVerified email at stu.pku.edu.cn
Tianyi (Alex) QiuPeking UniversityVerified email at stu.pku.edu.cn
Yiran GengTuring Class, Peking UniversityVerified email at stu.pku.edu.cn
Yuanpei ChenSouth China University of TechnologyVerified email at stanford.edu

Jiaming Ji (吉嘉铭)

Peking University

Verified email at stu.pku.edu.cn - Homepage

AI Alignment Reinforcement Learning Large Language Model


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Baichuan 2: Open large-scale language models A Yang, B Xiao, B Wang, B Zhang, C Bian, C Yin, C Lv, D Pan, D Wang, ... arXiv preprint arXiv:2309.10305, 2023	173*	2023
Beavertails: Towards improved safety alignment of llm via a human-preference dataset J Ji, M Liu, J Dai, X Pan, C Zhang, C Bian, R Sun, Y Wang, Y Yang NeurIPS 2023, 2023	62	2023
Ai alignment: A comprehensive survey J Ji, T Qiu, B Chen, B Zhang, H Lou, K Wang, Y Duan, Z He, J Zhou, ... arXiv preprint arXiv:2310.19852, 2023	56	2023
Safe rlhf: Safe reinforcement learning from human feedback J Dai, X Pan, R Sun, J Ji, X Xu, M Liu, Y Wang, Y Yang ICLR 2024 Spotlight, 2023	31	2023
Constrained update projection approach to safe policy optimization L Yang, J Ji, J Dai, L Zhang, B Zhou, P Li, Y Yang, G Pan NeurIPS 2022, 2022	25	2022
Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark J Ji, B Zhang, J Zhou, X Pan, W Huang, R Sun, Y Geng, Y Zhong, J Dai, ... NeurIPS 2023, 2023	17*	2023
Omnisafe: An infrastructure for accelerating safe reinforcement learning research J Ji, J Zhou, B Zhang, J Dai, X Pan, R Sun, W Huang, Y Geng, M Liu, ... arXiv preprint arXiv:2305.09304, 2023	13	2023
Cup: A conservative update policy algorithm for safe reinforcement learning L Yang, J Ji, J Dai, Y Zhang, P Li, G Pan arXiv preprint arXiv:2202.07565, 2022	12	2022
Pku-beaver: Constrained value-aligned llm via safe rlhf J Dai, X Pan, J Ji, R Sun, Y Wang, Y Yang	10	2023
Heterogeneous-Agent Reinforcement Learning Y Zhong, JG Kuba, S Hu, J Ji, Y Yang JMLR, 2023	9	2023
Bi-DexHands: Towards Human-Level Bimanual Dexterous Manipulation Y Chen, Y Geng, F Zhong, J Ji, J Jiang, Z Lu, H Dong, Y Yang IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023	5	2023
SafeDreamer: Safe Reinforcement Learning with World Models W Huang, J Ji, B Zhang, C Xia, Y Yang ICLR 2024, 2023	4	2023
Augmented proximal policy optimization for safe reinforcement learning J Dai, J Ji, L Yang, Q Zheng, G Pan Proceedings of the AAAI Conference on Artificial Intelligence 37 (6), 7288-7295, 2023	4	2023
Aligner: Achieving efficient alignment through weak-to-strong correction J Ji, B Chen, H Lou, D Hong, B Zhang, X Pan, J Dai, Y Yang arXiv preprint arXiv:2402.02416, 2024	3	2024
Rethinking Information Structures in RLHF: Reward Generalization from a Graph Theory Perspective T Qiu, F Zeng, J Ji, D Yan, K Wang, J Zhou, H Yang, J Dai, X Pan, Y Yang arXiv preprint arXiv:2402.10184, 2024	2	2024
MyoChallenge 2022: Learning contact-rich manipulation using a musculoskeletal hand V Caggiano, G Durandau, H Wang, A Chiappa, A Mathis, P Tano, N Patel, ... NeurIPS 2022 Competition Track, 233-250, 2023	2	2023
VOCE: Variational Optimization with Conservative Estimation for Offline Safe Reinforcement Learning J Guan, G Chen, J Ji, L Yang, A Zhou, Z Li NeurIPS 2023, 2023	1	2023
ReDMan: Reliable Dexterous Manipulation with Safe Reinforcement Learning Y Geng, J Ji, Y Chen, H Geng, F Zhong, Y Yang

The system can't perform the operation now. Try again later.

Articles 1–18

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors