Obserwuj
Chenjia Bai
Chenjia Bai
Shanghai AI Laboratory
Zweryfikowany adres z pjlab.org.cn - Strona główna
Tytuł
Cytowane przez
Cytowane przez
Rok
Exploration in Deep Reinforcement Learning: From Single-Agent to Multi-Agent Domain
J Hao, T Yang, H Tang, C Bai, J Liu, Z Meng, P Liu, Z Wang
IEEE Transactions on Neural Networks and Learning Systems, 2023
152*2023
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
C Bai, L Wang, Z Yang, Z Deng, A Garg, P Liu, Z Wang
International Conference on Learning representations (ICLR), 2022
992022
RORL: Robust Offline Reinforcement Learning via Conservative Smoothing
R Yang, C Bai, X Ma, Z Wang, C Zhang, L Han
Neural Information Processing Systems (NeurIPS), 2022
422022
Survey on Sparse Reward in Deep Reinforcement Learning
W Yang, C Bai, C Cai, Y Zhao, P Liu
计算机科学 47 (3), 182-191, 2020
37*2020
Principled Exploration via Optimistic Bootstrapping and Backward Induction
C Bai, L Wang, L Han, J Hao, A Garg, P Liu, Z Wang
International Conference on Machine Learning (ICML), 2021
362021
Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
S Qiu, L Wang, C Bai, Z Yang, Z Wang
International Conference on Machine Learning (ICML), 18168-18210, 2022
222022
Guided Goal Generation for Hindsight Multi-Goal Reinforcement Learning
C Bai, P Liu, W Zhao, X Tang
Neurocomputing 359, 353-367, 2019
222019
Dynamic Bottleneck for Robust Self-Supervised Exploration
C Bai, L Wang, L Han, A Garg, J Hao, P Liu, Z Wang
Neural Information Processing Systems (NeurIPS), 2021
212021
Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning
H He, C Bai, K Xu, Z Yang, W Zhang, D Wang, B Zhao, X Li
Neural Information Processing Systems (NeurIPS), 2023
182023
Variational Dynamic for Self-Supervised Exploration in Deep Reinforcement Learning
C Bai, P Liu, K Liu, L Wang, Y Zhao, L Han, Z Wang
IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 2021
132021
Addressing Hindsight Bias in Multi-Goal Reinforcement Learning
C Bai, L Wang, Y Wang, Z Wang, R Zhao, C Bai, P Liu
IEEE Transactions on Cybernetics, 2021
122021
Generating Attentive Goals for Prioritized Hindsight Reinforcement Learning
P Liu, C Bai, Y Zhao, C Bai, W Zhao, X Tang
Knowledge-Based Systems 203, 106140, 2020
102020
Active Sampling for Deep Q-learning Based on TD-error Adaptive Correction
C Bai, P Liu, W Zhao, X Tang
计算机研究与发展 56 (2), 262-280, 2019
9*2019
False Correlation Reduction for Offline Reinforcement Learning
Z Deng, Z Fu, L Wang, Z Yang, C Bai, T Zhou, Z Wang, J Jiang
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023
8*2023
Monotonic Quantile Network for Worst-Case Offline Reinforcement Learning
C Bai, T Xiao, Z Zhu, L Wang, F Zhou, A Garg, B He, P Liu, Z Wang
IEEE Transactions on Neural Networks and Learning Systems, 2022
82022
Research on Autonomous Driving Methods Based on Computer Vision and Deep Learning
C Bai
Harbin Institute of Technology, 2017
52017
Obtaining Accurate Estimated Action Values in Categorical Distributional Reinforcement Learning
Y Zhao, P Liu, C Bai, W Zhao, X Tang
Knowledge-Based Systems 194, 105511, 2020
32020
Robust Quadrupedal Locomotion via Risk-Averse Policy Learning
J Shi, C Bai, H He, L Han, D Wang, B Zhao, X Li, X Li
IEEE International Conference on Robotics and Automation (ICRA), 2024
22024
Cross-Domain Policy Adaptation via Value-Guided Data Filtering
K Xu, C Bai, X Ma, D Wang, B Zhao, Z Wang, X Li, W Li
Neural Information Processing Systems (NeurIPS), 2023
22023
Behavior Contrastive Learning for Unsupervised Skill Discovery
R Yang, C Bai, H Guo, S Li, B Zhao, Z Wang, P Liu, X Li
International Conference on Machine Learning (ICML), 2023
22023
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–20