Obserwuj
Yuping Luo
Yuping Luo
Computer Science Department, Princeton University
Zweryfikowany adres z cs.princeton.edu
Tytuł
Cytowane przez
Cytowane przez
Rok
Implicit regularization in deep matrix factorization
S Arora, N Cohen, W Hu, Y Luo
Advances in Neural Information Processing Systems 32, 2019
4752019
Algorithmic framework for model-based deep reinforcement learning with theoretical guarantees
Y Luo, H Xu, Y Li, Y Tian, T Darrell, T Ma
arXiv preprint arXiv:1807.03858, 2018
2302018
Towards resolving the implicit bias of gradient descent for matrix factorization: Greedy low-rank learning
Z Li, Y Luo, K Lyu
arXiv preprint arXiv:2012.09839, 2020
1072020
Provably efficient Q-learning with function approximation via distribution shift error checking oracle
SS Du, Y Luo, R Wang, H Zhang
Advances in Neural Information Processing Systems 32, 2019
962019
Provable representation learning for imitation learning via bi-level optimization
S Arora, S Du, S Kakade, Y Luo, N Saunshi
International Conference on Machine Learning, 367-376, 2020
572020
Safe reinforcement learning by imagining the near future
G Thomas, Y Luo, T Ma
Advances in Neural Information Processing Systems 34, 13859-13869, 2021
552021
Learning online alignments with continuous rewards policy gradient
Y Luo, CC Chiu, N Jaitly, I Sutskever
2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017
512017
Learning barrier certificates: Towards safe reinforcement learning with zero training-time violations
Y Luo, T Ma
Advances in Neural Information Processing Systems 34, 25621-25632, 2021
342021
On the expressivity of neural networks for deep reinforcement learning
K Dong, Y Luo, T Yu, C Finn, T Ma
International conference on machine learning, 2627-2637, 2020
312020
Learning self-correctable policies and value functions from demonstrations with negative sampling
Y Luo, H Xu, T Ma
arXiv preprint arXiv:1907.05634, 2019
172019
Towards learning to play piano with dexterous hands and touch
H Xu, Y Luo, S Wang, T Darrell, R Calandra
2022 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2022
142022
An online sequence-to-sequence model for noisy speech recognition
CC Chiu, D Lawson, Y Luo, G Tucker, K Swersky, I Sutskever, N Jaitly
arXiv preprint arXiv:1706.06428, 2017
82017
Recurrent neural networks for online sequence generation
CC Chiu, N Jaitly, I Sutskever, Y Luo
US Patent 10,281,885, 2019
72019
Bootstrapping the expressivity with model-based planning
K Dong, Y Luo, T Ma
22019
Towards Efficient and Effective Deep Model-Based Reinforcement Learning
Y Luo
Princeton University, 2022
2022
Recurrent neural networks for online sequence generation
CC Chiu, N Jaitly, I Sutskever, Y Luo
US Patent 10,656,605, 2020
2020
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–16