Aviv Tamar
Tytuł
Cytowane przez
Cytowane przez
Rok
Multi-agent actor-critic for mixed cooperative-competitive environments
R Lowe, Y Wu, A Tamar, J Harb, P Abbeel, I Mordatch
arXiv preprint arXiv:1706.02275, 2017
13142017
Value iteration networks
A Tamar, Y Wu, G Thomas, S Levine, P Abbeel
arXiv preprint arXiv:1602.02867, 2016
4642016
Constrained policy optimization
J Achiam, D Held, A Tamar, P Abbeel
International Conference on Machine Learning, 22-31, 2017
3962017
Bayesian reinforcement learning: A survey
M Ghavamzadeh, S Mannor, J Pineau, A Tamar
arXiv preprint arXiv:1609.04436, 2016
2532016
Model-ensemble trust-region policy optimization
T Kurutach, I Clavera, Y Duan, A Tamar, P Abbeel
arXiv preprint arXiv:1802.10592, 2018
2052018
Risk-sensitive and robust decision-making: a cvar optimization approach
Y Chow, A Tamar, S Mannor, M Pavone
arXiv preprint arXiv:1506.02188, 2015
1462015
Policy gradients with variance related risk criteria
D Di Castro, A Tamar, S Mannor
arXiv preprint arXiv:1206.6404, 2012
1292012
Policy gradients with variance related risk criteria
D Di Castro, A Tamar, S Mannor
arXiv preprint arXiv:1206.6404, 2012
1292012
Learning to route
A Valadarsky, M Schapira, D Shahaf, A Tamar
Proceedings of the 16th ACM workshop on hot topics in networks, 185-191, 2017
962017
Optimizing the CVaR via sampling
A Tamar, Y Glassner, S Mannor
Proceedings of the AAAI Conference on Artificial Intelligence 29 (1), 2015
932015
Learning plannable representations with causal infogan
T Kurutach, A Tamar, G Yang, S Russell, P Abbeel
arXiv preprint arXiv:1807.09341, 2018
902018
A deep reinforcement learning perspective on internet congestion control
N Jay, N Rotman, B Godfrey, M Schapira, A Tamar
International Conference on Machine Learning, 3050-3059, 2019
832019
Learning robotic assembly from cad
G Thomas, M Chien, A Tamar, JA Ojea, P Abbeel
2018 IEEE International Conference on Robotics and Automation (ICRA), 3524-3531, 2018
792018
Scaling up robust MDPs using function approximation
A Tamar, S Mannor, H Xu
International Conference on Machine Learning, 181-189, 2014
652014
Reinforcement learning on variable impedance controller for high-precision robotic assembly
J Luo, E Solowjow, C Wen, JA Ojea, AM Agogino, A Tamar, P Abbeel
2019 International Conference on Robotics and Automation (ICRA), 3080-3087, 2019
552019
Learning generalized reactive policies using deep neural networks
E Groshev, A Tamar, M Goldstein, S Srivastava, P Abbeel
2018 AAAI Spring Symposium Series, 2018
522018
Policy gradient for coherent risk measures
A Tamar, Y Chow, M Ghavamzadeh, S Mannor
arXiv preprint arXiv:1502.03919, 2015
462015
Learning from the hindsight plan—episodic mpc improvement
A Tamar, G Thomas, T Zhang, S Levine, P Abbeel
2017 IEEE International Conference on Robotics and Automation (ICRA), 336-343, 2017
402017
Learning robotic manipulation through visual planning and acting
A Wang, T Kurutach, K Liu, P Abbeel, A Tamar
arXiv preprint arXiv:1905.04411, 2019
382019
Learning the variance of the reward-to-go
A Tamar, D Di Castro, S Mannor
The Journal of Machine Learning Research 17 (1), 361-396, 2016
382016
Nie można teraz wykonać tej operacji. Spróbuj ponownie później.
Prace 1–20