A deep hierarchical approach to lifelong learning in Minecraft
C Tessler, S Givony, T Zahavy, D Mankowitz, S Mannor
Proceedings of the AAAI Conference on Artificial Intelligence 31 (1), 2017
Reward Constrained Policy Optimization
C Tessler, DJ Mankowitz, S Mannor
Seventh International Conference on Learning Representations, 2019
Action Robust Reinforcement Learning and Applications in Continuous Control
C Tessler, Y Efroni, S Mannor
International Conference on Machine Learning, 6215--6224, 2019
Distributional policy optimization: An alternative approach for continuous control
C Tessler, G Tennenholtz, S Mannor
Advances in Neural Information Processing Systems 32, 1352--1362, 2019
Action Assembly: Sparse Imitation Learning for Text Based Games with Combinatorial Action Spaces
C Tessler, T Zahavy, D Cohen, DJ Mankowitz, S Mannor
RLDM 2019: The Multi-disciplinary Conference on Reinforcement Learning and …, 2019
Reward Tweaking: Maximizing the Total Reward While Planning for Short Horizons
C Tessler, S Mannor
arXiv preprint arXiv:2002.03327, 2020
Language is power: Representing states using natural language in reinforcement learning
E Schwartz, G Tennenholtz, C Tessler, S Mannor
arXiv preprint arXiv:1910.02789, 2019
Reinforcement Learning for Datacenter Congestion Control
C Tessler, Y Shpigelman, G Dalal, A Mandelbaum, DH Kazakov, B Fuhrer, ...
arXiv preprint arXiv:2102.09337, 2021
Inverse reinforcement learning in contextual MDPs
S Belogolovsky, P Korsunsky, S Mannor, C Tessler, T Zahavy
Machine Learning, 1--40, 2021
Ensemble Bootstrapping for Q-Learning
O Peer, C Tessler, N Merlis, R Meir
arXiv preprint arXiv:2103.00445, 2021
Deep Reinforcement Learning Works - Now What?
C Tessler
Personal blog, 2020
