Tom Zahavy

Cytowane przez

	Wszystkie	Od 2019
Cytowania	1998	1753
h-indeks	20	19
i10-indeks	31	31

460

230

115

345

20162017201820192020202120222023202434 58 143 171 257 308 401 454 160

Dostęp publiczny

Wyświetl wszystko

3 artykuły

0 artykułów

dostępne

niedostępne

Objęte finansowaniem

Współautorzy

Shie MannorProfessor of Electrical Engineering @ Technion & Researcher @ Nvidia ResearchZweryfikowany adres z technion.ac.il
Daniel J. MankowitzGoogle DeepmindZweryfikowany adres z google.com
Satinder SinghGoogle DeepMind / U. of MichiganZweryfikowany adres z umich.edu
Sebastian FlennerhagResearch Scientist at DeepMindZweryfikowany adres z google.com
Chen TesslerResearch Scientist, NVIDIA ResearchZweryfikowany adres z nvidia.com
Hado van HasseltResearch Scientist, DeepMind; Honorary Professor, UCLZweryfikowany adres z google.com
Mordechai SegevSolid State Institute, Physics Department and Electrical Engineering Department Technion - IsraelZweryfikowany adres z technion.ac.il
Alex DikopoltsevQuantum Optoelectronics Group, Department of Physics, ETHZweryfikowany adres z phys.ethz.ch
Brendan O'DonoghueStanford University, Google DeepMindZweryfikowany adres z alumni.stanford.edu
Zhongwen XuTencentZweryfikowany adres z tencent.com
Oren CohenProfessor of Physics, Technion, IsraelZweryfikowany adres z technion.ac.il
Vivek VeeriahGoogle DeepMindZweryfikowany adres z google.com
David SilverDeepMind, UCLZweryfikowany adres z google.com
Matteo HesselResearch Engineer, Google DeepMindZweryfikowany adres z google.com
Junhyuk OhResearch Scientist, DeepMindZweryfikowany adres z google.com
Nadav MerlisPostdoctoral Fellow @ CREST, ENSAE ParisZweryfikowany adres z ensae.fr
Alessandro MagnaniWalmartlabsZweryfikowany adres z walmartlabs.com
Tom SchaulSenior Staff Scientist, DeepMindZweryfikowany adres z nyu.edu
Valentin DalibardUniversity of CambridgeZweryfikowany adres z cl.cam.ac.uk
Yannick SchroeckerDeepMindZweryfikowany adres z google.com

Obserwuj

Tom Zahavy

Inne imiona/nazwiskaTom Ben Zion Zahavy

Staff Research Scientist, Google DeepMind

Zweryfikowany adres z deepmind.com - Strona główna

Reinforcement Learning


Tytuł Sortuj wg cytatów Sortuj wg roku Sortuj wg tytułu	Cytowane przez Cytowane przez	Rok
A deep hierarchical approach to lifelong learning in minecraft C Tessler, S Givony, T Zahavy, D Mankowitz, S Mannor Proceedings of the AAAI conference on artificial intelligence 31 (1), 2017	432	2017
Graying the black box: Understanding dqns T Zahavy, N Ben-Zrihem, S Mannor International conference on machine learning (ICML), 1899-1908, 2016	319	2016
Learn what not to learn: Action elimination with deep reinforcement learning T Zahavy, M Haroush, N Merlis, DJ Mankowitz, S Mannor Advances in neural information processing systems 31, 2018	232	2018
Deep learning reconstruction of ultrashort pulses T Zahavy, A Dikopoltsev, D Moss, GI Haham, O Cohen, S Mannor, ... Optica 5 (5), 666-673, 2018	163	2018
Is a picture worth a thousand words? A deep multi-modal architecture for product classification in e-commerce T Zahavy, A Krishnan, A Magnani, S Mannor Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	105*	2018
A self-tuning actor-critic algorithm T Zahavy, Z Xu, V Veeriah, M Hessel, J Oh, HP van Hasselt, D Silver, ... Advances in neural information processing systems 33, 20913-20924, 2020	78	2020
Bootstrapped meta-learning S Flennerhag, Y Schroecker, T Zahavy, H van Hasselt, D Silver, S Singh International Conference on Learning Representations (ICLR) 2022, 2021	66	2021
Shallow updates for deep reinforcement learning N Levine, T Zahavy, DJ Mankowitz, A Tamar, S Mannor Advances in Neural Information Processing Systems 30, 2017	52	2017
Reward is enough for convex mdps T Zahavy, B O'Donoghue, G Desjardins, S Singh Advances in Neural Information Processing Systems 34, 25746-25759, 2021	50	2021
Online limited memory neural-linear bandits with likelihood matching O Nabati, T Zahavy, S Mannor International Conference on Machine Learning, 7905-7915, 2021	37*	2021
Discovery of options via meta-learned subgoals V Veeriah, T Zahavy, M Hessel, Z Xu, J Oh, I Kemaev, HP van Hasselt, ... Advances in Neural Information Processing Systems 34, 29861-29873, 2021	35	2021
Ensemble robustness and generalization of stochastic deep learning algorithms T Zahavy, B Kang, A Sivak, J Feng, H Xu, S Mannor arXiv preprint arXiv:1602.02389, 2016	34*	2016
Discovering Evolution Strategies via Meta-Black-Box Optimization R Tjarko Lange, T Schaul, Y Chen, T Zahavy, V Dallibard, C Lu, S Singh, ... International Conference on Learning Representations (ICLR) 2023, 2022	30*	2022
Discovering Policies with DOMiNO: Diversity Optimization Maintaining Near Optimality T Zahavy, Y Schroecker, F Behbahani, K Baumli, S Flennerhag, S Hou, ... International Conference on Learning Representations (ICLR) 2023, 2022	27	2022
Deep learning reconstruction of ultrashort pulses from 2D spatial intensity patterns recorded by an all-in-line system in a single-shot R Ziv, A Dikopoltsev, T Zahavy, I Rubinstein, P Sidorenko, O Cohen, ... Optics express 28 (5), 7528-7538, 2020	25	2020
Emphatic algorithms for deep reinforcement learning R Jiang, T Zahavy, Z Xu, A White, M Hessel, C Blundell, H Van Hasselt International Conference on Machine Learning (ICML), 5023-5033, 2021	22	2021
Online Apprenticeship Learning L Shani, T Zahavy, S Mannor Proceedings of the AAAI Conference on Artificial Intelligence, 2021	22	2021
Discovering a set of policies for the worst case reward T Zahavy, A Barreto, DJ Mankowitz, S Hou, B O'Donoghue, I Kemaev, ... International Conference on Learning Representations (ICLR) 2021, 2021	22	2021
Visualizing dynamics: from t-sne to semi-mdps NB Zrihem, T Zahavy, S Mannor Workshop on Human Interpretability in Machine Learning, ICML (WHI 2016), 2016	21*	2016
Balancing constraints and rewards with meta-gradient d4pg DA Calian, DJ Mankowitz, T Zahavy, Z Xu, J Oh, N Levine, T Mann International Conference on Learning Representations (ICLR) 2021, 2020	20	2020

Nie można teraz wykonać tej operacji. Spróbuj ponownie później.

Prace 1–20

Cytowania rocznie

Powielone cytowania

Scalone cytowania

Dodaj współautorówWspółautorzy

Obserwuj

Cytowane przez

Współautorzy