James Martens

Cited by

	All	Since 2019
Citations	14622	10258
h-index	26	25
i10-index	31	27

2100

1050

525

1575

201220132014201520162017201820192020202120222023202475 150 255 512 814 1001 1435 1743 1908 2041 1992 1998 576

Public access

View all

3 articles

0 articles

available

not available

Based on funding mandates

James Martens

Research Scientist, DeepMind

Verified email at google.com - Homepage

Machine Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
On the importance of initialization and momentum in deep learning I Sutskever, J Martens, G Dahl, G Hinton International conference on machine learning, 1139-1147, 2013	6094	2013
Generating text with recurrent neural networks I Sutskever, J Martens, GE Hinton Proceedings of the 28th international conference on machine learning (ICML …, 2011	2008	2011
Deep learning via hessian-free optimization. J Martens Icml 27, 735-742, 2010	1237	2010
Optimizing neural networks with kronecker-factored approximate curvature J Martens, R Grosse International conference on machine learning, 2408-2417, 2015	960	2015
Learning recurrent neural networks with hessian-free optimization J Martens, I Sutskever Proceedings of the 28th international conference on machine learning (ICML …, 2011	807	2011
New insights and perspectives on the natural gradient method J Martens Journal of Machine Learning Research 21 (146), 1-76, 2020	594	2020
Adding gradient noise improves learning for very deep networks A Neelakantan, L Vilnis, QV Le, I Sutskever, L Kaiser, K Kurach, J Martens arXiv preprint arXiv:1511.06807, 2015	574	2015
Adversarial robustness through local linearization C Qin, J Martens, S Gowal, D Krishnan, K Dvijotham, A Fawzi, S De, ... Advances in neural information processing systems 32, 2019	310	2019
The mechanics of n-player differentiable games D Balduzzi, S Racaniere, J Martens, J Foerster, K Tuyls, T Graepel International Conference on Machine Learning, 354-363, 2018	307	2018
A kronecker-factored approximate fisher matrix for convolution layers R Grosse, J Martens International Conference on Machine Learning, 573-582, 2016	264	2016
Training deep and recurrent networks with hessian-free optimization J Martens, I Sutskever Neural Networks: Tricks of the Trade: Second Edition, 479-535, 2012	248	2012
Which algorithmic choices matter at which batch sizes? insights from a noisy quadratic model G Zhang, L Li, Z Nado, J Martens, S Sachdeva, G Dahl, C Shallue, ... Advances in neural information processing systems 32, 2019	127	2019
Fast convergence of natural gradient descent for over-parameterized neural networks G Zhang, J Martens, RB Grosse Advances in Neural Information Processing Systems 32, 2019	122	2019
Distributed second-order optimization using kronecker-factored approximations J Ba, R Grosse, J Martens International conference on learning representations, 2022	101	2022
Differentiable game mechanics A Letcher, D Balduzzi, S Racaniere, J Martens, J Foerster, K Tuyls, ... Journal of Machine Learning Research 20 (84), 1-40, 2019	88	2019
On the representational efficiency of restricted boltzmann machines J Martens, A Chattopadhya, T Pitassi, R Zemel Advances in Neural Information Processing Systems 26, 2013	87	2013
Kronecker-factored curvature approximations for recurrent neural networks J Martens, J Ba, M Johnson International Conference on Learning Representations, 2018	84	2018
Estimating the hessian by back-propagating curvature J Martens, I Sutskever, K Swersky arXiv preprint arXiv:1206.6464, 2012	80	2012
Pre-training via denoising for molecular property prediction S Zaidi, M Schaarschmidt, J Martens, H Kim, YW Teh, ... arXiv preprint arXiv:2206.00133, 2022	74	2022
Second-order optimization for neural networks J Martens University of Toronto (Canada), 2016	71	2016

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by