Follow
Hattie Zhou
Hattie Zhou
Mila; Université de Montréal
Verified email at mila.quebec - Homepage
Title
Cited by
Cited by
Year
Deconstructing lottery tickets: Zeros, signs, and the supermask
H Zhou, J Lan, R Liu, J Yosinski
Advances in neural information processing systems 32, 2019
4802019
Teaching algorithmic reasoning via in-context learning
H Zhou, A Nova, H Larochelle, A Courville, B Neyshabur, H Sedghi
arXiv preprint arXiv:2211.09066, 2022
101*2022
What algorithms can transformers learn? a study in length generalization
H Zhou, A Bradley, E Littwin, N Razin, O Saremi, J Susskind, S Bengio, ...
arXiv preprint arXiv:2310.16028, 2023
902023
Fortuitous forgetting in connectionist networks
H Zhou, A Vani, H Larochelle, A Courville
arXiv preprint arXiv:2202.00155, 2022
412022
Lca: Loss change allocation for neural network training
J Lan, R Liu, H Zhou, J Yosinski
Advances in Neural Information Processing Systems 32, 2019
262019
Predicting grokking long before it happens: A look into the loss landscape of models which grok
P Notsawo Jr, H Zhou, M Pezeshki, I Rish, G Dumas
arXiv preprint arXiv:2306.13253, 2023
122023
Vanishing gradients in reinforcement finetuning of language models
N Razin, H Zhou, O Saremi, V Thilak, A Bradley, P Nakkiran, J Susskind, ...
arXiv preprint arXiv:2310.20703, 2023
52023
A Formal Framework for Understanding Length Generalization in Transformers
X Huang, A Yang, S Bhattamishra, Y Sarrof, A Krebs, H Zhou, P Nakkiran, ...
arXiv preprint arXiv:2410.02140, 2024
2024
Step-by-Step Diffusion: An Elementary Tutorial
P Nakkiran, A Bradley, H Zhou, M Advani
arXiv preprint arXiv:2406.08929, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–9