Hattie Zhou

Citations per year: 2019: 16, 2020: 83, 2021: 102, 2022: 122, 2023: 190, 2024: 139

Janice Lan, Facebook AI Research (verified email at fb.com)
Jason Yosinski, Windscape AI; ML Collective (verified email at windscape.ai)
Rosanne Liu, ML Collective; Google DeepMind (verified email at mlcollective.org)
Preetum Nakkiran, Apple ML Research (verified email at cs.harvard.edu)
Hugo Larochelle, Google DeepMind & Mila (verified email at google.com)
Aaron Courville, Professor, DIRO, Université de Montréal, Mila, Cifar CAI chair (verified email at umontreal.ca)
Noam Razin, Computer Science PhD Candidate, Tel Aviv University (verified email at cs.tau.ac.il)
Joshua M Susskind, Apple AI Research (verified email at apple.com)
Etai Littwin, Research Scientist at Apple (verified email at apple.com)
Hanie Sedghi, Senior Research Scientist, Google DeepMind (verified email at google.com)
Behnam Neyshabur, Senior Staff Research Scientist, DeepMind (verified email at google.com)
Azade Nova, Research Scientist at Google Brain (verified email at google.com)
Samy Bengio, Senior Director, AI and Machine Learning Research, Apple (verified email at apple.com)
Ankit Vani, PhD candidate, Mila, Université de Montréal (verified email at umontreal.ca)
Irina Rish, University of Montreal / Mila - Quebec AI Institute (verified email at mila.quebec)
Pascal Junior Tikeng Notsawo, PhD Student, Université de Montréal, Mila (verified email at mila.quebec)
Guillaume Dumas, Associate Professor of Computational Psychiatry, University of Montreal (verified email at ppsp.team)
Vimal Thilak, Apple (verified email at apple.com)

Hattie Zhou

Mila; Université de Montréal

Verified email at mila.quebec


Title	Cited by	Year
Deconstructing lottery tickets: Zeros, signs, and the supermask. H Zhou, J Lan, R Liu, J Yosinski. Advances in Neural Information Processing Systems 32, 2019	451	2019
Teaching algorithmic reasoning via in-context learning. H Zhou, A Nova, H Larochelle, A Courville, B Neyshabur, H Sedghi. arXiv preprint arXiv:2211.09066, 2022	85*	2022
What algorithms can transformers learn? A study in length generalization. H Zhou, A Bradley, E Littwin, N Razin, O Saremi, J Susskind, S Bengio, ... arXiv preprint arXiv:2310.16028, 2023	49	2023
Fortuitous forgetting in connectionist networks. H Zhou, A Vani, H Larochelle, A Courville. International Conference on Learning Representations, 2021	32	2021
LCA: Loss change allocation for neural network training. J Lan, R Liu, H Zhou, J Yosinski. Advances in Neural Information Processing Systems 32, 2019	25	2019
Predicting grokking long before it happens: A look into the loss landscape of models which grok. P Notsawo Jr, H Zhou, M Pezeshki, I Rish, G Dumas. arXiv preprint arXiv:2306.13253, 2023	7	2023
Vanishing gradients in reinforcement finetuning of language models. N Razin, H Zhou, O Saremi, V Thilak, A Bradley, P Nakkiran, J Susskind, ... arXiv preprint arXiv:2310.20703, 2023	3	2023
Step-by-Step Diffusion: An Elementary Tutorial. P Nakkiran, A Bradley, H Zhou, M Advani. arXiv preprint arXiv:2406.08929, 2024		2024

