Volgen
Ethan Dyer
Ethan Dyer
Google Research, Blueshift Team
Geverifieerd e-mailadres voor google.com - Homepage
Titel
Geciteerd door
Geciteerd door
Jaar
Palm 2 technical report
R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ...
arXiv preprint arXiv:2305.10403, 2023
6992023
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models
A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ...
arXiv preprint arXiv:2206.04615, 2022
6932022
Solving quantitative reasoning problems with language models
A Lewkowycz, A Andreassen, D Dohan, E Dyer, H Michalewski, ...
Advances in Neural Information Processing Systems 35, 3843-3857, 2022
3852022
Gemini: a family of highly capable multimodal models
G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ...
arXiv preprint arXiv:2312.11805, 2023
3482023
Boundary terms, variational principles, and higher derivative modified gravity
E Dyer, K Hinterbichler
Physical Review D 79 (2), 024028, 2009
2232009
The large learning rate phase of deep learning: the catapult mechanism
A Lewkowycz, Y Bahri, E Dyer, J Sohl-Dickstein, G Gur-Ari
arXiv preprint arXiv:2003.02218, 2020
1822020
Gradient descent happens in a tiny subspace
G Gur-Ari, DA Roberts, E Dyer
arXiv preprint arXiv:1812.04754, 2018
1682018
Anatomy of catastrophic forgetting: Hidden representations and task semantics
VV Ramasesh, E Dyer, M Raghu
arXiv preprint arXiv:2007.07400, 2020
1392020
Explaining neural scaling laws
Y Bahri, E Dyer, J Kaplan, J Lee, U Sharma
arXiv preprint arXiv:2102.06701, 2021
1232021
When do curricula work?
X Wu, E Dyer, B Neyshabur
arXiv preprint arXiv:2012.03107, 2020
1152020
Effect of scale on catastrophic forgetting in neural networks
VV Ramasesh, A Lewkowycz, E Dyer
International Conference on Learning Representations, 2021
1092021
Asymptotics of wide networks from feynman diagrams
E Dyer, G Gur-Ari
arXiv preprint arXiv:1909.11304, 2019
1082019
Exploring length generalization in large language models
C Anil, Y Wu, A Andreassen, A Lewkowycz, V Misra, V Ramasesh, ...
Advances in Neural Information Processing Systems 35, 38546-38556, 2022
1032022
Universal bounds on charged states in 2d CFT and 3d gravity
N Benjamin, E Dyer, AL Fitzpatrick, S Kachru
Journal of High Energy Physics 2016 (8), 1-26, 2016
882016
Block-recurrent transformers
DL Hutchins, I Schlag, Y Wu, E Dyer, B Neyshabur
Advances in neural information processing systems 35, 33248-33261, 2022
802022
Scaling dimensions of monopole operators in the theory in 2 + 1 dimensions
E Dyer, M Mezei, SS Pufu, S Sachdev
Journal of High Energy Physics 2015 (6), 1-48, 2015
742015
2D CFT partition functions at late times
E Dyer, G Gur-Ari
Journal of High Energy Physics 2017 (8), 1-35, 2017
732017
Monopole taxonomy in three-dimensional conformal field theories
E Dyer, M Mezei, SS Pufu
arXiv preprint arXiv:1309.1160, 2013
632013
Spinning geodesic Witten diagrams
E Dyer, DZ Freedman, J Sully
Journal of High Energy Physics 2017 (11), 1-37, 2017
552017
Affinity and diversity: Quantifying mechanisms of data augmentation
R Gontijo-Lopes, SJ Smullin, ED Cubuk, E Dyer
arXiv preprint arXiv:2002.08973, 2020
532020
Het systeem kan de bewerking nu niet uitvoeren. Probeer het later opnieuw.
Artikelen 1–20