Levent Sagun

Cited by

	All	Since 2019
Citations	3963	3652
h-index	18	18
i10-index	22	21

920

460

230

690

20162017201820192020202120222023202415 37 212 354 504 668 855 907 362

Public access

View all

10 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Giulio BiroliProfessor of Theoretical Physics, ENS ParisVerified email at cea.fr
Stéphane d'AscoliAI4science fellow at EPFL, LausanneVerified email at epfl.ch
Yann LeCunChief AI Scientist at Facebook & Silver Professor at the Courant Institute, New York UniversityVerified email at cs.nyu.edu
Stefano SpiglerÉcole Polytechnique Fédérale de LausanneVerified email at epfl.ch
Matthieu WyartProfessor of Physics, EPFLVerified email at epfl.ch
Mario GeigerMITVerified email at mit.edu
V. Uğur GüneyPhD Gruaduate, CUNY Graduate Center, Hunter CollegeVerified email at hunter.cuny.edu
Leon BottouFacebook AI ResearchVerified email at bottou.org
Matthew DunnNYUVerified email at nyu.edu
Ari S. MorcosDatologyAIVerified email at datologyai.com
Utku EvciResearcher @Google DeepmindVerified email at nyu.edu
Carlo BaldassiBocconi University; ELLIS scholarVerified email at unibocconi.it
Riccardo Zecchinaprofessor, theoretical physics, Bocconi UniversityVerified email at unibocconi.it
Jennifer ChayesProfessor, UC BerkeleyVerified email at jenniferchayes.com
Marco Baity-JesiEawagVerified email at eawag.ch
Volkan CirikASAPPVerified email at asapp.com
Kyunghyun ChoNew York University, GenentechVerified email at nyu.edu
Gerard BEN AROUSProfessor of Mathematics, NYU Shanghai and Courant Institute of Mathematical Sciences, New YorkVerified email at nyu.edu
Yann N. DauphinGoogle AIVerified email at dauphin.io
Priya GoyalFounding member@DatologyAI, ex-FAIR, ex-Google DeepmindVerified email at google.com

Levent Sagun

FAIR

Verified email at meta.com

machine learning deep learning probability statistical physics energy landscapes


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Entropy-sgd: Biasing gradient descent into wide valleys P Chaudhari, A Choromanska, S Soatto, Y LeCun, C Baldassi, C Borgs, ... arXiv preprint arXiv:1611.01838, 2016	763	2016
Convit: Improving vision transformers with soft convolutional inductive biases S d’Ascoli, H Touvron, ML Leavitt, AS Morcos, G Biroli, L Sagun International conference on machine learning, 2286-2296, 2021	729	2021
Searchqa: A new q&a dataset augmented with context from a search engine M Dunn, L Sagun, M Higgins, VU Guney, V Cirik, K Cho arXiv preprint arXiv:1704.05179, 2017	430	2017
Empirical analysis of the hessian of over-parametrized neural networks L Sagun, U Evci, VU Guney, Y Dauphin, L Bottou arXiv preprint arXiv:1706.04454, 2017	352	2017
Eigenvalues of the hessian in deep learning: Singularity and beyond L Sagun, L Bottou, Y LeCun arXiv preprint arXiv:1611.07476, 2016	229*	2016
A tail-index analysis of stochastic gradient noise in deep neural networks U Simsekli, L Sagun, M Gurbuzbalaban International Conference on Machine Learning, 5827-5837, 2019	216	2019
Scaling description of generalization with number of parameters in deep learning M Geiger, A Jacot, S Spigler, F Gabriel, L Sagun, S d’Ascoli, G Biroli, ... Journal of Statistical Mechanics: Theory and Experiment 2020 (2), 023401, 2020	208	2020
A jamming transition from under-to over-parametrization affects generalization in deep learning S Spigler, M Geiger, S d’Ascoli, L Sagun, G Biroli, M Wyart Journal of Physics A: Mathematical and Theoretical 52 (47), 474001, 2019	194*	2019
Jamming transition as a paradigm to understand the loss landscape of deep neural networks M Geiger, S Spigler, S d'Ascoli, L Sagun, M Baity-Jesi, G Biroli, M Wyart Physical Review E 100 (1), 012115, 2019	161	2019
Comparing dynamics: Deep neural networks versus glassy systems M Baity-Jesi, L Sagun, M Geiger, S Spigler, GB Arous, C Cammarota, ... International Conference on Machine Learning, 314-323, 2018	118	2018
Energy landscapes for machine learning AJ Ballard, R Das, S Martiniani, D Mehta, L Sagun, JD Stevenson, ... Physical Chemistry Chemical Physics 19 (20), 12585-12603, 2017	118	2017
Vision models are more robust and fair when pretrained on uncurated images without supervision P Goyal, Q Duval, I Seessel, M Caron, I Misra, L Sagun, A Joulin, ... arXiv preprint arXiv:2202.08360, 2022	92	2022
Triple descent and the two kinds of overfitting: Where & why do they appear? S d'Ascoli, L Sagun, G Biroli Advances in Neural Information Processing Systems 33, 3058-3069, 2020	89	2020
Explorations on high dimensional landscapes L Sagun, VU Guney, GB Arous, Y LeCun arXiv preprint arXiv:1412.6615, 2014	67	2014
Early Predictability of Asylum Court Decisions M Dunn, H Sirin, L Sagun, D Chen	44*	2017
Finding the needle in the haystack with convolutions: on the benefits of architectural bias S d'Ascoli, L Sagun, G Biroli, J Bruna Advances in Neural Information Processing Systems 32, 2019	39	2019
On the heavy-tailed theory of stochastic gradient descent for deep neural networks U Şimşekli, M Gürbüzbalaban, TH Nguyen, G Richard, L Sagun arXiv preprint arXiv:1912.00018, 2019	22	2019
Fairness indicators for systematic assessments of visual feature extractors P Goyal, AR Soriano, C Hazirbas, L Sagun, N Usunier Proceedings of the 2022 ACM Conference on Fairness, Accountability, and …, 2022	19	2022
On the interplay between data structure and loss function in classification problems S d'Ascoli, M Gabrié, L Sagun, G Biroli Advances in Neural Information Processing Systems 34, 8506-8517, 2021	17*	2021
Easing non-convex optimization with neural networks D Lopez-Paz, L Sagun	16	2018

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors