Kushal Tirumala
Facebook AI Research
Verified email at fb.com
Title
Cited by
Year
Machine learning for the Zwicky Transient Facility
A Mahabal, U Rebbapragada, R Walters, FJ Masci, N Blagorodnova, ...
Publications of the Astronomical Society of the Pacific 131 (997), 038002, 2019
Cited by: 138, Year: 2019
Memorization Without Overfitting: Analyzing the Training Dynamics of Large Language Models
K Tirumala, AH Markosyan, L Zettlemoyer, A Aghajanyan
Neural Information Processing Systems, 2022
Cited by: 134, Year: 2022
SemDeDup: Data-efficient learning at web-scale through semantic deduplication
A Abbas, K Tirumala, D Simig, S Ganguli, AS Morcos
arXiv preprint arXiv:2303.09540, 2023
Cited by: 58, Year: 2023
DeepStreaks: identifying fast-moving objects in the Zwicky Transient Facility data with deep learning
DA Duev, A Mahabal, Q Ye, K Tirumala, J Belicki, R Dekany, S Frederick, ...
Monthly Notices of the Royal Astronomical Society 486 (3), 4158-4165, 2019
Cited by: 39, Year: 2019
D4: Improving LLM pretraining via document de-duplication and diversification
K Tirumala, D Simig, A Aghajanyan, A Morcos
Advances in Neural Information Processing Systems 36, 2024
Cited by: 25, Year: 2024
A method for finding anomalous astronomical light curves and their analogues
JR Martínez-Galarza, FB Bianco, D Crake, K Tirumala, AA Mahabal, ...
Monthly Notices of the Royal Astronomical Society 508 (4), 5734-5756, 2021
Cited by: 21, Year: 2021
Dynatask: A Framework for Creating Dynamic AI Benchmark Tasks
T Thrush, K Tirumala, A Gupta, M Bartolo, P Rodriguez, T Kane, ...
ACL System Demos, 2022
Cited by: 10, Year: 2022
Decoding data quality via synthetic corruptions: Embedding-guided pruning of code data
Y Yang, AK Singh, M Elhoushi, A Mahmoud, K Tirumala, F Gloeckle, ...
arXiv preprint arXiv:2312.02418, 2023
Cited by: 4, Year: 2023
Ensemble machine learning methods for modeling Covid19 deaths
R Bathwal, P Chitta, K Tirumala, V Varadarajan
arXiv preprint arXiv:2010.04052, 2020
Cited by: 4, Year: 2020
Investigating Generalization by Controlling Normalized Margin
A Farhang, J Bernstein, K Tirumala, Y Liu, Y Yue
International Conference on Machine Learning, 2022
Cited by: 3, Year: 2022
The Unreasonable Ineffectiveness of the Deeper Layers
A Gromov, K Tirumala, H Shapourian, P Glorioso, DA Roberts
arXiv preprint arXiv:2403.17887, 2024
Cited by: 1, Year: 2024
Effective pruning of web-scale datasets based on complexity of concept clusters
A Abbas, E Rusak, K Tirumala, W Brendel, K Chaudhuri, AS Morcos
arXiv preprint arXiv:2401.04578, 2024
Cited by: 1, Year: 2024
WaldoInSky: Anomaly detection algorithms for time-domain astronomy
JR Martinez-Galarza, F Bianco, K Tirumala, D Crake, A Mahabal
Astrophysics Source Code Library, ascl: 2108.004, 2021
Year: 2021
A Granular Method for Finding Anomalous Light Curves and their Analogs
K Tirumala, JR Martínez-Galarza, FB Bianco, D Crake, AA Mahabal, ...
NeurIPS 2021 Workshop on Machine Learning and the Physical Sciences (NeurIPS …, 2021
Year: 2021
Generalization Bounds for MLP’s (Multilayer Perceptron)
K Tirumala