Keshav Santhanam
Verified email at stanford.edu
Title
Cited by
Year
On the opportunities and risks of foundation models
R Bommasani, DA Hudson, E Adeli, R Altman, S Arora, S von Arx, ...
arXiv preprint arXiv:2108.07258, 2021
Cited by: 2579, Year: 2021
Holistic evaluation of language models
P Liang, R Bommasani, T Lee, D Tsipras, D Soylu, M Yasunaga, Y Zhang, ...
arXiv preprint arXiv:2211.09110, 2022
Cited by: 567, Year: 2022
ColBERTv2: Effective and efficient retrieval via lightweight late interaction
K Santhanam, O Khattab, J Saad-Falcon, C Potts, M Zaharia
arXiv preprint arXiv:2112.01488, 2021
Cited by: 234, Year: 2021
Heterogeneity-aware cluster scheduling policies for deep learning workloads
D Narayanan, K Santhanam, F Kazhamiaka, A Phanishayee, M Zaharia
14th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2020
Cited by: 171, Year: 2020
Demonstrate-search-predict: Composing retrieval and language models for knowledge-intensive NLP
O Khattab, K Santhanam, XL Li, D Hall, P Liang, C Potts, M Zaharia
arXiv preprint arXiv:2212.14024, 2022
Cited by: 100, Year: 2022
On the opportunities and risks of foundation models. arXiv 2021
R Bommasani, DA Hudson, E Adeli, R Altman, S Arora, S von Arx, ...
arXiv preprint arXiv:2108.07258, 2023
Cited by: 56, Year: 2023
Accelerating deep learning workloads through efficient multi-model execution
D Narayanan, K Santhanam, A Phanishayee, M Zaharia
NeurIPS Workshop on Systems for Machine Learning 20, 2018
Cited by: 56, Year: 2018
PLAID: an efficient engine for late interaction retrieval
K Santhanam, O Khattab, C Potts, M Zaharia
Proceedings of the 31st ACM International Conference on Information …, 2022
Cited by: 37, Year: 2022
On the opportunities and risks of foundation models (2021)
R Bommasani, DA Hudson, E Adeli, R Altman, S Arora, S von Arx, ...
arXiv preprint arXiv:2108.07258, 2022
Cited by: 34, Year: 2022
Analysis and exploitation of dynamic pricing in the public cloud for ML training
D Narayanan, K Santhanam, F Kazhamiaka, A Phanishayee, M Zaharia
VLDB DISPA Workshop 2020, 2020
Cited by: 23, Year: 2020
DSPy: Compiling declarative language model calls into self-improving pipelines
O Khattab, A Singhvi, P Maheshwari, Z Zhang, K Santhanam, ...
arXiv preprint arXiv:2310.03714, 2023
Cited by: 18, Year: 2023
ROLA: A New Distributed Transaction Protocol and Its Formal Analysis.
S Liu, PC Ölveczky, K Santhanam, Q Wang, I Gupta, J Meseguer
FASE, 77-93, 2018
Cited by: 18, Year: 2018
Accelerating model search with model batching
D Narayanan, K Santhanam, M Zaharia
1st Conference on Systems and Machine Learning (SysML), SysML 18, 2018
Cited by: 9, Year: 2018
DistIR: An intermediate representation for optimizing distributed neural networks
K Santhanam, S Krishna, R Tomioka, A Fitzgibbon, T Harris
Proceedings of the 1st Workshop on Machine Learning and Systems, 15-23, 2021
Cited by: 6, Year: 2021
Cheaply estimating inference efficiency metrics for autoregressive transformer models
D Narayanan, K Santhanam, P Henderson, R Bommasani, T Lee, ...
Advances in Neural Information Processing Systems 36, 2024
Cited by: 5, Year: 2024
Moving beyond downstream task accuracy for information retrieval benchmarking
K Santhanam, J Saad-Falcon, M Franz, O Khattab, A Sil, R Florian, ...
arXiv preprint arXiv:2212.01340, 2022
Cited by: 5, Year: 2022
UDAPDR: unsupervised domain adaptation via LLM prompting and distillation of rerankers
J Saad-Falcon, O Khattab, K Santhanam, R Florian, M Franz, S Roukos, ...
arXiv preprint arXiv:2303.00807, 2023
Cited by: 4, Year: 2023
Cheaply evaluating inference efficiency metrics for autoregressive transformer APIs
D Narayanan, K Santhanam, P Henderson, R Bommasani, T Lee, P Liang
arXiv preprint arXiv:2305.02440, 2023
Cited by: 3, Year: 2023
DistIR: An intermediate representation and simulator for efficient neural network distribution
K Santhanam, S Krishna, R Tomioka, T Harris, M Zaharia
arXiv preprint arXiv:2111.05426, 2021
Cited by: 2, Year: 2021
DSPy: Compiling Declarative Language Model Calls into State-of-the-Art Pipelines
O Khattab, A Singhvi, P Maheshwari, Z Zhang, K Santhanam, S Haq, ...
The Twelfth International Conference on Learning Representations, 2023
Cited by: 1, Year: 2023