Should You Mask 15% in Masked Language Modeling? A Wettig, T Gao, Z Zhong, D Chen Conference of the European Chapter of the Association for Computational …, 2023 | 138 | 2023 |
SWE-bench: Can language models resolve real-world GitHub issues? CE Jimenez, J Yang, A Wettig, S Yao, K Pei, O Press, K Narasimhan International Conference on Learning Representations (ICLR), 2024 | 134 | 2024 |
Adapting language models to compress contexts A Chevalier, A Wettig, A Ajith, D Chen Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023 | 86 | 2023 |
A Kernel-Based View of Language Model Fine-Tuning S Malladi, A Wettig, D Yu, D Chen, S Arora International Conference on Machine Learning (ICML), 2023 | 46 | 2023 |
SWE-agent: Agent-Computer Interfaces Enable Automated Software Engineering J Yang, CE Jimenez, A Wettig, K Lieret, S Yao, K Narasimhan, O Press | 45* | 2024 |
Phrase Retrieval Learns Passage Retrieval, Too J Lee, A Wettig, D Chen Conference on Empirical Methods in Natural Language Processing (EMNLP), 2021 | 43 | 2021 |
Poisoning retrieval corpora by injecting adversarial passages Z Zhong, Z Huang, A Wettig, D Chen Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023 | 26 | 2023 |
Learning Transformer Programs D Friedman, A Wettig, D Chen NeurIPS, 2023 | 23 | 2023 |
QuRating: Selecting High-Quality Data for Training Language Models A Wettig, A Gupta, S Malik, D Chen International Conference on Machine Learning (ICML), 2024 | 10 | 2024 |
Finding Dataset Shortcuts with Grammar Induction D Friedman, A Wettig, D Chen Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022 | 10 | 2022 |
Language Models as Science Tutors A Chevalier, J Geng, A Wettig, H Chen, S Mizera, T Annala, MJ Aragon, ... International Conference on Machine Learning (ICML), 2024 | 6 | 2024 |
SWE-bench: Can Language Models Resolve Real-World GitHub Issues? (2023) CE Jimenez, J Yang, A Wettig, S Yao, K Pei, O Press, K Narasimhan arXiv preprint cs.CL/2310.06770, 2023 | 5 | 2023 |
Finding Transformer Circuits with Edge Pruning A Bhaskar, A Wettig, D Friedman, D Chen arXiv preprint arXiv:2406.16778, 2024 | 1 | 2024 |
OLMoE: Open Mixture-of-Experts Language Models N Muennighoff, L Soldaini, D Groeneveld, K Lo, J Morrison, S Min, W Shi, ... arXiv preprint arXiv:2409.02060, 2024 | | 2024 |