Bloom: A 176b-parameter open-access multilingual language model BS Workshop, TL Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, ... arXiv preprint arXiv:2211.05100, 2022 | 721 | 2022 |
Pythia: A suite for analyzing large language models across training and scaling S Biderman, H Schoelkopf, QG Anthony, H Bradley, K O’Brien, E Hallahan, ... International Conference on Machine Learning, 2397-2430, 2023 | 161 | 2023 |
You reap what you sow: On the challenges of bias evaluation under multilingual settings Z Talat, A Névéol, S Biderman, M Clinciu, M Dey, S Longpre, S Luccioni, ... Challenges & Perspectives in Creating Large Language Models, 26, 2022 | 37 | 2022 |
Inseq: An interpretability toolkit for sequence generation models G Sarti, N Feldhus, L Sickert, O van der Wal arXiv preprint arXiv:2302.13942, 2023 | 15 | 2023 |
The grammar of emergent languages O van der Wal, S de Boer, E Bruni, D Hupkes Proceedings of the 2020 Conference on Empirical Methods in Natural Language …, 2020 | 8 | 2020 |
Undesirable biases in NLP: Averting a crisis of measurement O Van der Wal, D Bachmann, A Leidinger, L van Maanen, W Zuidema, ... arXiv preprint arXiv:2211.13709, 2022 | 7 | 2022 |
The Birth of Bias: A case study on the evolution of gender bias in an English language model O van der Wal, J Jumelet, K Schulz, W Zuidema arXiv preprint arXiv:2207.10245, 2022 | 7 | 2022 |
Identifying and Adapting Transformer-Components Responsible for Gender Bias in an English Language Model A Chintam, R Beloch, W Zuidema, M Hanna, O van der Wal arXiv preprint arXiv:2310.12611, 2023 | | 2023 |
ChapGTP, ILLC's Attempt at Raising a BabyLM: Improving Data Efficiency by Automatic Task Formation J Jumelet, M Hanna, MH Kloots, A Langedijk, C Pouw, O van der Wal arXiv preprint arXiv:2310.11282, 2023 | | 2023 |
Jelle Zuidema's webplek O van der Wal, J Jumelet, K Schulz | | |
Category: Uncategorized O van der Wal, J Jumelet, K Schulz, W Zuidema | | |