Bloom: A 176b-parameter open-access multilingual language model TL Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ... arXiv preprint arXiv:2211.05100, 2022 | 487 | 2022 |
Pythia: A suite for analyzing large language models across training and scaling S Biderman, H Schoelkopf, QG Anthony, H Bradley, K O’Brien, E Hallahan, ... International Conference on Machine Learning, 2397-2430, 2023 | 90 | 2023 |
You reap what you sow: On the challenges of bias evaluation under multilingual settings Z Talat, A Névéol, S Biderman, M Clinciu, M Dey, S Longpre, S Luccioni, ... Challenges & Perspectives in Creating Large Language Models, 26, 2022 | 27 | 2022 |
Inseq: An interpretability toolkit for sequence generation models G Sarti, N Feldhus, L Sickert, O van der Wal arXiv preprint arXiv:2302.13942, 2023 | 8 | 2023 |
The grammar of emergent languages O van der Wal, S de Boer, E Bruni, D Hupkes Proceedings of the 2020 Conference on Empirical Methods in Natural Language …, 2020 | 7 | 2020 |
The Birth of Bias: A case study on the evolution of gender bias in an English language model O van der Wal, J Jumelet, K Schulz, W Zuidema arXiv preprint arXiv:2207.10245, 2022 | 5 | 2022 |
Undesirable biases in NLP: Averting a crisis of measurement O van der Wal, D Bachmann, A Leidinger, L van Maanen, W Zuidema, ... arXiv preprint arXiv:2211.13709, 2022 | 3 | 2022 |
Jelle Zuidema's webplek O van der Wal, J Jumelet, K Schulz | | |
Category: Uncategorized O van der Wal, J Jumelet, K Schulz, W Zuidema | | |