Marzieh Fadaee
Senior Research Scientist, Cohere For AI
Verified email at cohere.com
Title
Cited by
Year
Data Augmentation for Low-Resource Neural Machine Translation
M Fadaee, A Bisazza, C Monz
Proceedings of the 55th Annual Meeting of the Association for Computational …, 2017
Cited by: 538, Year: 2017
Back-translation sampling by targeting difficult words in neural machine translation
M Fadaee, C Monz
arXiv preprint arXiv:1808.09006, 2018
Cited by: 82, Year: 2018
mMARCO: A Multilingual Version of the MS MARCO Passage Ranking Dataset
L Henrique Bonifacio, V Jeronymo, H Queiroz Abonizio, I Campiotti, ...
arXiv preprint arXiv:2108.13897, 2021
Cited by: 60, Year: 2021
InPars: Unsupervised dataset generation for information retrieval
L Bonifacio, H Abonizio, M Fadaee, R Nogueira
Proceedings of the 45th International ACM SIGIR Conference on Research and …, 2022
Cited by: 59, Year: 2022
InPars: Data augmentation for information retrieval using large language models
L Bonifacio, H Abonizio, M Fadaee, R Nogueira
arXiv preprint arXiv:2202.05144, 2022
Cited by: 50, Year: 2022
InPars-v2: Large language models as efficient dataset generators for information retrieval
V Jeronymo, L Bonifacio, H Abonizio, M Fadaee, R Lotufo, J Zavrel, ...
arXiv preprint arXiv:2301.01820, 2023
Cited by: 46, Year: 2023
Examining the tip of the iceberg: A data set for idiom translation
M Fadaee, A Bisazza, C Monz
arXiv preprint arXiv:1802.04681, 2018
Cited by: 33, Year: 2018
When less is more: Investigating data pruning for pretraining LLMs at scale
M Marion, A Üstün, L Pozzobon, A Wang, M Fadaee, S Hooker
arXiv preprint arXiv:2309.04564, 2023
Cited by: 24, Year: 2023
Learning Topic-Sensitive Word Representations
M Fadaee, A Bisazza, C Monz
Proceedings of the 55th Annual Meeting of the Association for Computational …, 2017
Cited by: 20, Year: 2017
No parameter left behind: How distillation and model size affect zero-shot retrieval
GM Rosa, L Bonifacio, V Jeronymo, H Abonizio, M Fadaee, R Lotufo, ...
arXiv preprint arXiv:2206.02873, 2022
Cited by: 19, Year: 2022
The unreasonable volatility of neural machine translation models
M Fadaee, C Monz
arXiv preprint arXiv:2005.12398, 2020
Cited by: 15, Year: 2020
In defense of cross-encoders for zero-shot retrieval
G Rosa, L Bonifacio, V Jeronymo, H Abonizio, M Fadaee, R Lotufo, ...
arXiv preprint arXiv:2212.06121, 2022
Cited by: 14, Year: 2022
Aya model: An instruction finetuned open-access multilingual language model
A Üstün, V Aryabumi, ZX Yong, WY Ko, D D'souza, G Onilude, N Bhandari, ...
arXiv preprint arXiv:2402.07827, 2024
Cited by: 8, Year: 2024
Aya dataset: An open-access collection for multilingual instruction tuning
S Singh, F Vargus, D Dsouza, BF Karlsson, A Mahendiran, WY Ko, ...
arXiv preprint arXiv:2402.06619, 2024
Cited by: 8, Year: 2024
Data augmentation for low-resource neural machine translation. arXiv 2017
M Fadaee, A Bisazza, C Monz
arXiv preprint arXiv:1705.00440, 2017
Cited by: 8
Automatic WordNet Construction Using Markov Chain Monte Carlo
M Fadaee, H Ghader, H Faili, A Shakery
Polibits, 13-22, 2013
Cited by: 7, Year: 2013
A New Neural Search and Insights Platform for Navigating and Organizing AI Research
M Fadaee, O Gureenkova, F Rejon-Barrera, C Schnober, W Weerkamp, ...
arXiv preprint arXiv:2011.00061, 2020
Cited by: 6, Year: 2020
InPars-v2: Large Language Models as Efficient Dataset Generators for Information Retrieval. abs/2301.01820 (2023)
V Jeronymo, LH Bonifacio, H Abonizio, M Fadaee, R de Alencar Lotufo, ...
arXiv preprint arXiv:2301.01820, 2023
Cited by: 4, Year: 2023
Examining the tip of the iceberg: A data set for idiom translation
M Fadaee, A Bisazza, C Monz
arXiv preprint arXiv:1802.04681, 2018
Cited by: 4, Year: 2018
Elo uncovered: Robustness and best practices in language model evaluation
M Boubdir, E Kim, B Ermis, S Hooker, M Fadaee
arXiv preprint arXiv:2311.17295, 2023
Cited by: 2, Year: 2023