Marzieh Fadaee

Cited by

	All	Since 2019
Citations	1042	991
h-index	12	12
i10-index	12	12

320

160

240

2016201720182019202020212022202320243 10 35 73 127 159 185 303 142

Public access

View all

2 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Christof MonzUniversity of AmsterdamVerified email at uva.nl
Rodrigo NogueiraFounder and CEO of Maritaca AIVerified email at unicamp.br
Arianna BisazzaAssociate Professor, University of GroningenVerified email at rug.nl
Sara HookerHead of Cohere For AIVerified email at cohere.com
Ahmet UstunCohere For AIVerified email at cohere.com
Jakub ZavrelZeta AlphaVerified email at zeta-alpha.com
Alex WangPhD student at New York UniversityVerified email at nyu.edu
Luiza PozzobonResearch Scholar @ Cohere For AIVerified email at g.unicamp.br
Julia KreutzerResearch Scientist at Cohere For AIVerified email at cohere.com
Beyza ErmişCohere for AIVerified email at cohere.com
Edward KimCohere AIVerified email at cohere.com
Niklas MuennighoffPeking UniversityVerified email at stu.pku.edu.cn
Phil BlunsomCohere & Oxford UniversityVerified email at cs.ox.ac.uk
Amr KayidTUMVerified email at tum.de
Zheng-Xin YongBrown UniversityVerified email at brown.edu
Shayne LongpreMIT, Stanford, AppleVerified email at cs.stanford.edu
Gbemileke OniludeGraduate Research Assistant, Carnegie Mellon UniversityVerified email at andrew.cmu.edu
Hamidreza GhaderLead NLP Scientist at Contexta360Verified email at contexta360.com
Fernando Rejon BarreraVerified email at zeta-alpha.com
Matthias Gallécohere.aiVerified email at cohere.com

Marzieh Fadaee

Senior Research Scientist, Cohere For AI

Verified email at cohere.com - Homepage

Computational Linguistics Natural language processing Deep Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Data Augmentation for Low-Resource Neural Machine Translation M Fadaee, A Bisazza, C Monz Proceedings of the 55th Annual Meeting of the Association for Computational …, 2017	543	2017
Back-translation sampling by targeting difficult words in neural machine translation M Fadaee, C Monz arXiv preprint arXiv:1808.09006, 2018	82	2018
Inpars: Unsupervised dataset generation for information retrieval L Bonifacio, H Abonizio, M Fadaee, R Nogueira Proceedings of the 45th International ACM SIGIR Conference on Research and …, 2022	62	2022
mMARCO: A Multilingual Version of the MS MARCO Passage Ranking Dataset L Henrique Bonifacio, V Jeronymo, H Queiroz Abonizio, I Campiotti, ... arXiv preprint arXiv:2108.13897, 2021	61	2021
Inpars: Data augmentation for information retrieval using large language models L Bonifacio, H Abonizio, M Fadaee, R Nogueira arXiv preprint arXiv:2202.05144, 2022	58	2022
Inpars-v2: Large language models as efficient dataset generators for information retrieval V Jeronymo, L Bonifacio, H Abonizio, M Fadaee, R Lotufo, J Zavrel, ... arXiv preprint arXiv:2301.01820, 2023	51	2023
Examining the tip of the iceberg: A data set for idiom translation M Fadaee, A Bisazza, C Monz arXiv preprint arXiv:1802.04681, 2018	35	2018
When less is more: Investigating data pruning for pretraining llms at scale M Marion, A Üstün, L Pozzobon, A Wang, M Fadaee, S Hooker arXiv preprint arXiv:2309.04564, 2023	25	2023
No parameter left behind: How distillation and model size affect zero-shot retrieval GM Rosa, L Bonifacio, V Jeronymo, H Abonizio, M Fadaee, R Lotufo, ... arXiv preprint arXiv:2206.02873, 2022	21	2022
Learning Topic-Sensitive Word Representations M Fadaee, A Bisazza, C Monz Proceedings of the 55th Annual Meeting of the Association for Computational …, 2017	20	2017
In defense of cross-encoders for zero-shot retrieval G Rosa, L Bonifacio, V Jeronymo, H Abonizio, M Fadaee, R Lotufo, ... arXiv preprint arXiv:2212.06121, 2022	15	2022
The unreasonable volatility of neural machine translation models M Fadaee, C Monz arXiv preprint arXiv:2005.12398, 2020	15	2020
Aya dataset: An open-access collection for multilingual instruction tuning S Singh, F Vargus, D Dsouza, BF Karlsson, A Mahendiran, WY Ko, ... arXiv preprint arXiv:2402.06619, 2024	9	2024
Aya model: An instruction finetuned open-access multilingual language model A Üstün, V Aryabumi, ZX Yong, WY Ko, D D'souza, G Onilude, N Bhandari, ... arXiv preprint arXiv:2402.07827, 2024	8	2024
Data augmentation for low-resource neural machine translation. arXiv 2017 M Fadaee, A Bisazza, C Monz arXiv preprint arXiv:1705.00440, 0	8
Automatic WordNet Construction Using Markov Chain Monte Carlo M Fadaee, H Ghader, H Faili, A Shakery Polibits, 13-22, 2013	7	2013
A New Neural Search and Insights Platform for Navigating and Organizing AI Research M Fadaee, O Gureenkova, F Rejon-Barrera, C Schnober, W Weerkamp, ... arXiv preprint arXiv:2011.00061, 2020	6	2020
Elo uncovered: Robustness and best practices in language model evaluation M Boubdir, E Kim, B Ermis, S Hooker, M Fadaee arXiv preprint arXiv:2311.17295, 2023	4	2023
InPars-v2: Large Language Models as Efficient Dataset Generators for Information Retrieval. abs/2301.01820 (2023) V Jeronymo, LH Bonifacio, H Abonizio, M Fadaee, R de Alencar Lotufo, ... arXiv preprint arXiv:2301.01820, 2023	4	2023
Examining the tip of the iceberg: A data set for idiom translation F Marzieh, B Arianna, M Christof arXiv preprint arXiv:1802.04681, 2018	4	2018

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors