Follow
Marzieh Fadaee
Marzieh Fadaee
Senior Research Scientist, Cohere For AI
Verified email at cohere.com - Homepage
Title
Cited by
Cited by
Year
Data Augmentation for Low-Resource Neural Machine Translation
M Fadaee, A Bisazza, C Monz
Proceedings of the 55th Annual Meeting of the Association for Computational …, 2017
5362017
Back-translation sampling by targeting difficult words in neural machine translation
M Fadaee, C Monz
arXiv preprint arXiv:1808.09006, 2018
802018
mMARCO: A Multilingual Version of the MS MARCO Passage Ranking Dataset
L Henrique Bonifacio, V Jeronymo, H Queiroz Abonizio, I Campiotti, ...
arXiv preprint arXiv:2108.13897, 2021
552021
Inpars: Unsupervised dataset generation for information retrieval
L Bonifacio, H Abonizio, M Fadaee, R Nogueira
Proceedings of the 45th International ACM SIGIR Conference on Research and …, 2022
502022
Inpars: Data augmentation for information retrieval using large language models
L Bonifacio, H Abonizio, M Fadaee, R Nogueira
arXiv preprint arXiv:2202.05144, 2022
422022
InPars-v2: Large Language Models as Efficient Dataset Generators for Information Retrieval
V Jeronymo, L Bonifacio, H Abonizio, M Fadaee, R Lotufo, J Zavrel, ...
arXiv preprint arXiv:2301.01820, 2023
412023
Examining the tip of the iceberg: A data set for idiom translation
M Fadaee, A Bisazza, C Monz
arXiv preprint arXiv:1802.04681, 2018
332018
Learning Topic-Sensitive Word Representations
M Fadaee, A Bisazza, C Monz
Proceedings of the 55th Annual Meeting of the Association for Computational …, 2017
202017
When less is more: Investigating data pruning for pretraining llms at scale
M Marion, A Üstün, L Pozzobon, A Wang, M Fadaee, S Hooker
arXiv preprint arXiv:2309.04564, 2023
152023
The unreasonable volatility of neural machine translation models
M Fadaee, C Monz
arXiv preprint arXiv:2005.12398, 2020
152020
No parameter left behind: How distillation and model size affect zero-shot retrieval
GM Rosa, L Bonifacio, V Jeronymo, H Abonizio, M Fadaee, R Lotufo, ...
arXiv preprint arXiv:2206.02873, 2022
142022
In defense of cross-encoders for zero-shot retrieval
G Rosa, L Bonifacio, V Jeronymo, H Abonizio, M Fadaee, R Lotufo, ...
arXiv preprint arXiv:2212.06121, 2022
122022
Data augmentation for low-resource neural machine translation. arXiv 2017
M Fadaee, A Bisazza, C Monz
arXiv preprint arXiv:1705.00440, 0
8
Automatic WordNet Construction Using Markov Chain Monte Carlo
M Fadaee, H Ghader, H Faili, A Shakery
Polibits, 13-22, 2013
72013
A New Neural Search and Insights Platform for Navigating and Organizing AI Research
M Fadaee, O Gureenkova, F Rejon-Barrera, C Schnober, W Weerkamp, ...
arXiv preprint arXiv:2011.00061, 2020
42020
Examining the tip of the iceberg: A data set for idiom translation
F Marzieh, B Arianna, M Christof
arXiv preprint arXiv:1802.04681, 2018
42018
InPars-v2: Large Language Models as Efficient Dataset Generators for Information Retrieval. CoRR abs/2301.01820 (2023)
V Jeronymo, LH Bonifacio, H Abonizio, M Fadaee, R de Alencar Lotufo, ...
32023
Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning
S Singh, F Vargus, D Dsouza, BF Karlsson, A Mahendiran, WY Ko, ...
arXiv preprint arXiv:2402.06619, 2024
12024
Elo uncovered: Robustness and best practices in language model evaluation
M Boubdir, E Kim, B Ermis, S Hooker, M Fadaee
arXiv preprint arXiv:2311.17295, 2023
12023
Understanding and enhancing the use of context for machine translation
M Fadaee
12020
The system can't perform the operation now. Try again later.
Articles 1–20