Beyond the imitation game: Quantifying and extrapolating the capabilities of language models A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ... arXiv preprint arXiv:2206.04615, 2022 | 743 | 2022 |
Cross-attention is all you need: Adapting pretrained transformers for machine translation M Gheini, X Ren, J May EMNLP 2021, 2021 | 59* | 2021 |
ParsiNLU: A Suite of Language Understanding Challenges for Persian D Khashabi, A Cohan, S Shakeri, P Hosseini, P Pezeshkpour, M Alikhani, ... Transactions of the Association for Computational Linguistics 9, 1147-1162, 2021 | 29 | 2021 |
A universal parent model for low-resource neural machine translation transfer M Gheini, J May arXiv preprint arXiv:1909.06516, 2019 | 19 | 2019 |
Unsupervised Product Entity Resolution using Graph Representation Learning. M Gheini, M Kejriwal eCOM@ SIGIR, 2019 | 10 | 2019 |
Joint speech transcription and translation: Pseudo-labeling with out-of-distribution data M Gheini, T Likhomanenko, M Sperber, H Setiawan ACL 2023 Findings, 2022 | 4 | 2022 |
Checks and strategies for enabling code-switched machine translation T Gowda, M Gheini, J May arXiv preprint arXiv:2210.05096, 2022 | 2 | 2022 |
Know Where You're Going: Meta-Learning for Parameter-Efficient Fine-Tuning M Gheini, X Ma, J May ACL 2023 Findings, 2022 | | 2022 |