Seguir
Siamak Shakeri
Siamak Shakeri
E-mail confirmado em google.com
Título
Citado por
Citado por
Ano
Palm 2 technical report
R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ...
arXiv preprint arXiv:2305.10403, 2023
7812023
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models
A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ...
arXiv preprint arXiv:2206.04615, 2022
7322022
Gemini: a family of highly capable multimodal models
G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ...
arXiv preprint arXiv:2312.11805, 2023
4572023
Ul2: Unifying language learning paradigms
Y Tay, M Dehghani, VQ Tran, X Garcia, J Wei, X Wang, HW Chung, ...
arXiv preprint arXiv:2205.05131, 2022
2702022
Knowledge graph based synthetic corpus generation for knowledge-enhanced language model pre-training
O Agarwal, H Ge, S Shakeri, R Al-Rfou
arXiv preprint arXiv:2010.12688, 2020
1842020
End-to-end synthetic data generation for domain adaptation of question answering systems
S Shakeri, C dos Santos, H Zhu, P Ng, F Nan, Z Wang, R Nallapati, ...
Proceedings of the 2020 Conference on Empirical Methods in Natural Language …, 2020
902020
Pali-x: On scaling up a multilingual vision and language model
X Chen, J Djolonga, P Padlewski, B Mustafa, S Changpinyo, J Wu, ...
arXiv preprint arXiv:2305.18565, 2023
832023
Sunipa Dev
R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ...
Jacob Devlin, Mark Díaz, Nan Du, Ethan Dyer, Vladimir Feinberg, Fangxiaoyu …, 2023
512023
Transcending scaling laws with 0.1% extra compute
Y Tay, J Wei, HW Chung, VQ Tran, DR So, S Shakeri, X Garcia, HS Zheng, ...
arXiv preprint arXiv:2210.11399, 2022
492022
Embedding-based zero-shot retrieval through query generation
D Liang, P Xu, S Shakeri, CN Santos, R Nallapati, Z Huang, B Xiang
arXiv preprint arXiv:2009.10270, 2020
372020
Machine translation aided bilingual data-to-text generation and semantic parsing
O Agarwal, M Kale, H Ge, S Shakeri, R Al-Rfou
Proceedings of the 3rd international workshop on natural language generation …, 2020
322020
Gemma: Open models based on gemini research and technology
G Team, T Mesnard, C Hardin, R Dadashi, S Bhupatiraju, S Pathak, ...
arXiv preprint arXiv:2403.08295, 2024
292024
ParsiNLU: A Suite of Language Understanding Challenges for Persian
D Khashabi, A Cohan, S Shakeri, P Hosseini, P Pezeshkpour, M Alikhani, ...
Transactions of the Association for Computational Linguistics 9, 1147-1162, 2021
252021
Enct5: Fine-tuning t5 encoder for non-autoregressive tasks
F Liu, S Shakeri, H Yu, J Li
arXiv preprint arXiv:2110.08426 2, 2021
162021
Towards zero-shot multilingual synthetic question and answer generation for cross-lingual reading comprehension
S Shakeri, N Constant, MS Kale, L Xue
arXiv preprint arXiv:2010.12008, 2020
162020
Characterizing attribution and fluency tradeoffs for retrieval-augmented large language models
R Aksitov, CC Chang, D Reitter, S Shakeri, Y Sung
arXiv preprint arXiv:2302.05578, 2023
152023
Palm 2 technical report. arXiv 2023
R Anil, AM Dai, O Firat, M Johnson, D Lepikhin, A Passos, S Shakeri, ...
arXiv preprint arXiv:2305.10403, 0
13
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models
SU Toshniwal, S Debnath, S Shakeri, S Thormeyer, S Melzi, S Reddy, ...
ArXiv, abs/2206.04615, 2022
112022
Reducing retraining by recycling parameter-efficient prompts
B Lester, J Yurtsever, S Shakeri, N Constant
arXiv preprint arXiv:2208.05577, 2022
102022
Knowledge distillation in document retrieval
S Shakeri, A Sethy, C Cheng
arXiv preprint arXiv:1911.11065, 2019
102019
O sistema não pode executar a operação agora. Tente novamente mais tarde.
Artigos 1–20