Thomas Hubert
Thomas Hubert
Google Deepmind
E-mail confirmado em google.com
Título
Citado por
Citado por
Ano
Mastering the game of go without human knowledge
D Silver, J Schrittwieser, K Simonyan, I Antonoglou, A Huang, A Guez, ...
nature 550 (7676), 354-359, 2017
66872017
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play
D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai, A Guez, M Lanctot, ...
Science 362 (6419), 1140-1144, 2018
19922018
Mastering chess and shogi by self-play with a general reinforcement learning algorithm
D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai, A Guez, M Lanctot, ...
arXiv preprint arXiv:1712.01815, 2017
12242017
Mastering atari, go, chess and shogi by planning with a learned model
J Schrittwieser, I Antonoglou, T Hubert, K Simonyan, L Sifre, S Schmitt, ...
Nature 588 (7839), 604-609, 2020
6392020
Monte-carlo tree search as regularized policy optimization
JB Grill, F Altché, Y Tang, T Hubert, M Valko, I Antonoglou, R Munos
International Conference on Machine Learning, 3769-3778, 2020
272020
Online and offline reinforcement learning by planning with a learned model
J Schrittwieser, T Hubert, A Mandhane, M Barekatain, I Antonoglou, ...
arXiv preprint arXiv:2104.06294, 2021
122021
Learning and Planning in Complex Action Spaces
T Hubert, J Schrittwieser, I Antonoglou, M Barekatain, S Schmitt, D Silver
arXiv preprint arXiv:2104.06303, 2021
92021
Approximate exploitability: Learning a best response in large games
F Timbers, E Lockhart, M Lanctot, M Schmid, J Schrittwieser, T Hubert, ...
arXiv preprint arXiv:2004.09677, 2020
62020
Lai
D Silver, J Schrittwieser, K Simonyan, I Antonoglou, A Huang, A Guez, ...
M., Bolton, A., Chen, Y., Lillicrap, T., Hui, F., Sifre, L., van den, 0
1
O sistema não pode executar a operação agora. Tente novamente mais tarde.
Artigos 1–9