Multi-step reinforcement learning: A unifying algorithm K De Asis, JF Hernandez-Garcia, GZ Holland, RS Sutton Thirty-Second AAAI Conference on Artificial Intelligence, 2018 | 128 | 2018 |
The Effect of Planning Shape on Dyna-style Planning in High-dimensional State Spaces GZ Holland, E Talvitie, M Bowling arXiv preprint arXiv:1806.01825, 2018 | 50 | 2018 |
Player of games M Schmid, M Moravcik, N Burch, R Kadlec, J Davidson, K Waugh, N Bard, ... arXiv preprint arXiv:2112.03178, 2021 | 48 | 2021 |
Reward-respecting subtasks for model-based reinforcement learning RS Sutton, MC Machado, GZ Holland, D Szepesvari, F Timbers, B Tanner, ... Artificial Intelligence 324, 104001, 2023 | 19 | 2023 |
Student of Games: A unified learning algorithm for both perfect and imperfect information games M Schmid, M Moravčík, N Burch, R Kadlec, J Davidson, K Waugh, N Bard, ... Science Advances 9 (46), eadg3256, 2023 | 4 | 2023 |