Empirical evaluation methods for multiobjective reinforcement learning algorithms P Vamplew, R Dazeley, A Berry, R Issabekov, E Dekker Machine learning 84, 51-80, 2011 | 370 | 2011 |
Steering approaches to Pareto-optimal multiobjective reinforcement learning P Vamplew, R Issabekov, R Dazeley, C Foale, A Berry, T Moore, ... Neurocomputing 263, 26-38, 2017 | 36 | 2017 |
An empirical comparison of two common multiobjective reinforcement learning algorithms R Issabekov, P Vamplew AI 2012: Advances in Artificial Intelligence: 25th Australasian Joint …, 2012 | 25 | 2012 |
MORL-Glue: A benchmark suite for multi-objective reinforcement learning P Vamplew, D Webb, LM Zintgraf, DM Roijers, R Dazeley, R Issabekov, ... 29th Benelux Conference on Artificial Intelligence November 8–9, 2017 …, 2017 | 10 | 2017 |
Reinforcement learning of Pareto-optimal multiobjective policies using steering P Vamplew, R Issabekov, R Dazeley, C Foale AI 2015: Advances in Artificial Intelligence: 28th Australasian Joint …, 2015 | 9 | 2015 |