Shimon Whiteson
Shimon Whiteson
Professor of Computer Science, University of Oxford / Head of Research, Waymo UK
E-mail confirmado em cs.ox.ac.uk - Página inicial
Título
Citado por
Citado por
Ano
Learning to communicate with deep multi-agent reinforcement learning
J Foerster, IA Assael, N De Freitas, S Whiteson
Advances in neural information processing systems, 2137-2145, 2016
6412016
Counterfactual multi-agent policy gradients
J Foerster, G Farquhar, T Afouras, N Nardelli, S Whiteson
arXiv preprint arXiv:1705.08926, 2017
5142017
A survey of multi-objective sequential decision-making
DM Roijers, P Vamplew, S Whiteson, R Dazeley
Journal of Artificial Intelligence Research 48, 67-113, 2014
3362014
Evolutionary Function Approximation for Reinforcement Learning
S Whiteson, P Stone
Journal of Machine Learning Research 7, 877-917, 2006
3122006
Stabilising experience replay for deep multi-agent reinforcement learning
J Foerster, N Nardelli, G Farquhar, T Afouras, PHS Torr, P Kohli, ...
arXiv preprint arXiv:1702.08887, 2017
2942017
QMIX: Monotonic value function factorisation for deep multi-agent reinforcement learning
T Rashid, M Samvelyan, CS De Witt, G Farquhar, J Foerster, S Whiteson
arXiv preprint arXiv:1803.11485, 2018
2552018
Learning with opponent-learning awareness
JN Foerster, RY Chen, M Al-Shedivat, S Whiteson, P Abbeel, I Mordatch
arXiv preprint arXiv:1709.04326, 2017
2082017
Multiagent reinforcement learning for urban traffic control using coordination graphs
L Kuyer, S Whiteson, B Bakker, N Vlassis
Joint European Conference on Machine Learning and Knowledge Discovery in …, 2008
1772008
Lipnet: End-to-end sentence-level lipreading
YM Assael, B Shillingford, S Whiteson, N De Freitas
arXiv preprint arXiv:1611.01599, 2016
1482016
A theoretical and empirical analysis of Expected Sarsa
H Van Seijen, H Van Hasselt, S Whiteson, M Wiering
2009 ieee symposium on adaptive dynamic programming and reinforcement …, 2009
1332009
Transfer via inter-task mappings in policy search reinforcement learning
ME Taylor, S Whiteson, P Stone
Proceedings of the 6th international joint conference on Autonomous agents …, 2007
1262007
Balancing exploration and exploitation in listwise and pairwise online learning to rank for information retrieval
K Hofmann, S Whiteson, M de Rijke
Information Retrieval 16 (1), 63-90, 2013
1232013
Automatic feature selection in neuroevolution
S Whiteson, P Stone, KO Stanley, R Miikkulainen, N Kohl
Proceedings of the 7th annual conference on Genetic and evolutionary …, 2005
1232005
Exploiting locality of interaction in factored Dec-POMDPs
FA Oliehoek, MTJ Spaan, N Vlassis, S Whiteson
Int. Joint Conf. on Autonomous Agents and Multi-Agent Systems, 517-524, 2008
1222008
Comparing evolutionary and temporal difference methods in a reinforcement learning domain
ME Taylor, S Whiteson, P Stone
Proceedings of the 8th annual conference on Genetic and evolutionary …, 2006
1182006
Evolving soccer keepaway players through task decomposition
S Whiteson, N Kohl, R Miikkulainen, P Stone
Machine Learning 59 (1-2), 5-30, 2005
1162005
A probabilistic method for inferring preferences from clicks
K Hofmann, S Whiteson, M De Rijke
Proceedings of the 20th ACM international conference on Information and …, 2011
1122011
Lipnet: Sentence-level lipreading
YM Assael, B Shillingford, S Whiteson, N de Freitas
arXiv preprint arXiv:1611.01599 2 (8), 2016
1082016
Adaptive tile coding for value function approximation
S Whiteson
1062007
Learning to communicate to solve riddles with deep distributed recurrent q-networks
JN Foerster, YM Assael, N de Freitas, S Whiteson
arXiv preprint arXiv:1602.02672, 2016
932016
O sistema não pode executar a operação agora. Tente novamente mais tarde.
Artigos 1–20