Shimon Whiteson
Shimon Whiteson
Professor of Computer Science, University of Oxford / Head of Research, Waymo UK
Dirección de correo verificada de cs.ox.ac.uk - Página principal
Título
Citado por
Citado por
Año
Learning to communicate with deep multi-agent reinforcement learning
JN Foerster, YM Assael, N De Freitas, S Whiteson
arXiv preprint arXiv:1605.06676, 2016
8852016
Counterfactual multi-agent policy gradients
J Foerster, G Farquhar, T Afouras, N Nardelli, S Whiteson
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
7942018
Qmix: Monotonic value function factorisation for deep multi-agent reinforcement learning
T Rashid, M Samvelyan, C Schroeder, G Farquhar, J Foerster, S Whiteson
International Conference on Machine Learning, 4295-4304, 2018
5092018
A survey of multi-objective sequential decision-making
DM Roijers, P Vamplew, S Whiteson, R Dazeley
Journal of Artificial Intelligence Research 48, 67-113, 2014
4202014
Stabilising experience replay for deep multi-agent reinforcement learning
J Foerster, N Nardelli, G Farquhar, T Afouras, PHS Torr, P Kohli, ...
International conference on machine learning, 1146-1155, 2017
4032017
Evolutionary Function Approximation for Reinforcement Learning
S Whiteson, P Stone
Journal of Machine Learning Research 7, 877-917, 2006
3262006
Learning with opponent-learning awareness
JN Foerster, RY Chen, M Al-Shedivat, S Whiteson, P Abbeel, I Mordatch
arXiv preprint arXiv:1709.04326, 2017
3072017
Lipnet: End-to-end sentence-level lipreading
YM Assael, B Shillingford, S Whiteson, N De Freitas
arXiv preprint arXiv:1611.01599, 2016
2082016
Multiagent reinforcement learning for urban traffic control using coordination graphs
L Kuyer, S Whiteson, B Bakker, N Vlassis
Joint European Conference on Machine Learning and Knowledge Discovery in …, 2008
2032008
The starcraft multi-agent challenge
M Samvelyan, T Rashid, CS De Witt, G Farquhar, N Nardelli, TGJ Rudner, ...
arXiv preprint arXiv:1902.04043, 2019
1732019
A theoretical and empirical analysis of Expected Sarsa
H Van Seijen, H Van Hasselt, S Whiteson, M Wiering
2009 ieee symposium on adaptive dynamic programming and reinforcement …, 2009
1722009
Fast context adaptation via meta-learning
L Zintgraf, K Shiarli, V Kurin, K Hofmann, S Whiteson
International Conference on Machine Learning, 7693-7702, 2019
1542019
Balancing exploration and exploitation in listwise and pairwise online learning to rank for information retrieval
K Hofmann, S Whiteson, M de Rijke
Information Retrieval 16 (1), 63-90, 2013
1372013
Exploiting locality of interaction in factored Dec-POMDPs
FA Oliehoek, MTJ Spaan, N Vlassis, S Whiteson
Int. Joint Conf. on Autonomous Agents and Multi-Agent Systems, 517-524, 2008
1342008
Lipnet: Sentence-level lipreading
YM Assael, B Shillingford, S Whiteson, N De Freitas
arXiv preprint arXiv:1611.01599 2 (4), 2016
1332016
Transfer via inter-task mappings in policy search reinforcement learning
ME Taylor, S Whiteson, P Stone
Proceedings of the 6th international joint conference on Autonomous agents …, 2007
1312007
Deep variational reinforcement learning for POMDPs
M Igl, L Zintgraf, TA Le, F Wood, S Whiteson
International Conference on Machine Learning, 2117-2126, 2018
1292018
Automatic feature selection in neuroevolution
S Whiteson, P Stone, KO Stanley, R Miikkulainen, N Kohl
Proceedings of the 7th annual conference on Genetic and evolutionary …, 2005
1272005
Evolving soccer keepaway players through task decomposition
S Whiteson, N Kohl, R Miikkulainen, P Stone
Machine Learning 59 (1-2), 5-30, 2005
1252005
A probabilistic method for inferring preferences from clicks
K Hofmann, S Whiteson, M De Rijke
Proceedings of the 20th ACM international conference on Information and …, 2011
1232011
El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.
Artículos 1–20