Seguir
Shimon Whiteson
Shimon Whiteson
Professor of Computer Science, University of Oxford / Head of Research, Waymo UK
Dirección de correo verificada de cs.ox.ac.uk - Página principal
Título
Citado por
Citado por
Año
Counterfactual multi-agent policy gradients
J Foerster, G Farquhar, T Afouras, N Nardelli, S Whiteson
Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018
13252018
Learning to communicate with deep multi-agent reinforcement learning
J Foerster, IA Assael, N De Freitas, S Whiteson
Advances in neural information processing systems 29, 2016
12672016
Qmix: Monotonic value function factorisation for deep multi-agent reinforcement learning
T Rashid, M Samvelyan, C Schroeder, G Farquhar, J Foerster, S Whiteson
International conference on machine learning, 4295-4304, 2018
11202018
Stabilising experience replay for deep multi-agent reinforcement learning
J Foerster, N Nardelli, G Farquhar, T Afouras, PHS Torr, P Kohli, ...
International conference on machine learning, 1146-1155, 2017
5542017
A survey of multi-objective sequential decision-making
DM Roijers, P Vamplew, S Whiteson, R Dazeley
Journal of Artificial Intelligence Research 48, 67-113, 2014
5362014
The starcraft multi-agent challenge
M Samvelyan, T Rashid, CS De Witt, G Farquhar, N Nardelli, TGJ Rudner, ...
arXiv preprint arXiv:1902.04043, 2019
4342019
Learning with opponent-learning awareness
JN Foerster, RY Chen, M Al-Shedivat, S Whiteson, P Abbeel, I Mordatch
arXiv preprint arXiv:1709.04326, 2017
4112017
Evolutionary Function Approximation for Reinforcement Learning
S Whiteson, P Stone
Journal of Machine Learning Research 7, 877-917, 2006
3472006
Lipnet: End-to-end sentence-level lipreading
YM Assael, B Shillingford, S Whiteson, N De Freitas
arXiv preprint arXiv:1611.01599, 2016
2992016
Fast context adaptation via meta-learning
L Zintgraf, K Shiarli, V Kurin, K Hofmann, S Whiteson
International Conference on Machine Learning, 7693-7702, 2019
2622019
Multiagent reinforcement learning for urban traffic control using coordination graphs
L Kuyer, S Whiteson, B Bakker, N Vlassis
Joint European Conference on Machine Learning and Knowledge Discovery in …, 2008
2542008
Deep variational reinforcement learning for POMDPs
M Igl, L Zintgraf, TA Le, F Wood, S Whiteson
International Conference on Machine Learning, 2117-2126, 2018
2132018
A theoretical and empirical analysis of Expected Sarsa
H Van Seijen, H Van Hasselt, S Whiteson, M Wiering
2009 ieee symposium on adaptive dynamic programming and reinforcement …, 2009
2082009
Maven: Multi-agent variational exploration
A Mahajan, T Rashid, M Samvelyan, S Whiteson
Advances in Neural Information Processing Systems 32, 2019
2052019
Lipnet: Sentence-level lipreading
YM Assael, B Shillingford, S Whiteson, N De Freitas
arXiv preprint arXiv:1611.01599 2 (8), 2016
1702016
A survey of reinforcement learning informed by natural language
J Luketina, N Nardelli, G Farquhar, J Foerster, J Andreas, E Grefenstette, ...
arXiv preprint arXiv:1906.03926, 2019
1682019
Balancing exploration and exploitation in listwise and pairwise online learning to rank for information retrieval
K Hofmann, S Whiteson, M de Rijke
Information Retrieval 16 (1), 63-90, 2013
1472013
Exploiting locality of interaction in factored Dec-POMDPs
FA Oliehoek, MTJ Spaan, N Vlassis, S Whiteson
Int. Joint Conf. on Autonomous Agents and Multi-Agent Systems, 517-524, 2008
1462008
Learning to communicate to solve riddles with deep distributed recurrent q-networks
JN Foerster, YM Assael, N de Freitas, S Whiteson
arXiv preprint arXiv:1602.02672, 2016
1422016
Transfer via inter-task mappings in policy search reinforcement learning
ME Taylor, S Whiteson, P Stone
Proceedings of the 6th international joint conference on Autonomous agents …, 2007
1372007
El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.
Artículos 1–20