Charlie Beattie
Software Engineer, DeepMind
Verified email at google.com
Title
Cited by
Year
Human-level control through deep reinforcement learning
V Mnih, K Kavukcuoglu, D Silver, AA Rusu, J Veness, MG Bellemare, ...
Nature 518 (7540), 529-533, 2015
Cited by: 12282; Year: 2015
Massively parallel methods for deep reinforcement learning
A Nair, P Srinivasan, S Blackwell, C Alcicek, R Fearon, A De Maria, ...
arXiv preprint arXiv:1507.04296, 2015
Cited by: 301; Year: 2015
Human-level performance in 3D multiplayer games with population-based reinforcement learning
M Jaderberg, WM Czarnecki, I Dunning, L Marris, G Lever, AG Castaneda, ...
Science 364 (6443), 859-865, 2019
Cited by: 279; Year: 2019
Vector-based navigation using grid-like representations in artificial agents
A Banino, C Barry, B Uria, C Blundell, T Lillicrap, P Mirowski, A Pritzel, ...
Nature 557 (7705), 429-433, 2018
Cited by: 277; Year: 2018
DeepMind Lab
C Beattie, JZ Leibo, D Teplyashin, T Ward, M Wainwright, H Küttler, ...
arXiv preprint arXiv:1612.03801, 2016
Cited by: 261; Year: 2016
A multi-agent reinforcement learning model of common-pool resource appropriation
J Perolat, JZ Leibo, V Zambaldi, C Beattie, K Tuyls, T Graepel
Advances in Neural Information Processing Systems, 3643-3652, 2017
Cited by: 81; Year: 2017
Psychlab: a psychology laboratory for deep reinforcement learning agents
JZ Leibo, CM d'Autume, D Zoran, D Amos, C Beattie, K Anderson, ...
arXiv preprint arXiv:1801.08116, 2018
Cited by: 30; Year: 2018
Uncovering surprising behaviors in reinforcement learning via worst-case analysis
A Ruderman, R Everett, B Sikder, H Soyer, J Uesato, A Kumar, C Beattie, ...
Cited by: 5; Year: 2018
Bellemare Marc G, Alex Graves, Martin Riedmiller, Andreas K
V Mnih, K Kavukcuoglu, D Silver, AA Rusu, J Veness
Fidjeland, Georg Ostrovski, Stig Petersen, Charles Beattie, Amir Sadik …, 2015
Cited by: 5; Year: 2015
DeepMind Lab2D
C Beattie, T Köppe, EA Duéñez-Guzmán, JZ Leibo
arXiv preprint arXiv:2011.07027, 2020
Year: 2020
Vector-based Navigation using Grid-like Representations in Artificial Agents.
A Pritzel, A Banino, B Uria, BC Zhang, C Barry, C Blundell, C Beattie, ...
Year: 2018
Human-level performance in first-person multiplayer games with …
M Jaderberg, WM Czarnecki, I Dunning, L Marris, G Lever, AG Castaneda, ...
Articles 1–12