Arthur Guez
Arthur Guez
Google DeepMind
Dirección de correo verificada de google.com - Página principal
Título
Citado por
Citado por
Año
Mastering the game of Go with deep neural networks and tree search
D Silver, A Huang, CJ Maddison, A Guez, L Sifre, G Van Den Driessche, ...
nature 529 (7587), 484-489, 2016
88002016
Mastering the game of go without human knowledge
D Silver, J Schrittwieser, K Simonyan, I Antonoglou, A Huang, A Guez, ...
nature 550 (7676), 354-359, 2017
44422017
Deep reinforcement learning with double q-learning
H Van Hasselt, A Guez, D Silver
arXiv preprint arXiv:1509.06461, 2015
26882015
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play
D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai, A Guez, M Lanctot, ...
Science 362 (6419), 1140-1144, 2018
9792018
Mastering chess and shogi by self-play with a general reinforcement learning algorithm
D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai, A Guez, M Lanctot, ...
arXiv preprint arXiv:1712.01815, 2017
8142017
Imagination-augmented agents for deep reinforcement learning
S Racanière, T Weber, D Reichert, L Buesing, A Guez, ...
Advances in neural information processing systems 30, 5690-5701, 2017
1822017
Imagination-augmented agents for deep reinforcement learning
T Weber, S Racanière, DP Reichert, L Buesing, A Guez, DJ Rezende, ...
arXiv preprint arXiv:1707.06203, 2017
1752017
The predictron: End-to-end learning and planning
D Silver, H Hasselt, M Hessel, T Schaul, A Guez, T Harley, ...
International Conference on Machine Learning, 3191-3199, 2017
1742017
Mastering atari, go, chess and shogi by planning with a learned model
J Schrittwieser, I Antonoglou, T Hubert, K Simonyan, L Sifre, S Schmitt, ...
arXiv preprint arXiv:1911.08265, 2019
1352019
Efficient Bayes-adaptive reinforcement learning using sample-based search
A Guez, D Silver, P Dayan
Advances in neural information processing systems, 1025-1033, 2012
1282012
Learning values across many orders of magnitude
HP van Hasselt, A Guez, M Hessel, V Mnih, D Silver
Advances In Neural Information Processing Systems, 4287-4295, 2016
952016
Increasing the action gap: New operators for reinforcement learning
MG Bellemare, G Ostrovski, A Guez, PS Thomas, R Munos
arXiv preprint arXiv:1512.04860, 2015
922015
Adaptive Treatment of Epilepsy via Batch-mode Reinforcement Learning.
A Guez, RD Vincent, M Avoli, J Pineau
AAAI, 1671-1678, 2008
822008
Scalable and efficient Bayes-adaptive reinforcement learning based on Monte-Carlo tree search
A Guez, D Silver, P Dayan
Journal of Artificial Intelligence Research 48, 841-883, 2013
652013
Treating epilepsy via adaptive neurostimulation: a reinforcement learning approach
J Pineau, A Guez, R Vincent, G Panuccio, M Avoli
International journal of neural systems 19 (04), 227-240, 2009
522009
Woulda, coulda, shoulda: Counterfactually-guided policy search
L Buesing, T Weber, Y Zwols, S Racaniere, A Guez, JB Lespiau, N Heess
arXiv preprint arXiv:1811.06272, 2018
442018
Learning to search with MCTSnets
A Guez, T Weber, I Antonoglou, K Simonyan, O Vinyals, D Wierstra, ...
arXiv preprint arXiv:1802.04697, 2018
442018
An investigation of model-free planning
A Guez, M Mirza, K Gregor, R Kabra, S Racanière, T Weber, D Raposo, ...
arXiv preprint arXiv:1901.03559, 2019
272019
Adaptive control of epileptiform excitability in an in vitro model of limbic seizures
G Panuccio, A Guez, R Vincent, M Avoli, J Pineau
Experimental neurology 241, 179-183, 2013
262013
Bayes-adaptive simulation-based search with value function approximation
A Guez, N Heess, D Silver, P Dayan
Advances in Neural Information Processing Systems, 451-459, 2014
182014
El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.
Artículos 1–20