Marlos C. Machado
Marlos C. Machado
DeepMind, Amii, and University of Alberta
Dirección de correo verificada de google.com - Página principal
Título
Citado por
Citado por
Año
Revisiting the Arcade Learning Environment: Evaluation Protocols and Open Problems for General Agents
MC Machado, MG Bellemare, E Talvitie, J Veness, M Hausknecht, ...
Journal of Artificial Intelligence Research 61, 523-562, 2018
2892018
A laplacian framework for option discovery in reinforcement learning
MC Machado, MG Bellemare, M Bowling
Proceedings of the 34th International Conference on Machine Learning-Volume …, 2017
1552017
State of the art control of atari games using shallow reinforcement learning
Y Liang, MC Machado, E Talvitie, M Bowling
Proceedings of the 2016 International Conference on Autonomous Agents …, 2016
1012016
True online temporal-difference learning
H Van Seijen, AR Mahmood, PM Pilarski, MC Machado, RS Sutton
The Journal of Machine Learning Research 17 (1), 5057-5096, 2016
792016
Eigenoption Discovery through the Deep Successor Representation
MC Machado, C Rosenbaum, X Guo, M Liu, G Tesauro, M Campbell
arXiv preprint arXiv:1710.11089, 2017
662017
Generalization and Regularization in DQN
J Farebrother, MC Machado, M Bowling
arXiv preprint arXiv:1810.00123, 2018
622018
Count-based exploration with the successor representation
MC Machado, MG Bellemare, M Bowling
Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 5125-5133, 2020
522020
Player modeling: Towards a common taxonomy
MC Machado, EPC Fantini, L Chaimowicz
2011 16th international conference on computer games (CGAMES), 50-57, 2011
482011
On Bonus Based Exploration Methods In The Arcade Learning Environment
AA Taiga, W Fedus, MC Machado, A Courville, MG Bellemare
41*2020
Autonomous navigation of stratospheric balloons using reinforcement learning
MG Bellemare, S Candido, PS Castro, J Gong, MC Machado, S Moitra, ...
Nature 588 (7836), 77-82, 2020
352020
Learning Purposeful Behaviour in the Absence of Rewards
MC Machado, M Bowling
arXiv preprint arXiv:1605.07700, 2016
26*2016
Domain-independent optimistic initialization for reinforcement learning
MC Machado, S Srinivasan, M Bowling
Workshops at the Twenty-Ninth AAAI Conference on Artificial Intelligence, 2015
202015
Contrastive Behavioral Similarity Embeddings for Generalization in Reinforcement Learning
R Agarwal, MC Machado, PS Castro, MG Bellemare
arXiv preprint arXiv:2101.05265, 2021
112021
Exploration in reinforcement learning with deep covering options
Y Jinnai, JW Park, MC Machado, G Konidaris
International Conference on Learning Representations, 2020
112020
The Eigenoption-Critic Framework
M Liu, MC Machado, G Tesauro, M Campbell
arXiv preprint arXiv:1712.04065, 2017
102017
Introspective agents: Confidence measures for general value functions
C Sherstan, A White, MC Machado, PM Pilarski
International Conference on Artificial General Intelligence, 258-261, 2016
92016
Combining metaheuristics and csp algorithms to solve sudoku
MC Machado, L Chaimowicz
2011 Brazilian Symposium on Games and Digital Entertainment, 124-131, 2011
92011
A binary classification approach for automatic preference modeling of virtual agents in Civilization IV
MC Machado, GL Pappa, L Chaimowicz
2012 IEEE Conference on Computational Intelligence and Games (CIG), 155-162, 2012
82012
Accelerating learning in constructive predictive frameworks with the successor representation
C Sherstan, MC Machado, PM Pilarski
2018 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2018
62018
Rtsmate: Towards an advice system for rts games
RLDF Cunha, MC Machado, L Chaimowicz
Computers in Entertainment (CIE) 12 (1), 1-20, 2015
62015
El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.
Artículos 1–20