Seguir
Sergio Gómez Colmenarejo
Sergio Gómez Colmenarejo
Research Engineer, DeepMind
Dirección de correo verificada de google.com
Título
Citado por
Citado por
Año
Learning to learn by gradient descent by gradient descent
M Andrychowicz, M Denil, S Gomez, MW Hoffman, D Pfau, T Schaul, ...
Advances in neural information processing systems 29, 2016
17462016
Hybrid computing using a neural network with dynamic external memory
A Graves, G Wayne, M Reynolds, T Harley, I Danihelka, ...
Nature 538 (7626), 471-476, 2016
15492016
Policy distillation
AA Rusu, SG Colmenarejo, C Gulcehre, G Desjardins, J Kirkpatrick, ...
arXiv preprint arXiv:1511.06295, 2015
5362015
Learning to learn without gradient descent by gradient descent
Y Chen, MW Hoffman, SG Colmenarejo, M Denil, TP Lillicrap, M Botvinick, ...
International Conference on Machine Learning, 748-756, 2017
244*2017
Learned optimizers that scale and generalize
O Wichrowska, N Maheswaranathan, MW Hoffman, SG Colmenarejo, ...
International Conference on Machine Learning, 3751-3760, 2017
2152017
Parallel multiscale autoregressive density estimation
S Reed, A Oord, N Kalchbrenner, SG Colmenarejo, Z Wang, Y Chen, ...
International conference on machine learning, 2912-2921, 2017
1952017
Acme: A research framework for distributed reinforcement learning
M Hoffman, B Shahriari, J Aslanides, G Barth-Maron, F Behbahani, ...
arXiv preprint arXiv:2006.00979, 2020
1292020
A generalist agent
S Reed, K Zolna, E Parisotto, SG Colmenarejo, A Novikov, G Barth-Maron, ...
arXiv preprint arXiv:2205.06175, 2022
1002022
Scaling data-driven robotics with reward sketching and batch reinforcement learning
S Cabi, SG Colmenarejo, A Novikov, K Konyushkova, S Reed, R Jeong, ...
arXiv preprint arXiv:1909.12200, 2019
90*2019
Rl unplugged: A suite of benchmarks for offline reinforcement learning
C Gulcehre, Z Wang, A Novikov, T Paine, S Gómez, K Zolna, R Agarwal, ...
Advances in Neural Information Processing Systems 33, 7248-7259, 2020
87*2020
Programmable agents
M Denil, SG Colmenarejo, S Cabi, D Saxton, N de Freitas
arXiv preprint arXiv:1706.06383, 2017
642017
Learning awareness models
B Amos, L Dinh, S Cabi, T Rothörl, SG Colmenarejo, A Muldal, T Erez, ...
arXiv preprint arXiv:1804.06318, 2018
422018
The intentional unintentional agent: Learning to solve many continuous control tasks simultaneously
S Cabi, SG Colmenarejo, MW Hoffman, M Denil, Z Wang, N Freitas
Conference on Robot Learning, 207-216, 2017
352017
Task-relevant adversarial imitation learning
K Zolna, S Reed, A Novikov, SG Colmenarejo, D Budden, S Cabi, M Denil, ...
Conference on Robot Learning, 247-263, 2021
302021
TF-Replicator: Distributed machine learning for researchers
P Buchlovsky, D Budden, D Grewe, C Jones, J Aslanides, F Besse, ...
arXiv preprint arXiv:1902.00465, 2019
222019
One-shot high-fidelity imitation: Training large-scale deep nets with rl
TL Paine, SG Colmenarejo, Z Wang, S Reed, Y Aytar, T Pfaff, ...
arXiv preprint arXiv:1810.05017, 2018
192018
Regularized behavior value estimation
C Gulcehre, SG Colmenarejo, Z Wang, J Sygnowski, T Paine, K Zolna, ...
arXiv preprint arXiv:2103.09575, 2021
162021
StarCraft II Unplugged: Large Scale Offline Reinforcement Learning
M Mathieu, S Ozair, S Srinivasan, C Gulcehre, S Zhang, R Jiang, ...
Deep RL Workshop NeurIPS 2021, 2021
42021
RL Unplugged: A Collection of Benchmarks for Offline Reinforcement Learning
Ç Gülçehre, Z Wang, A Novikov, T Paine, SG Colmenarejo, K Zolna, ...
NeurIPS, 2020
42020
Visual imitation with a minimal adversary
S Reed, Y Aytar, Z Wang, T Paine, A van den Oord, T Pfaff, S Gomez, ...
22018
El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.
Artículos 1–20