Sergio Gómez Colmenarejo
Research Engineer, DeepMind
Verified email at google.com
Title
Cited by
Year
Learning to learn by gradient descent by gradient descent
M Andrychowicz, M Denil, S Gomez, MW Hoffman, D Pfau, T Schaul, ...
Advances in neural information processing systems 29, 2016
2240 2016
Hybrid computing using a neural network with dynamic external memory
A Graves, G Wayne, M Reynolds, T Harley, I Danihelka, ...
Nature 538 (7626), 471-476, 2016
1837 2016
Policy distillation
AA Rusu, SG Colmenarejo, C Gulcehre, G Desjardins, J Kirkpatrick, ...
arXiv preprint arXiv:1511.06295, 2015
740 2015
A generalist agent
S Reed, K Zolna, E Parisotto, SG Colmenarejo, A Novikov, G Barth-Maron, ...
arXiv preprint arXiv:2205.06175, 2022
667 2022
Learning to learn without gradient descent by gradient descent
Y Chen, MW Hoffman, SG Colmenarejo, M Denil, TP Lillicrap, M Botvinick, ...
International Conference on Machine Learning, 748-756, 2017
315* 2017
Learned optimizers that scale and generalize
O Wichrowska, N Maheswaranathan, MW Hoffman, SG Colmenarejo, ...
International Conference on Machine Learning, 3751-3760, 2017
298 2017
Parallel multiscale autoregressive density estimation
S Reed, A Oord, N Kalchbrenner, SG Colmenarejo, Z Wang, Y Chen, ...
International Conference on Machine Learning, 2912-2921, 2017
232 2017
Acme: A research framework for distributed reinforcement learning
MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ...
arXiv preprint arXiv:2006.00979, 2020
229 2020
RL Unplugged: A suite of benchmarks for offline reinforcement learning
C Gulcehre, Z Wang, A Novikov, T Paine, S Gómez, K Zolna, R Agarwal, ...
Advances in Neural Information Processing Systems 33, 7248-7259, 2020
183* 2020
Scaling data-driven robotics with reward sketching and batch reinforcement learning
S Cabi, SG Colmenarejo, A Novikov, K Konyushkova, S Reed, R Jeong, ...
arXiv preprint arXiv:1909.12200, 2019
152* 2019
Programmable agents
M Denil, SG Colmenarejo, S Cabi, D Saxton, N De Freitas
arXiv preprint arXiv:1706.06383, 2017
68 2017
Task-relevant adversarial imitation learning
K Zolna, S Reed, A Novikov, SG Colmenarejo, D Budden, S Cabi, M Denil, ...
Conference on Robot Learning, 247-263, 2021
56 2021
Learning awareness models
B Amos, L Dinh, S Cabi, T Rothörl, SG Colmenarejo, A Muldal, T Erez, ...
arXiv preprint arXiv:1804.06318, 2018
52 2018
The intentional unintentional agent: Learning to solve many continuous control tasks simultaneously
S Cabi, SG Colmenarejo, MW Hoffman, M Denil, Z Wang, N Freitas
Conference on Robot Learning, 207-216, 2017
39 2017
RoboCat: A Self-Improving Generalist Agent for Robotic Manipulation
K Bousmalis, G Vezzani, D Rao, CM Devin, AX Lee, MB Villalonga, ...
Transactions on Machine Learning Research, 2023
36* 2023
Regularized behavior value estimation
C Gulcehre, SG Colmenarejo, Z Wang, J Sygnowski, T Paine, K Zolna, ...
arXiv preprint arXiv:2103.09575, 2021
26 2021
One-shot high-fidelity imitation: Training large-scale deep nets with RL
TL Paine, SG Colmenarejo, Z Wang, S Reed, Y Aytar, T Pfaff, ...
arXiv preprint arXiv:1810.05017, 2018
25 2018
TF-Replicator: Distributed machine learning for researchers
P Buchlovsky, D Budden, D Grewe, C Jones, J Aslanides, F Besse, ...
arXiv preprint arXiv:1902.00465, 2019
24 2019
StarCraft II Unplugged: Large scale offline reinforcement learning
M Mathieu, S Ozair, S Srinivasan, C Gulcehre, S Zhang, R Jiang, ...
Deep RL Workshop NeurIPS 2021, 2021
19* 2021
Iterative multiscale image generation using neural networks
NE Kalchbrenner, D Belov, SG Colmenarejo, AGA van den Oord, Z Wang, ...
US Patent 11,361,403, 2022
5 2022
Articles 1–20