Sergio Gómez Colmenarejo

Cited by

	Total	Since 2019
Citations	7252	6090
h-index	19	19
i10-index	19	19

Citations per year

Year	Citations
2016	88
2017	355
2018	666
2019	851
2020	946
2021	1133
2022	1193
2023	1520
2024	441

Public access

3 articles available, 0 articles not available (based on funding mandates)

Co-authors

Nando de Freitas: CIFAR & DeepMind. Verified email at google.com
Misha Denil: DeepMind. Verified email at google.com
Matthew W. Hoffman: Google DeepMind. Verified email at google.com
Caglar Gulcehre: Prof at EPFL, Consultant@Google DeepMind, ex-Staff Research Scientist@Google DeepMind, PhD@MILA. Verified email at google.com
Marcin Andrychowicz: Google Brain. Verified email at openai.com
Tom Schaul: Senior Staff Scientist, DeepMind. Verified email at nyu.edu
Koray Kavukcuoglu: DeepMind. Verified email at kavukcuoglu.org
Raia Hadsell: Google DeepMind. Verified email at google.com
Edward Grefenstette: Director of Research, Google DeepMind | Honorary Professor, UCL. Verified email at google.com
Karl Moritz Hermann: Reliant AI. Verified email at google.com
Phil Blunsom: Cohere & Oxford University. Verified email at cs.ox.ac.uk
Guillaume Desjardins: DeepMind. Verified email at google.com
Volodymyr Mnih: DeepMind. Verified email at cs.toronto.edu
Julien Cornebise: Hon. Associate Professor, University College London. Verified email at ucl.ac.uk
Joel Z Leibo: Research scientist. Verified email at google.com
Demis Hassabis: DeepMind

Sergio Gómez Colmenarejo

Research Engineer, DeepMind

Verified email at google.com

Artificial Intelligence


Title	Cited by	Year
Learning to learn by gradient descent by gradient descent. M Andrychowicz, M Denil, S Gomez, MW Hoffman, D Pfau, T Schaul, ... Advances in Neural Information Processing Systems 29, 2016	2240	2016
Hybrid computing using a neural network with dynamic external memory. A Graves, G Wayne, M Reynolds, T Harley, I Danihelka, ... Nature 538 (7626), 471-476, 2016	1837	2016
Policy distillation. AA Rusu, SG Colmenarejo, C Gulcehre, G Desjardins, J Kirkpatrick, ... arXiv preprint arXiv:1511.06295, 2015	740	2015
A generalist agent. S Reed, K Zolna, E Parisotto, SG Colmenarejo, A Novikov, G Barth-Maron, ... arXiv preprint arXiv:2205.06175, 2022	667	2022
Learning to learn without gradient descent by gradient descent. Y Chen, MW Hoffman, SG Colmenarejo, M Denil, TP Lillicrap, M Botvinick, ... International Conference on Machine Learning, 748-756, 2017	315*	2017
Learned optimizers that scale and generalize. O Wichrowska, N Maheswaranathan, MW Hoffman, SG Colmenarejo, ... International Conference on Machine Learning, 3751-3760, 2017	298	2017
Parallel multiscale autoregressive density estimation. S Reed, A Oord, N Kalchbrenner, SG Colmenarejo, Z Wang, Y Chen, ... International Conference on Machine Learning, 2912-2921, 2017	232	2017
Acme: A research framework for distributed reinforcement learning. MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ... arXiv preprint arXiv:2006.00979, 2020	229	2020
RL Unplugged: A suite of benchmarks for offline reinforcement learning. C Gulcehre, Z Wang, A Novikov, T Paine, S Gómez, K Zolna, R Agarwal, ... Advances in Neural Information Processing Systems 33, 7248-7259, 2020	183*	2020
Scaling data-driven robotics with reward sketching and batch reinforcement learning. S Cabi, SG Colmenarejo, A Novikov, K Konyushkova, S Reed, R Jeong, ... arXiv preprint arXiv:1909.12200, 2019	152*	2019
Programmable agents. M Denil, SG Colmenarejo, S Cabi, D Saxton, N De Freitas. arXiv preprint arXiv:1706.06383, 2017	68	2017
Task-relevant adversarial imitation learning. K Zolna, S Reed, A Novikov, SG Colmenarejo, D Budden, S Cabi, M Denil, ... Conference on Robot Learning, 247-263, 2021	56	2021
Learning awareness models. B Amos, L Dinh, S Cabi, T Rothörl, SG Colmenarejo, A Muldal, T Erez, ... arXiv preprint arXiv:1804.06318, 2018	52	2018
The intentional unintentional agent: Learning to solve many continuous control tasks simultaneously. S Cabi, SG Colmenarejo, MW Hoffman, M Denil, Z Wang, N Freitas. Conference on Robot Learning, 207-216, 2017	39	2017
RoboCat: A Self-Improving Generalist Agent for Robotic Manipulation. K Bousmalis, G Vezzani, D Rao, CM Devin, AX Lee, MB Villalonga, ... Transactions on Machine Learning Research, 2023	36*	2023
Regularized behavior value estimation. C Gulcehre, SG Colmenarejo, Z Wang, J Sygnowski, T Paine, K Zolna, ... arXiv preprint arXiv:2103.09575, 2021	26	2021
One-shot high-fidelity imitation: Training large-scale deep nets with RL. TL Paine, SG Colmenarejo, Z Wang, S Reed, Y Aytar, T Pfaff, ... arXiv preprint arXiv:1810.05017, 2018	25	2018
TF-Replicator: Distributed machine learning for researchers. P Buchlovsky, D Budden, D Grewe, C Jones, J Aslanides, F Besse, ... arXiv preprint arXiv:1902.00465, 2019	24	2019
StarCraft II Unplugged: Large scale offline reinforcement learning. M Mathieu, S Ozair, S Srinivasan, C Gulcehre, S Zhang, R Jiang, ... Deep RL Workshop NeurIPS 2021, 2021	19*	2021
Iterative multiscale image generation using neural networks. NE Kalchbrenner, D Belov, SG Colmenarejo, AGA van den Oord, Z Wang, ... US Patent 11,361,403, 2022	5	2022
