Vinicius Zambaldi
Vinicius Zambaldi
Google Deepmind
Dirección de correo verificada de google.com
Título
Citado por
Citado por
Año
Relational inductive biases, deep learning, and graph networks
PW Battaglia, JB Hamrick, V Bapst, A Sanchez-Gonzalez, V Zambaldi, ...
arXiv preprint arXiv:1806.01261, 2018
8312018
Multi-agent reinforcement learning in sequential social dilemmas
JZ Leibo, V Zambaldi, M Lanctot, J Marecki, T Graepel
arXiv preprint arXiv:1702.03037, 2017
2962017
A unified game-theoretic approach to multiagent reinforcement learning
M Lanctot, V Zambaldi, A Gruslys, A Lazaridou, K Tuyls, J Pérolat, D Silver, ...
Advances in neural information processing systems, 4190-4203, 2017
2202017
Deep reinforcement learning with relational inductive biases
V Zambaldi, D Raposo, A Santoro, V Bapst, Y Li, I Babuschkin, K Tuyls, ...
International Conference on Learning Representations, 2018
157*2018
Value-Decomposition Networks For Cooperative Multi-Agent Learning Based On Team Reward.
P Sunehag, G Lever, A Gruslys, WM Czarnecki, VF Zambaldi, ...
AAMAS, 2085-2087, 2018
1092018
Value-decomposition networks for cooperative multi-agent learning
P Sunehag, G Lever, A Gruslys, WM Czarnecki, V Zambaldi, M Jaderberg, ...
arXiv preprint arXiv:1706.05296, 2017
932017
Dawn of the selfie era: The whos, wheres, and hows of selfies on Instagram
F Souza, D de Las Casas, V Flores, SB Youn, M Cha, D Quercia, ...
Proceedings of the 2015 ACM on conference on online social networks, 221-231, 2015
832015
A multi-agent reinforcement learning model of common-pool resource appropriation
J Perolat, JZ Leibo, V Zambaldi, C Beattie, K Tuyls, T Graepel
Advances in Neural Information Processing Systems, 3643-3652, 2017
762017
Actor-critic policy optimization in partially observable multiagent environments
S Srinivasan, M Lanctot, V Zambaldi, J Pérolat, K Tuyls, R Munos, ...
Advances in neural information processing systems, 3422-3435, 2018
632018
Relational forward models for multi-agent learning
A Tacchetti, HF Song, PAM Mediano, V Zambaldi, NC Rabinowitz, ...
arXiv preprint arXiv:1809.11044, 2018
262018
OpenSpiel: A framework for reinforcement learning in games
M Lanctot, E Lockhart, JB Lespiau, V Zambaldi, S Upadhyay, J Pérolat, ...
arXiv preprint arXiv:1908.09453, 2019
192019
CompILE: Compositional imitation learning and execution
T Kipf, Y Li, H Dai, V Zambaldi, A Sanchez-Gonzalez, E Grefenstette, ...
International Conference on Machine Learning, 3418-3428, 2019
142019
Lightweight Contextual Ranking of City Pictures: Urban Sociology to the Rescue.
VF Zambaldi, JP Pesce, D Quercia, VAF Almeida
ICWSM, 2014
132014
Compositional imitation learning: Explaining and executing one task at a time
T Kipf, Y Li, H Dai, V Zambaldi, E Grefenstette, P Kohli, P Battaglia
arXiv preprint arXiv:1812.01483, 2018
92018
Memo: A deep network for flexible combination of episodic memories
A Banino, AP Badia, R Köster, MJ Chadwick, V Zambaldi, D Hassabis, ...
arXiv preprint arXiv:2001.10913, 2020
62020
The Advantage Regret-Matching Actor-Critic
A Gruslys, M Lanctot, R Munos, F Timbers, M Schmid, J Perolat, D Morrill, ...
arXiv preprint arXiv:2008.12234, 2020
2020
Deep Learning Monitor
CT Page, M Lanctot, E Lockhart, JB Lespiau, V Zambaldi, S Upadhyay, ...
Nature Communications 11 (1), 1760, 2020
2020
Reinforcement learning using a relational network for generating data encoding relationships between entities in an environment
Y Li, VC Bapst, V Zambaldi, DN Raposo, AA Santoro
US Patent App. 16/417,580, 2019
2019
CompILE: Compositional Imitation Learning and Execution Download PDF
T Kipf, Y Li, H Dai, V Zambaldi, A Sanchez-Gonzalez, E Grefenstette, ...
使用 NLP 预测电影类型-多标签...
PW Battaglia, JB Hamrick, V Bapst, A Sanchez-Gonzalez, V Zambaldi, ...
El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.
Artículos 1–20