Seguir
Yunhao Tang
Yunhao Tang
Research Scientist, DeepMind
Dirección de correo verificada de columbia.edu - Página principal
Título
Citado por
Citado por
Año
Reinforcement learning for integer programming: Learning to cut
Y Tang, S Agrawal, Y Faenza
International conference on machine learning, 9367-9376, 2020
892020
Es-maml: Simple hessian-free meta learning
X Song, W Gao, Y Yang, K Choromanski, A Pacchiano, Y Tang
arXiv preprint arXiv:1910.01215, 2019
762019
Discretizing continuous action space for on-policy optimization
Y Tang, S Agrawal
Proceedings of the aaai conference on artificial intelligence 34 (04), 5981-5988, 2020
622020
Monte-Carlo tree search as regularized policy optimization
JB Grill, F Altché, Y Tang, T Hubert, M Valko, I Antonoglou, R Munos
International Conference on Machine Learning, 3769-3778, 2020
442020
From complexity to simplicity: Adaptive es-active subspaces for blackbox optimization
KM Choromanski, A Pacchiano, J Parker-Holder, Y Tang, V Sindhwani
Advances in Neural Information Processing Systems 32, 2019
352019
Provably robust blackbox optimization for reinforcement learning
K Choromanski, A Pacchiano, J Parker-Holder, Y Tang, D Jain, Y Yang, ...
CoRR, abs/1903.02993, 2019
29*2019
Boosting trust region policy optimization by normalizing flows policy
Y Tang, S Agrawal
arXiv preprint arXiv:1809.10326, 2018
282018
Orthogonal estimation of wasserstein distances
M Rowland, J Hron, Y Tang, K Choromanski, T Sarlós, A Weller
The 22nd International Conference on Artificial Intelligence and Statistics …, 2019
272019
Exploration by distributional reinforcement learning
Y Tang, S Agrawal
arXiv preprint arXiv:1805.01907, 2018
252018
Learning to Score Behaviors for Guided Policy Optimization
A Pacchiano, J Parker-Holder, Y Tang, A Choromanska, K Choromanski, ...
arXiv preprint arXiv:1906.04349, 2019
202019
Variational deep q network
Y Tang, A Kucukelbir
arXiv preprint arXiv:1711.11225, 2017
142017
Revisiting Peng’s Q() for Modern Reinforcement Learning
T Kozuno, Y Tang, M Rowland, R Munos, S Kapturowski, W Dabney, ...
International Conference on Machine Learning, 5794-5804, 2021
112021
Hindsight expectation maximization for goal-conditioned reinforcement learning
Y Tang, A Kucukelbir
International Conference on Artificial Intelligence and Statistics, 2863-2871, 2021
112021
Taylor expansion policy optimization
Y Tang, M Valko, R Munos
International Conference on Machine Learning, 9397-9406, 2020
102020
Self-imitation learning via generalized lower bound q-learning
Y Tang
Advances in neural information processing systems 33, 13964-13975, 2020
102020
Online hyper-parameter tuning in off-policy learning via evolutionary strategies
Y Tang, K Choromanski
arXiv preprint arXiv:2006.07554, 2020
92020
Implicit policy for reinforcement learning
Y Tang, S Agrawal
arXiv preprint arXiv:1806.06798, 2018
92018
Variance reduction for evolution strategies via structured control variates
Y Tang, K Choromanski, A Kucukelbir
International Conference on Artificial Intelligence and Statistics, 646-656, 2020
72020
KAMA-NNs: Low-dimensional rotation based neural networks
K Choromanski, A Pacchiano, J Pennington, Y Tang
The 22nd International Conference on Artificial Intelligence and Statistics …, 2019
72019
Unifying gradient estimators for meta-reinforcement learning via off-policy evaluation
Y Tang, T Kozuno, M Rowland, R Munos, M Valko
Advances in Neural Information Processing Systems 34, 5303-5315, 2021
62021
El sistema no puede realizar la operación en estos momentos. Inténtalo de nuevo más tarde.
Artículos 1–20