Shangtong Zhang

Cited by

	All	Since 2019
Citations	1151	1114
h-index	16	16
i10-index	23	21

Citations per year: 2017: 5; 2018: 26; 2019: 63; 2020: 158; 2021: 231; 2022: 275; 2023: 298; 2024: 86

Public access

10 articles available, 0 articles not available (based on funding mandates)

Co-authors

Shimon Whiteson - Professor of Computer Science, University of Oxford / Senior Staff Research Scientist, Waymo - Verified email at cs.ox.ac.uk
Hengshuai Yao - Sony AI - Verified email at ualberta.ca
Richard S. Sutton - Keen, Amii, and University of Alberta - Verified email at richsutton.com
Bo Liu - AAAI SM, IEEE SM - Verified email at cs.umass.edu
Linglong Kong - Professor, Canada Research Chair in Statistical Learning, UAlberta, and Canada CIFAR AI Chair, Amii - Verified email at ualberta.ca
Wendelin Böhmer - Sequential Decision Making Group, Delft University of Technology - Verified email at tudelft.nl
Ray Jiang - Research Scientist, DeepMind - Verified email at google.com
Remi Tachet des Combes - Verified email at alpacaml.com
Romain Laroche - Microsoft Research - Verified email at polytechnique.org
Marcus Edel - Computer Science, Free University of Berlin - Verified email at fu-berlin.de
Ryan R. Curtin - Free agent - Verified email at ratml.org
Nando de Freitas - CIFAR & DeepMind - Verified email at google.com
Tom Le Paine - Staff Research Scientist at Google DeepMind - Verified email at google.com
Julian Schrittwieser - DeepMind - Verified email at furidamu.org
Roman Ring - DeepMind - Verified email at deepmind.com
Petko Georgiev - Google DeepMind, University of Cambridge - Verified email at cam.ac.uk
Michael Mathieu - DeepMind - Verified email at google.com
Aäron van den Oord - Google DeepMind - Verified email at google.com
Sergio Gómez Colmenarejo - Research Engineer, DeepMind - Verified email at google.com
Caglar Gulcehre - Prof at EPFL, Consultant@Google DeepMind, ex-Staff Research Scientist@Google DeepMind, PhD@MILA - Verified email at google.com

Shangtong Zhang

University of Virginia

Verified email at virginia.edu

reinforcement learning


Title	Cited by	Year
A Deeper Look at Experience Replay - S Zhang, RS Sutton - Deep Reinforcement Learning Symposium, NIPS 2017, 2017	333	2017
GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values - S Zhang, B Liu, S Whiteson - ICML 2020, 2020	94	2020
Distributional Reinforcement Learning for Efficient Exploration - B Mavrin, S Zhang, H Yao, L Kong, K Wu, Y Yu - ICML 2019, 2019	87	2019
mlpack 3: a fast, flexible machine learning library - R Curtin, M Edel, M Lozhnikov, Y Mentekidis, S Ghaisas, S Zhang - Journal of Open Source Software 3 (26), 726, 2018	85	2018
DAC: The Double Actor-Critic Architecture for Learning Options - S Zhang, S Whiteson - NeurIPS 2019, 2019	77	2019
Provably Convergent Two-Timescale Off-Policy Actor-Critic with Function Approximation - S Zhang, B Liu, H Yao, S Whiteson - ICML 2020, 2019	53	2019
Generalized Off-Policy Actor-Critic - S Zhang, W Boehmer, S Whiteson - NeurIPS 2019, 2019	51	2019
Breaking the Deadly Triad with a Target Network - S Zhang, H Yao, S Whiteson - ICML 2021, 2021	40	2021
Mean-variance policy iteration for risk-averse reinforcement learning - S Zhang, B Liu, S Whiteson - Proceedings of the AAAI Conference on Artificial Intelligence 35 (12), 10905 …, 2021	36	2021
QUOTA: The Quantile Option Architecture for Reinforcement Learning - S Zhang, B Mavrin, L Kong, B Liu, H Yao - AAAI 2019, 2018	32	2018
Average-Reward Off-Policy Policy Evaluation with Function Approximation - S Zhang, Y Wan, RS Sutton, S Whiteson - ICML 2021, 2021	31	2021
ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search - S Zhang, H Chen, H Yao - AAAI 2019, 2018	31	2018
Modularized Implementation of Deep RL Algorithms in PyTorch - S Zhang	29*	2018
A deep neural network for modeling music - P Zhang, X Zheng, W Zhang, S Li, S Qian, W He, S Zhang, Z Wang - Proceedings of the 5th ACM on International Conference on Multimedia …, 2015	27	2015
Deep Residual Reinforcement Learning - S Zhang, W Boehmer, S Whiteson - AAMAS 2020, 2019	22	2019
AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning - M Mathieu, S Ozair, S Srinivasan, C Gulcehre, S Zhang, R Jiang, ... - arXiv preprint arXiv:2308.03526, 2023	19*	2023
Learning expected emphatic traces for deep RL - R Jiang, S Zhang, V Chelu, A White, H van Hasselt - Proceedings of the AAAI Conference on Artificial Intelligence 36 (6), 7015-7023, 2022	13	2022
Learning Retrospective Knowledge with Reverse Reinforcement Learning - S Zhang, V Veeriah, S Whiteson - NeurIPS 2020, 2020	13	2020
Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards - Y Song, J Wang, T Lukasiewicz, Z Xu, S Zhang, M Xu - AAAI 2020, 2019	13	2019
Comparing Deep Reinforcement Learning and Evolutionary Methods in Continuous Control - S Zhang, OR Zaiane - Deep Reinforcement Learning Symposium, NIPS 2017, 2017	12	2017
