Shangtong Zhang
Verified email at cs.ox.ac.uk
Title
Cited by
Year
A Deeper Look at Experience Replay
S Zhang, RS Sutton
Deep Reinforcement Learning Symposium, NIPS 2017, 2017
Cited by: 63; Year: 2017
mlpack 3: a fast, flexible machine learning library
R Curtin, M Edel, M Lozhnikov, Y Mentekidis, S Ghaisas, S Zhang
Journal of Open Source Software 3 (26), 726, 2018
Cited by: 27; Year: 2018
A deep neural network for modeling music
P Zhang, X Zheng, W Zhang, S Li, S Qian, W He, S Zhang, Z Wang
Proceedings of the 5th ACM on International Conference on Multimedia …, 2015
Cited by: 14; Year: 2015
Generalized Off-Policy Actor-Critic
S Zhang, W Boehmer, S Whiteson
NeurIPS 2019, 2019
Cited by: 9; Year: 2019
DAC: The Double Actor-Critic Architecture for Learning Options
S Zhang, S Whiteson
NeurIPS 2019, 2019
Cited by: 8; Year: 2019
ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search
S Zhang, H Chen, H Yao
AAAI 2019, 2018
Cited by: 7; Year: 2018
QUOTA: The Quantile Option Architecture for Reinforcement Learning
S Zhang, B Mavrin, L Kong, B Liu, H Yao
AAAI 2019, 2018
Cited by: 7; Year: 2018
Distributional Reinforcement Learning for Efficient Exploration
B Mavrin, S Zhang, H Yao, L Kong, K Wu, Y Yu
ICML 2019, 2019
Cited by: 5*; Year: 2019
Comparing Deep Reinforcement Learning and Evolutionary Methods in Continuous Control
S Zhang, OR Zaiane
Deep Reinforcement Learning Symposium, NIPS 2017, 2017
Cited by: 4; Year: 2017
Provably Convergent Two-Timescale Off-Policy Actor-Critic with Function Approximation
S Zhang, B Liu, H Yao, S Whiteson
ICML 2020, 2019
Cited by: 2*; Year: 2019
Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards
Y Song, J Wang, T Lukasiewicz, Z Xu, S Zhang, M Xu
AAAI 2020, 2019
Cited by: 2; Year: 2019
Learning with Artificial Neural Networks
S Zhang
Master's thesis, University of Alberta, 2018
Cited by: 1; Year: 2018
Crossprop: Learning Representations by Stochastic Meta-Gradient Descent in Neural Networks
V Veeriah, S Zhang, RS Sutton
Joint European Conference on Machine Learning and Knowledge Discovery in …, 2017
Cited by: 1; Year: 2017
Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning
S Zhang, B Liu, S Whiteson
arXiv preprint arXiv:2004.10888, 2020
Year: 2020
Deep Residual Reinforcement Learning
S Zhang, W Boehmer, S Whiteson
AAMAS 2020, 2019
Year: 2019