Shun Zhang
MIT-IBM Watson AI Lab
Verified email at ibm.com
Title
Cited by
Year
Autonomous intersection management for semi-autonomous vehicles
TC Au, S Zhang, P Stone
Routledge Handbook of Transportation, 88-104, 2015
Cited by: 142; Year: 2015
Prompting Decision Transformer for Few-Shot Policy Generalization
M Xu, Y Shen, S Zhang, Y Lu, D Zhao, JB Tenenbaum, C Gan
International Conference on Machine Learning, 2022
Cited by: 78; Year: 2022
Planning with large language models for code generation
S Zhang, Z Chen, Y Shen, M Ding, JB Tenenbaum, C Gan
arXiv preprint arXiv:2303.05510, 2023
Cited by: 53; Year: 2023
Minimax-Regret Querying on Side Effects for Safe Optimality in Factored Markov Decision Processes
S Zhang, EH Durfee, S Singh
IJCAI, 4867-4873, 2018
Cited by: 43; Year: 2018
Determining placements of influencing agents in a flock
K Genter, S Zhang, P Stone
Proceedings of the 2015 International Conference on Autonomous Agents and …, 2015
Cited by: 30; Year: 2015
Semi-autonomous intersection management
TC Au, S Zhang, P Stone
AAMAS, 1451-1452, 2014
Cited by: 29; Year: 2014
Hyper-decision transformer for efficient online policy adaptation
M Xu, Y Lu, Y Shen, S Zhang, D Zhao, C Gan
arXiv preprint arXiv:2304.08487, 2023
Cited by: 15; Year: 2023
Modeling sensory-motor decisions in natural behavior
R Zhang, S Zhang, MH Tong, Y Cui, CA Rothkopf, DH Ballard, ...
PLoS computational biology 14 (10), e1006518, 2018
Cited by: 13; Year: 2018
Querying to find a safe policy under uncertain safety constraints in Markov decision processes
S Zhang, E Durfee, S Singh
Proceedings of the AAAI Conference on Artificial Intelligence 34 (03), 2552-2559, 2020
Cited by: 9; Year: 2020
Approximately-optimal queries for planning in reward-uncertain Markov decision processes
S Zhang, E Durfee, S Singh
Proceedings of the International Conference on Automated Planning and …, 2017
Cited by: 9; Year: 2017
From specification to topology: Automatic power converter design via reinforcement learning
S Fan, N Cao, S Zhang, J Li, X Guo, X Zhang
2021 IEEE/ACM International Conference On Computer Aided Design (ICCAD), 1-9, 2021
Cited by: 8; Year: 2021
Modeling Task Control of Gaze
M Tong, S Zhang, L Johnson, D Ballard, M Hayhoe
Journal of Vision 15 (12), 784-784, 2015
Cited by: 4; Year: 2015
Adaptive Online Replanning with Diffusion Models
S Zhou, Y Du, S Zhang, M Xu, Y Shen, W Xiao, DY Yeung, C Gan
Advances in Neural Information Processing Systems 36, 2023
Cited by: 2; Year: 2023
Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble
S Zhang, Z Chen, S Chen, Y Shen, Z Sun, C Gan
arXiv preprint arXiv:2401.16635, 2024
Cited by: 1; Year: 2024
Power Converter Circuit Design Automation using Parallel Monte Carlo Tree Search
S Fan, S Zhang, J Liu, N Cao, X Guo, J Li, X Zhang
ACM Transactions on Design Automation of Electronic Systems (TODAES), 2022
Cited by: 1; Year: 2022
Modeling Sensorimotor Behavior through Modular Inverse Reinforcement Learning with Discount Factors
R Zhang, S Zhang, MH Tong, MM Hayhoe, DH Ballard
Journal of Vision 17 (10), 1267-1267, 2017
Cited by: 1; Year: 2017
Parameterized modular inverse reinforcement learning
S Zhang
Cited by: 1; Year: 2015
Intersection Management With Constraint-Based Reservation Systems
TC Au, S Zhang, P Stone
Autonomous Robots and Multirobot Systems (ARMS), 2014
Cited by: 1; Year: 2014
Efficiently Finding Approximately-Optimal Queries for Improving Policies and Guaranteeing Safety
S Zhang
Year: 2020
On Querying for Safe Optimality in Factored Markov Decision Processes
S Zhang, EH Durfee, S Singh
Proceedings of the 17th International Conference on Autonomous Agents and …, 2018
Year: 2018