Shun Zhang

Cited by

	Total	Since 2019
Citations	440	342
h-index	9	9
i10-index	8	8

Citations per year
Year	2014	2015	2016	2017	2018	2019	2020	2021	2022	2023	2024
Citations	3	5	15	33	40	23	40	41	50	112	76

Public access

9 articles available, 0 articles not available (based on funding mandates)

Co-authors

Peter Stone - Professor of Computer Science, The University of Texas at Austin - Verified email at cs.utexas.edu
Chuang Gan - UMass Amherst | MIT-IBM Watson AI Lab - Verified email at csail.mit.edu
Tsz-Chiu Au - Ulsan National Institute of Science and Technology - Verified email at cs.utexas.edu
Edmund Durfee - Professor Emeritus of Computer Science and Engineering, University of Michigan - Verified email at umich.edu
Satinder Singh - Google DeepMind / U. of Michigan - Verified email at umich.edu
Dana Ballard - Professor of Computer Science, University of Texas at Austin - Verified email at cs.utexas.edu
Mary Hayhoe - Professor of Psychology, University of Texas Austin - Verified email at utexas.edu
Matthew Tong - IBM Research - Verified email at alumni.ucsd.edu
Ruohan Zhang - Stanford University - Verified email at stanford.edu
Xin Zhang - IBM Thomas J. Watson Research Center / Columbia University - Verified email at us.ibm.com


Shun Zhang

MIT-IBM Watson AI Lab

Verified email at ibm.com

reinforcement learning, human-agent interaction, value alignment, AI safety


Title	Cited by	Year
Autonomous intersection management for semi-autonomous vehicles. TC Au, S Zhang, P Stone. Routledge Handbook of Transportation, 88-104, 2015	142	2015
Prompting Decision Transformer for Few-Shot Policy Generalization. M Xu, Y Shen, S Zhang, Y Lu, D Zhao, JB Tenenbaum, C Gan. International Conference on Machine Learning, 2022	78	2022
Planning with large language models for code generation. S Zhang, Z Chen, Y Shen, M Ding, JB Tenenbaum, C Gan. arXiv preprint arXiv:2303.05510, 2023	53	2023
Minimax-Regret Querying on Side Effects for Safe Optimality in Factored Markov Decision Processes. S Zhang, EH Durfee, S Singh. IJCAI, 4867-4873, 2018	43	2018
Determining placements of influencing agents in a flock. K Genter, S Zhang, P Stone. Proceedings of the 2015 International Conference on Autonomous Agents and …, 2015	30	2015
Semi-autonomous intersection management. TC Au, S Zhang, P Stone. AAMAS, 1451-1452, 2014	29	2014
Hyper-decision transformer for efficient online policy adaptation. M Xu, Y Lu, Y Shen, S Zhang, D Zhao, C Gan. arXiv preprint arXiv:2304.08487, 2023	15	2023
Modeling sensory-motor decisions in natural behavior. R Zhang, S Zhang, MH Tong, Y Cui, CA Rothkopf, DH Ballard, ... PLoS Computational Biology 14 (10), e1006518, 2018	13	2018
Querying to find a safe policy under uncertain safety constraints in Markov decision processes. S Zhang, E Durfee, S Singh. Proceedings of the AAAI Conference on Artificial Intelligence 34 (03), 2552-2559, 2020	9	2020
Approximately-optimal queries for planning in reward-uncertain Markov decision processes. S Zhang, E Durfee, S Singh. Proceedings of the International Conference on Automated Planning and …, 2017	9	2017
From specification to topology: Automatic power converter design via reinforcement learning. S Fan, N Cao, S Zhang, J Li, X Guo, X Zhang. 2021 IEEE/ACM International Conference On Computer Aided Design (ICCAD), 1-9, 2021	8	2021
Modeling Task Control of Gaze. M Tong, S Zhang, L Johnson, D Ballard, M Hayhoe. Journal of Vision 15 (12), 784-784, 2015	4	2015
Adaptive Online Replanning with Diffusion Models. S Zhou, Y Du, S Zhang, M Xu, Y Shen, W Xiao, DY Yeung, C Gan. Advances in Neural Information Processing Systems 36, 2023	2	2023
Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble. S Zhang, Z Chen, S Chen, Y Shen, Z Sun, C Gan. arXiv preprint arXiv:2401.16635, 2024	1	2024
Power Converter Circuit Design Automation using Parallel Monte Carlo Tree Search. S Fan, S Zhang, J Liu, N Cao, X Guo, J Li, X Zhang. ACM Transactions on Design Automation of Electronic Systems (TODAES), 2022	1	2022
Modeling Sensorimotor Behavior through Modular Inverse Reinforcement Learning with Discount Factors. R Zhang, S Zhang, MH Tong, MM Hayhoe, DH Ballard. Journal of Vision 17 (10), 1267-1267, 2017	1	2017
Parameterized modular inverse reinforcement learning. S Zhang	1	2015
Intersection Management With Constraint-Based Reservation Systems. TC Au, S Zhang, P Stone. Autonomous Robots and Multirobot Systems (ARMS), 2014	1	2014
Efficiently Finding Approximately-Optimal Queries for Improving Policies and Guaranteeing Safety. S Zhang		2020
On Querying for Safe Optimality in Factored Markov Decision Processes. S Zhang, EH Durfee, S Singh. Proceedings of the 17th International Conference on Autonomous Agents and …, 2018		2018
