Sridhar Mahadevan
Sridhar Mahadevan
Director, Data Science Lab, Adobe Research & Professor, University of Massachusetts, Amherst
Verified email at cs.umass.edu - Homepage
TitleCited byYear
Recent advances in hierarchical reinforcement learning
AG Barto, S Mahadevan
Discrete event dynamic systems 13 (1-2), 41-77, 2003
11052003
Automatic programming of behavior-based robots using reinforcement learning
S Mahadevan, J Connell
Artificial intelligence 55 (2-3), 311-365, 1992
8481992
Average reward reinforcement learning: Foundations, algorithms, and empirical results
S Mahadevan
Machine learning 22 (1-3), 159-195, 1996
4281996
LEAP: A learning apprentice for VLSI design
TM Mitchell, S Mabadevan, LI Steinberg
Machine learning, 271-289, 1990
4021990
Multifaceted Therapeutic Benefits of Ginkgo biloba L.: Chemistry, Efficacy, Safety, and Uses
S Mahadevan, Y Park
Journal of food science 73 (1), R14-R19, 2008
3502008
Proto-value functions: A Laplacian framework for learning representation and control in Markov decision processes
S Mahadevan, M Maggioni
Journal of Machine Learning Research 8 (Oct), 2169-2231, 2007
2722007
Robot learning
JH Connell, S Mahadevan
Springer Science & Business Media, 2012
2622012
Heterogeneous domain adaptation using manifold alignment
C Wang, S Mahadevan
Twenty-Second International Joint Conference on Artificial Intelligence, 2011
2532011
Manifold alignment using procrustes analysis
C Wang, S Mahadevan
Proceedings of the 25th international conference on Machine learning, 1120-1127, 2008
2312008
Solving semi-Markov decision problems using average reward reinforcement learning
TK Das, A Gosavi, S Mahadevan, N Marchalleck
Management Science 45 (4), 560-574, 1999
2121999
Manifold alignment without correspondence
C Wang, S Mahadevan
Twenty-First International Joint Conference on Artificial Intelligence, 2009
1532009
Self-improving factory simulation using continuous-time average-reward reinforcement learning
S Mahadevan, N Marchalleck, TK Das, A Gosavi
MACHINE LEARNING-INTERNATIONAL WORKSHOP THEN CONFERENCE-, 202-210, 1997
1421997
Generative multi-adversarial networks
I Durugkar, I Gemp, S Mahadevan
arXiv preprint arXiv:1611.01673, 2016
1362016
Hierarchical multi-agent reinforcement learning
R Makar, S Mahadevan, M Ghavamzadeh
Proceedings of the fifth international conference on Autonomous agents, 246-253, 2001
1342001
Hierarchical multi-agent reinforcement learning
M Ghavamzadeh, S Mahadevan, R Makar
Autonomous Agents and Multi-Agent Systems 13 (2), 197-229, 2006
1282006
J. 4 supervised actor-critic reinforcement learning
M Barto, MT Rosenstein
Handbook of learning and approximate dynamic programming 2, 359, 2004
1262004
Repairing disengagement with non-invasive interventions
I Arroyo, K Ferguson, J Johns, T Dragon, H Meheranian, D Fisher, A Barto, ...
AIED 2007, 195-202, 2007
1232007
Gaze control for face learning and recognition by humans and machines
TF Shipley, PJ Kellman
From fragments to objects: Segmentation and grouping in vision, 463, 2001
1212001
Proto-value functions: Developmental reinforcement learning
S Mahadevan
Proceedings of the 22nd international conference on Machine learning, 553-560, 2005
1142005
Hierarchical memory-based reinforcement learning
N Hernandez-Gardiol, S Mahadevan
Advances in Neural Information Processing Systems, 1047-1053, 2001
1072001
The system can't perform the operation now. Try again later.
Articles 1–20