Michael Littman
Title
Cited by
Year
Reinforcement learning: A survey
LP Kaelbling, ML Littman, AW Moore
Journal of artificial intelligence research 4, 237-285, 1996
Cited by: 7766, Year: 1996
Planning and acting in partially observable stochastic domains
LP Kaelbling, ML Littman, AR Cassandra
Artificial intelligence 101 (1-2), 99-134, 1998
Cited by: 3954, Year: 1998
Markov games as a framework for multi-agent reinforcement learning
ML Littman
Machine learning proceedings 1994, 157-163, 1994
Cited by: 2197, Year: 1994
Measuring praise and criticism: Inference of semantic orientation from association
PD Turney, ML Littman
ACM Transactions on Information Systems (TOIS) 21 (4), 315-346, 2003
Cited by: 1991, Year: 2003
Activity recognition from accelerometer data
N Ravi, N Dandekar, P Mysore, ML Littman
AAAI 5 (2005), 1541-1546, 2005
Cited by: 1835, Year: 2005
Packet routing in dynamically changing networks: A reinforcement learning approach
JA Boyan, ML Littman
Advances in neural information processing systems, 671-678, 1994
Cited by: 843, Year: 1994
Acting optimally in partially observable stochastic domains
AR Cassandra, LP Kaelbling, ML Littman
AAAI 94, 1023-1028, 1994
Cited by: 801, Year: 1994
Learning policies for partially observable environments: Scaling up
ML Littman, AR Cassandra, LP Kaelbling
Machine Learning Proceedings 1995, 362-370, 1995
Cited by: 785, Year: 1995
Convergence results for single-step on-policy reinforcement-learning algorithms
S Singh, T Jaakkola, ML Littman, C Szepesvári
Machine learning 38 (3), 287-308, 2000
Cited by: 701, Year: 2000
Graphical models for game theory
M Kearns, ML Littman, S Singh
arXiv preprint arXiv:1301.2281, 2013
Cited by: 695, Year: 2013
Interactions between learning and evolution
D Ackley, M Littman
Artificial life II 10, 487-509, 1991
Cited by: 669, Year: 1991
On the complexity of solving Markov decision problems
ML Littman, TL Dean, LP Kaelbling
arXiv preprint arXiv:1302.4971, 2013
Cited by: 588, Year: 2013
Incremental pruning: A simple, fast, exact method for partially observable Markov decision processes
AR Cassandra, ML Littman, NL Zhang
arXiv preprint arXiv:1302.1525, 2013
Cited by: 574, Year: 2013
Friend-or-foe Q-learning in general-sum games
ML Littman
ICML 1, 322-328, 2001
Cited by: 538, Year: 2001
Predictive representations of state
ML Littman, RS Sutton
Advances in neural information processing systems, 1555-1561, 2002
Cited by: 525, Year: 2002
Computerized cross-language document retrieval using latent semantic indexing
TK Landauer, ML Littman
US Patent 5,301,109, 1994
Cited by: 490, Year: 1994
Algorithms for sequential decision making
ML Littman
Brown University, 1996
Cited by: 489, Year: 1996
Unsupervised learning of semantic orientation from a hundred-billion-word corpus
PD Turney, ML Littman
arXiv preprint cs/0212012, 2002
Cited by: 400, Year: 2002
Value-function reinforcement learning in Markov games
ML Littman
Cognitive systems research 2 (1), 55-66, 2001
Cited by: 389, Year: 2001
PAC model-free reinforcement learning
AL Strehl, L Li, E Wiewiora, J Langford, ML Littman
Proceedings of the 23rd international conference on Machine learning, 881-888, 2006
Cited by: 386, Year: 2006