Learning to act using real-time dynamic programming AG Barto, SJ Bradtke, SP Singh Artificial intelligence 72 (1-2), 81-138, 1995 | 1666 | 1995 |
Linear least-squares algorithms for temporal difference learning SJ Bradtke, AG Barto Machine learning 22 (1), 33-57, 1996 | 1015 | 1996 |
Adaptive linear quadratic control using policy iteration SJ Bradtke, BE Ydstie, AG Barto Proceedings of 1994 American Control Conference-ACC'94 3, 3475-3479, 1994 | 521 | 1994 |
Reinforcement learning methods for continuous-time Markov decision problems SJ Bradtke, MO Duff Advances in Neural Information Processing Systems 7 7, 393-400, 1995 | 504 | 1995 |
Real-time learning and control using asynchronous dynamic programming AG Barto, SJ Bradtke, SP Singh University of Massachusetts at Amherst, Department of Computer and …, 1991 | 239 | 1991 |
Reinforcement learning applied to linear quadratic regulation S Bradtke Advances in neural information processing systems 5, 1992 | 227 | 1992 |
Incremental dynamic programming for on-line adaptive optimal control SJ Bradtke University of Massachusetts Amherst, 1994 | 72 | 1994 |
Some Experiments with Case-Based Search. S Bradtke, WG Lehnert AAAI, 133-138, 1988 | 38 | 1988 |
Learning to Solve Stochastic Optimal Path Problems Using Real-Time Dynamic Programming AG Barto, SJ Bradtke The Proceedings of the Seventh Yale Workshop on Adaptive and Learning …, 1992 | | 1992 |