|
1134
|
Reinforcement learning: a survey
– Leslie Pack Kaelbling, Michael L. Littman, Andrew W. Moore
- 1996
|
|
102
|
Machine-Learning Research -- Four Current Directions
– Thomas G. Dietterich
|
|
4
|
What Makes a Good Co-Evolutionary Learning Environment?
– Alan D. Blair, Jordan B. Pollack
- 1997
|
|
75
|
Robust Non-linear Control through Neuroevolution
– Faustino John Gomez
- 2003
|
|
6
|
Missile Defense and Interceptor Allocation by Neuro-Dynamic Programming
– Dimitri Bertsekas, Mark L. Homer, David A. Logan, Stephen D. Patek, Nils R. Sandell
- 1999
|
|
18
|
Large-Scale Dynamic Optimization Using Teams of Reinforcement Learning Agents
– Robert Harry Crites
- 1996
|
|
2
|
How to Make Software Agents Do the Right Thing: An Introduction to Reinforcement Learning
– Santinder Singh, Peter Norvig, David Cohn, Harlequin Inc
- 1996
|
|
5
|
A Dynamic Channel Assignment Policy through Q-Learning
– Junhong Nie, Simon Haykin
- 1999
|
|
80
|
An introduction to collective intelligence
– David H. Wolpert, Kagan Tumer
- 1999
|
|
68
|
Elevator Group Control Using Multiple Reinforcement Learning Agents
– Robert H. Crites, Andrew G. Barto, Michael Huhns, Gerhard Weiss
- 1998
|
|
49
|
TD(λ) Converges with Probability 1
– Peter Dayan, Terrence J. Sejnowski
- 1994
|
|
16
|
Value Function Based Production Scheduling
– Jeff G. Schneider, Justin A. Boyan, Andrew W. Moore
- 1998
|
|
9
|
A Tutorial Survey of Reinforcement Learning
– S Sathiya Keerthi, B Ravindran
|
|
23
|
Truncating temporal differences: On the efficient implementation of TD(λ) for reinforcement learning
– Paweł Cichosz
- 1995
|
|
31
|
Modular Neural Networks for Learning Context-Dependent Game Strategies
– Justin A. Boyan
- 1992
|
|
224
|
Generalization in Reinforcement Learning: Safely Approximating the Value Function
– Justin A. Boyan, Andrew W. Moore
- 1995
|
|
275
|
Prioritized sweeping: Reinforcement learning with less data and less time
– Andrew W. Moore, Christopher G. Atkeson
- 1993
|
|
|
Report of the 1996 Workshop on Reinforcement Learning
– Sridhar Mahadevan, Leslie Pack Kaelbling
|
|
6
|
Intelligent Traffic Light Control
– Marco Wiering, Jelle van Veenen, Jilles Vreeken, Arne Koopman
- 2004
|