Consistency of HDP applied to a simple reinforcement learning problem (1990)

by P J Werbos
Venue:Neural Networks