A Dual Back-Propagation Scheme for Scalar Reward Learning (1987)

by P Munro
Venue:Proc. Ninth Annual Conf. of the Cognitive Science Society