Punish/Reward: Learning with a Critic in Adaptive Threshold Systems (1989)

by N K Gupta Widrow, S Maitra
Venue:IEEE Transactions on Control Systems Magazine