A reinforcement learning method for maximizing undiscounted rewards (1993)

by A Schwartz
Venue:Proceedings of the Tenth International Conference on Machine Learning