Kernel-based reinforcement learning in average-cost problems (2002)

by D. Ormoneit, P. W. Glynn
Venue: IEEE Transactions on Automatic Control