R.: Incremental multi-step Q-learning (1996)

by Jing Peng , Ronald J. Williams , Pack Kaelbling
Citations:112 - 3 self