Fast online policy gradient learning with smd gain vector adaptation (2006)

by Nicol N. Schraudolph , Jin Yu , Douglas Aberdeen
Venue:Advances in Neural Information Processing Systems 18
Citations:11 - 1 self