Fast online policy gradient learning with smd gain vector adaptation (2006)

by N Schraudolph, J Yu, D Aberdeen
Venue:in Advances in Neural Information Processing Systems 18