Scaling internal-state policy-gradient methods for POMDPs (2002)

by Douglas Aberdeen, Jonathan Baxter
Venue: Proc. ICML-02, pp. 3–10
Citations: 21 (0 self)