Reinforcement Learning with High-dimensional, Continuous Actions (1993)

by L Baird, A Klopf