Reinforcement learning with high-dimensional, continuous actions (1993)

by L C Baird, A Klopf H