Learning multiple goal behavior via task decomposition and dynamic policy merging (1993)

by S Whitehead, J Karlsson, J Tenenberg
Venue: Robot Learning