Off-policy shaping ensembles in reinforcement learning (2014)

by A Harutyunyan, T Brys, P Vrancx, A Nowé
Venue:In Proceedings of ECAI