Multi-objective reinforcement learning using sets of pareto dominating policies. (2014)

by Kristof Van Moffaert, Ann Nowe
Venue:Journal of Machine Learning Research,