Preferencebased policy iteration: leveraging preference learning for reinforcement learning,” (2011)

by W Cheng, J Furnkranz, E Hullermeier, S Park
Venue:in Machine Learning and Knowledge Discovery in Databases.