DMCA

Preference-Based Policy Iteration: Leveraging Preference Learning for Reinforcement Learning

by Weiwei Cheng , Johannes Fürnkranz , Eyke Hüllermeier , Sang-hyeun Park
Citations:9 - 4 self