Dynamic preferences in multi-criteria reinforcement learning. (2005)

by Sriraam Natarajan, Prasad Tadepalli
Venue:In Proceedings of the 22nd International Conference on Machine learning,