Long-term reward prediction in TD models of the dopamine system. (2002)

by N D Daw, D S Touretzky
Venue:Neural Comput.,