Efficient learning of multi-step best response (2005)

by Bikramjit Banerjee, Jing Peng
Venue:In AAMAS ’05: Proceedings of the fourth international joint conference on Autonomous agents and multiagent systems