Reinforcement Learning to Play an Optimal Nash Equilibrium in Team Markov Games (2002)
by Xiaofeng Wang, Tuomas Sandholm
| Venue: | in Advances in Neural Information Processing Systems |
| Citations: | 57 - 3 self |







