• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • Donate

CiteSeerX logo

Tools

Sorted by:
Try your query at:
Semantic Scholar Scholar Academic
Google Bing DBLP
Results 1 - 10 of 3,548
Next 10 →

General Game Management Agent

by n.n. , 2009
"... The task of managing general game playing in a multi-agent system is the problem addressed in this paper. It is considered to be done by an agent. There are many reasons for constructing such an agent, called general game management agent. This agent manages strategic interactions between other agen ..."
Abstract - Add to MetaCart
The task of managing general game playing in a multi-agent system is the problem addressed in this paper. It is considered to be done by an agent. There are many reasons for constructing such an agent, called general game management agent. This agent manages strategic interactions between other

A Methodology for Agent-Oriented Analysis and Design

by Michael Wooldridge , Nicholas R. Jennings, David Kinny , 1999
"... This paper presents a methodology for agent-oriented analysis and design. The methodology is general, in that it is applicable to a wide range of multi-agent systems, and comprehensive, in that it deals with both the macro-level (societal) and the micro-level (agent) aspects of systems. The methodol ..."
Abstract - Cited by 829 (12 self) - Add to MetaCart
This paper presents a methodology for agent-oriented analysis and design. The methodology is general, in that it is applicable to a wide range of multi-agent systems, and comprehensive, in that it deals with both the macro-level (societal) and the micro-level (agent) aspects of systems

R-MAX - A General Polynomial Time Algorithm for Near-Optimal Reinforcement Learning

by Ronen I. Brafman, Moshe Tennenholtz, Pack Kaelbling , 2001
"... R-max is a very simple model-based reinforcement learning algorithm which can attain near-optimal average reward in polynomial time. In R-max, the agent always maintains a complete, but possibly inaccurate model of its environment and acts based on the optimal policy derived from this model. The mod ..."
Abstract - Cited by 297 (10 self) - Add to MetaCart
. The model is initialized in an optimistic fashion: all actions in all states return the maximal possible reward (hence the name). During execution, it is updated based on the agent's observations. R-max improves upon several previous algorithms: (1) It is simpler and more general than Kearns and Singh

A framework for argumentation-based negotiation

by Carles Sierra, Nick R. Jennings, Pablo Noriega, Simon Parsons - Proceedings of the 4th International Workshop on Agent Theories, Architectures, and Languages (ATAL-97), volume 1365 of LNAI , 1998
"... Abstract. Many autonomous agents operate in domains in which the cooperation of their fellow agents cannot be guaranteed. In such domains negotiation is essential to persuade others of the value of co-operation. This paper describes a general framework for negotiation in which agents exchange propos ..."
Abstract - Cited by 289 (57 self) - Add to MetaCart
Abstract. Many autonomous agents operate in domains in which the cooperation of their fellow agents cannot be guaranteed. In such domains negotiation is essential to persuade others of the value of co-operation. This paper describes a general framework for negotiation in which agents exchange

Multi-agent influence diagrams for representing and solving games

by Daphne Koller, Brian Milch - GAMES AND ECONOMIC BEHAVIOR , 2001
"... The traditional representations of games using the extensive form or the strategic (normal) form obscure much of the structure that is present in real-world games. In this paper, we propose a new representation language for general multiplayer games — multi-agent influence diagrams (MAIDs). This rep ..."
Abstract - Cited by 188 (2 self) - Add to MetaCart
The traditional representations of games using the extensive form or the strategic (normal) form obscure much of the structure that is present in real-world games. In this paper, we propose a new representation language for general multiplayer games — multi-agent influence diagrams (MAIDs

The Michigan Internet AuctionBot: A configurable auction server for human and software agents

by Peter R. Wurman, Michael P. Wellman, William E. Walsh - IN PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS, MAY1998 , 1998
"... Market mechanisms, such as auctions, will likely represent a common interaction medium for agents on the Internet. The Michigan Internet AuctionBot is a flexible, scalable, and robust auction server that supports both software and human agents. The server manages many simultaneous auctions by separa ..."
Abstract - Cited by 250 (15 self) - Add to MetaCart
Market mechanisms, such as auctions, will likely represent a common interaction medium for agents on the Internet. The Michigan Internet AuctionBot is a flexible, scalable, and robust auction server that supports both software and human agents. The server manages many simultaneous auctions

Nash Q-Learning for General-Sum Stochastic Games

by Junling Hu , Michael P. Wellman - JOURNAL OF MACHINE LEARNING RESEARCH , 2003
"... We extend Q-learning to a noncooperative multiagent context, using the framework of generalsum stochastic games. A learning agent maintains Q-functions over joint actions, and performs updates based on assuming Nash equilibrium behavior over the current Q-values. This learning protocol provably conv ..."
Abstract - Cited by 138 (0 self) - Add to MetaCart
We extend Q-learning to a noncooperative multiagent context, using the framework of generalsum stochastic games. A learning agent maintains Q-functions over joint actions, and performs updates based on assuming Nash equilibrium behavior over the current Q-values. This learning protocol provably

Friend or foe Q-Learning in general-sum games

by Michael L. Littman - In Proceedings of the 18th Int. Conf. on Machine Learning , 2001
"... This paper describes an approach to rein-forcement learning in multiagent general-sum games in which a learner is told to treat each other agent as either a \friend " or \foe". This Q-learning-style algorithm provides strong convergence guarantees compared to an ex-isting Nash-equilibrium- ..."
Abstract - Cited by 137 (6 self) - Add to MetaCart
This paper describes an approach to rein-forcement learning in multiagent general-sum games in which a learner is told to treat each other agent as either a \friend " or \foe". This Q-learning-style algorithm provides strong convergence guarantees compared to an ex-isting Nash

Multiagent Learning Using a Variable Learning Rate

by Michael Bowling, Manuela Veloso - Artificial Intelligence , 2002
"... Learning to act in a multiagent environment is a difficult problem since the normal definition of an optimal policy no longer applies. The optimal policy at any moment depends on the policies of the other agents and so creates a situation of learning a moving target. Previous learning algorithms hav ..."
Abstract - Cited by 225 (8 self) - Add to MetaCart
rate. We examine this technique theoretically, proving convergence in self-play on a restricted class of iterated matrix games. We also present empirical results on a variety of more general stochastic games, in situations of self-play and otherwise, demonstrating the wide applicability of this method.

Evolutionary games on graphs

by György Szabó , Gábor Fáth , 2007
"... Game theory is one of the key paradigms behind many scientific disciplines from biology to behavioral sciences to economics. In its evolutionary form and especially when the interacting agents are linked in a specific social network the underlying solution concepts and methods are very similar to ..."
Abstract - Cited by 152 (0 self) - Add to MetaCart
Game theory is one of the key paradigms behind many scientific disciplines from biology to behavioral sciences to economics. In its evolutionary form and especially when the interacting agents are linked in a specific social network the underlying solution concepts and methods are very similar
Next 10 →
Results 1 - 10 of 3,548
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2016 The Pennsylvania State University