Results 1 - 10
of
259
Heuristic search for identical payoff Bayesian games
- AAMAS
, 2010
"... Bayesian games can be used to model single-shot decision problems in which agents only possess incomplete information about other agents, and hence are important for multiagent coordination under uncertainty. Moreover they can be used to represent different stages of sequential multiagent decision p ..."
Abstract
-
Cited by 13 (9 self)
- Add to MetaCart
problems, such as POSGs and DEC-POMDPs, and appear as an operation in many methods for multiagent planning under uncertainty. In this paper we are interested in coordinating teams of cooperative agents. While many such problems can be formulated as Bayesian games with identical payoffs, little work has
The Nonstochastic Multiarmed Bandit Problem
- SIAM JOURNAL OF COMPUTING
, 2002
"... In the multiarmed bandit problem, a gambler must decide which arm of K non-identical slot machines to play in a sequence of trials so as to maximize his reward. This classical problem has received much attention because of the simple model it provides of the trade-off between exploration (trying out ..."
Abstract
-
Cited by 491 (34 self)
- Add to MetaCart
In the multiarmed bandit problem, a gambler must decide which arm of K non-identical slot machines to play in a sequence of trials so as to maximize his reward. This classical problem has received much attention because of the simple model it provides of the trade-off between exploration (trying
Gambling in a rigged casino: The adversarial multi-armed bandit problem
, 1995
"... In the multi-armed bandit problem, a gambler must decide which arm of K non-identical slot machines to play in a sequence of trials so as to maximize his reward. This classical problem has received much attention because of the simple model it provides of the trade-off between exploration (trying ou ..."
Abstract
-
Cited by 242 (8 self)
- Add to MetaCart
In the multi-armed bandit problem, a gambler must decide which arm of K non-identical slot machines to play in a sequence of trials so as to maximize his reward. This classical problem has received much attention because of the simple model it provides of the trade-off between exploration (trying
of LaborWork and Money: Payoffs by Ethnic Identity and Gender
, 2009
"... Any opinions expressed here are those of the author(s) and not those of IZA. Research published in this series may include views on policy, but the institute itself takes no institutional policy positions. The Institute for the Study of Labor (IZA) in Bonn is a local and virtual international resear ..."
Abstract
- Add to MetaCart
Any opinions expressed here are those of the author(s) and not those of IZA. Research published in this series may include views on policy, but the institute itself takes no institutional policy positions. The Institute for the Study of Labor (IZA) in Bonn is a local and virtual international research center and a place of communication between science, politics and business. IZA is an independent nonprofit organization supported by Deutsche Post Foundation. The center is associated with the University of Bonn and offers a stimulating research environment through its international network, workshops and conferences, data service, project support, research visits and doctoral program. IZA engages in (i) original and internationally competitive research in all fields of labor economics, (ii) development of policy concepts, and (iii) dissemination of research results and concepts to the interested public. IZA Discussion Papers often represent preliminary work and are circulated to encourage discussion. Citation of such a paper should account for its provisional character. A revised version may be
Strategic Experimentation with Private Payoffs∗
, 2012
"... We consider two players facing identical discrete-time bandit problems with a safe and a risky arm. In any period, the risky arm yields either a success or a failure, and the first success reveals the risky arm to dominate the safe one. When payoffs are public information, the ensuing free-rider pro ..."
Abstract
- Add to MetaCart
We consider two players facing identical discrete-time bandit problems with a safe and a risky arm. In any period, the risky arm yields either a success or a failure, and the first success reveals the risky arm to dominate the safe one. When payoffs are public information, the ensuing free
The Effects of Payoff Preferences on Agent Tolerance
"... An objective of multi-agent systems is to build robust intelli-gent systems capable of existing in complex environments. These environments are often open, noisy and subject to rapid, unpredictable changes. This paper will explore how agents can bias their interactions and choices in these com-plex ..."
Abstract
- Add to MetaCart
-retic simulations examining the effects of agent payoff prefer-ences within an evolutionary setting. Our experiments show that these factors promote tolerance throughout the popula-tion. We provide an experimental benchmark using an almost identical game environment where payoffs are not considered by agents
Group Identity and Social Preferences
, 2007
"... Identity is a central concept in the social sciences. In this study, we present a laboratory experiment that measures the effects of induced group identity on participant social preferences. We find that when participants are matched with an ingroup member (as opposed to an outgroup member) they sho ..."
Abstract
-
Cited by 88 (4 self)
- Add to MetaCart
Identity is a central concept in the social sciences. In this study, we present a laboratory experiment that measures the effects of induced group identity on participant social preferences. We find that when participants are matched with an ingroup member (as opposed to an outgroup member
Competitive Centipede Games: Zero-End Payoffs and Payoff Inequality Deter Reciprocal Cooperation
, 2015
"... Abstract: Reciprocal cooperation can be studied in the Centipede game, in which two players alternate in choosing between a cooperative GO move and a non-cooperative STOP move. GO sustains the interaction and increases the player pair's total payoff while incurring a small personal cost; STOP ..."
Abstract
- Add to MetaCart
-theoretic solution, therefore they should have elicited identical decision patterns, according to orthodox game theory. Nevertheless, both zero-end payoffs and high payoff inequality were found to reduce cooperation significantly. Contrary to previous predictions, combining these two factors in one game resulted
Immediately Reactive Equilibria in Infinitely Repeated Games with Additively Separable Continuous Payoffs
, 2006
"... This paper studies a class of infinitely repeated games with two players in which the action space of each player is an interval, and the one-shot payoff of each player is additively separable in their actions. We define an immediately reactive equilibrium (IRE) as a pure-strategy subgame perfect eq ..."
Abstract
- Add to MetaCart
This paper studies a class of infinitely repeated games with two players in which the action space of each player is an interval, and the one-shot payoff of each player is additively separable in their actions. We define an immediately reactive equilibrium (IRE) as a pure-strategy subgame perfect
Learning to Coordinate Efficiently: A Model-based Approach
"... In common-interest stochastic games all players receive an identical payoff. Players participating in such games must learn to coordinate with each other in order to receive the highest-possible value. A number of reinforcement learning algorithms have been proposed for this problem, and some have b ..."
Abstract
-
Cited by 14 (1 self)
- Add to MetaCart
In common-interest stochastic games all players receive an identical payoff. Players participating in such games must learn to coordinate with each other in order to receive the highest-possible value. A number of reinforcement learning algorithms have been proposed for this problem, and some have
Results 1 - 10
of
259