Results 1 - 10
of
31
Regulatory Fit and Systematic Exploration in a Dynamic Decision-Making Environment
"... This work explores the influence of motivation on choice behavior in a dynamic decision-making environment, where the payoffs from each choice depend on one’s recent choice history. Previous research reveals that participants in a regulatory fit exhibit increased levels of exploratory choice and fle ..."
Abstract
-
Cited by 17 (6 self)
- Add to MetaCart
(Show Context)
This work explores the influence of motivation on choice behavior in a dynamic decision-making environment, where the payoffs from each choice depend on one’s recent choice history. Previous research reveals that participants in a regulatory fit exhibit increased levels of exploratory choice and flexible use of multiple strategies over the course of an experiment. The present study placed promotion and prevention-focused participants in a dynamic environment for which optimal performance is facilitated by systematic exploration of the decision space. These participants either gained or lost points with each choice. Our experiment revealed that participants in a regulatory fit were more likely to engage in systematic exploration of the task environment than were participants in a regulatory mismatch and performed more optimally as a result. Implications for contemporary models of human reinforcement learning are discussed.
You don’t want to know what you’re missing: When information about forgone rewards impedes dynamic decision making
"... When people learn to make decisions from experience, a reasonable intuition is that additional relevant information should improve their performance. In contrast, we find that additional information about foregone rewards (i.e., what could have gained at each point by making a different choice) seve ..."
Abstract
-
Cited by 8 (2 self)
- Add to MetaCart
(Show Context)
When people learn to make decisions from experience, a reasonable intuition is that additional relevant information should improve their performance. In contrast, we find that additional information about foregone rewards (i.e., what could have gained at each point by making a different choice) severely hinders participants ’ ability to repeatedly make choices that maximize long-term gains. We conclude that foregone reward information accentuates the local superiority of short-term options (e.g., consumption) and consequently biases choice away from productive long-term options (e.g., exercise). These conclusions are consistent with a standard reinforcement-learning mechanism that processes information about experienced and forgone rewards. In contrast to related contributions using delay-of-gratification paradigms, we do not posit separate top-down and emotion-driven systems to explain performance. We find that individual and group data are well characterized by a single reinforcement-learning mechanism that combines information about experienced and foregone rewards.
Taking more, now: The optimality of impulsive choice hinges on environment structure
- Social Psychological and Personality Science
, 2012
"... Abstract Impulsivity is a stable personality trait associated with myopic choice behavior that favors immediate rewards over larger, delayed rewards and is often characterized as maladaptive inside and outside of the laboratory. An alternative view suggests that the consequences of trait impulsivit ..."
Abstract
-
Cited by 4 (4 self)
- Add to MetaCart
(Show Context)
Abstract Impulsivity is a stable personality trait associated with myopic choice behavior that favors immediate rewards over larger, delayed rewards and is often characterized as maladaptive inside and outside of the laboratory. An alternative view suggests that the consequences of trait impulsivity depend on the nature of the task environment. On this view, the optimal level of impulsivity varies across task payoff structures. This hypothesis is tested in two dynamic decision-making tasks that differ in the relative payoffs of delayed and immediate rewards. In a task that favors delayed rewards to immediate rewards, high-impulsive participants perform worse than low-impulsive participants. In contrast, in a task that favors immediate rewards over delayed rewards, high-impulsive participants outperform low-impulsive participants. These results suggest a more nuanced conceptualization of trait impulsivity as it applies to rewards-related decision making that may help explain the variability observed in this trait across individuals.
When, What, and How Much to Reward in Reinforcement Learning‐Based Models of Cognition
- Cognitive science
, 2012
"... Reinforcement learning approaches to cognitive modeling represent task acquisition as learning to choose the sequence of steps that accomplishes the task while maximizing a reward. However, an apparently unrecognized problem for modelers is choosing when, what, and how much to reward; that is, when ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
(Show Context)
Reinforcement learning approaches to cognitive modeling represent task acquisition as learning to choose the sequence of steps that accomplishes the task while maximizing a reward. However, an apparently unrecognized problem for modelers is choosing when, what, and how much to reward; that is, when (the moment: end of trial, subtask, or some other interval of task performance), what (the objective function: e.g., performance time or performance accuracy), and how much (the magni-tude: with binary, categorical, or continuous values). In this article, we explore the problem space of these three parameters in the context of a task whose completion entails some combination of 36 state–action pairs, where all intermediate states (i.e., after the initial state and prior to the end state) represent progressive but partial completion of the task. Different choices produce profoundly different learning paths and outcomes, with the strongest effect for moment. Unfortunately, there is little discussion in the literature of the effect of such choices. This absence is disappointing, as the choice of when, what, and how much needs to be made by a modeler for every learning model.
How Long Have I Got? Making Optimal Visit Durations in a Dual-Task Setting
"... Can people multitask optimally? We use a dual-task paradigm in which participants had to enter digits while monitoring a randomly moving cursor. Participants earned points for entering digits correctly and were docked points if they let the cursor drift outside of a target area. The severity of the ..."
Abstract
-
Cited by 1 (1 self)
- Add to MetaCart
(Show Context)
Can people multitask optimally? We use a dual-task paradigm in which participants had to enter digits while monitoring a randomly moving cursor. Participants earned points for entering digits correctly and were docked points if they let the cursor drift outside of a target area. The severity of the tracking penalty was varied between conditions. Participants therefore had to decide how long to leave the tracking task unattended. As expected, participants left the tracking task for longer when the penalty was less severe and also when the cursor moved less erratically. To test whether participants were adjusting their behavior in an optimal manner, observed behavior was compared to a prediction of the optimal visit duration for each condition. Overall, the degree of correspondence between the observed behavior and the predicted optimum was very good, suggesting that people can multitask in a near optimal fashion given explicit feedback on their performance.
Reviewed by:
, 2012
"... doi: 10.3389/fpsyg.2011.00398 The nature of belief-directed exploratory choice in human decision-making ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
(Show Context)
doi: 10.3389/fpsyg.2011.00398 The nature of belief-directed exploratory choice in human decision-making
Reprints and permission: sagepub.com/journalsPermissions.nav
"... Decisions are a pervasive part of people’s lives. The impor-tance and impact of these decisions may increase with an indi-vidual’s age. Older adults often work in prominent positions and face numerous important personal decisions, such as which retirement options to select, how to spend their life s ..."
Abstract
- Add to MetaCart
(Show Context)
Decisions are a pervasive part of people’s lives. The impor-tance and impact of these decisions may increase with an indi-vidual’s age. Older adults often work in prominent positions and face numerous important personal decisions, such as which retirement options to select, how to spend their life sav-ings, and how to best live out the remaining years of their lives. Likewise, younger adults must choose which career path to take, which college to attend, and when to buy a house. The importance of decision making throughout life makes it criti-cal to understand how age affects decision-making strategies. Decisions rarely occur without context. Often, the rewards available from each option depend on previous choices. One’s immediate job prospects or retirement investment options are dependent on the state that one has reached. Generally, one cannot apply for teaching jobs without first deciding to attend
Reprints and permission:
"... Decisions are a pervasive part of people’s lives. The impor-tance and impact of these decisions may increase with an indi-vidual’s age. Older adults often work in prominent positions and face numerous important personal decisions, such as which retirement options to select, how to spend their life s ..."
Abstract
- Add to MetaCart
(Show Context)
Decisions are a pervasive part of people’s lives. The impor-tance and impact of these decisions may increase with an indi-vidual’s age. Older adults often work in prominent positions and face numerous important personal decisions, such as which retirement options to select, how to spend their life sav-ings, and how to best live out the remaining years of their lives. Likewise, younger adults must choose which career path to take, which college to attend, and when to buy a house. The importance of decision making throughout life makes it criti-cal to understand how age affects decision-making strategies. Decisions rarely occur without context. Often, the rewards available from each option depend on previous choices. One’s immediate job prospects or retirement investment options are dependent on the state that one has reached. Generally, one cannot apply for teaching jobs without first deciding to attend
Of matchers and maximizers: How competition shapes choice under risk and uncertainty
"... a b s t r a c t In a world of limited resources, scarcity and rivalry are central challenges for decision makers-animals foraging for food, corporations seeking maximal profits, and athletes training to win, all strive against others competing for the same goals. In this article, we establish the r ..."
Abstract
- Add to MetaCart
(Show Context)
a b s t r a c t In a world of limited resources, scarcity and rivalry are central challenges for decision makers-animals foraging for food, corporations seeking maximal profits, and athletes training to win, all strive against others competing for the same goals. In this article, we establish the role of competitive pressures for the facilitation of optimal decision making in simple sequential binary choice tasks. In two experiments, competition was introduced with a computerized opponent whose choice behavior reinforced one of two strategies: If the opponent probabilistically imitated participant choices, probability matching was optimal; if the opponent was indifferent, probability maximizing was optimal. We observed accurate asymptotic strategy use in both conditions irrespective of the provision of outcome probabilities, suggesting that participants were sensitive to the differences in opponent behavior. An analysis of reinforcement learning models established that computational conceptualizations of opponent behavior are critical to account for the observed divergence in strategy adoption. Our results provide a novel appraisal of probability matching and show how this individually 'irrational' choice phenomenon can be socially adaptive under competition.
DOI 10.3758/s13423-012-0324-9
"... Heterogeneity of strategy use in the Iowa gambling task: A comparison of win-stay/lose-shift and reinforcement learning models ..."
Abstract
- Add to MetaCart
(Show Context)
Heterogeneity of strategy use in the Iowa gambling task: A comparison of win-stay/lose-shift and reinforcement learning models