How humans achieve long-term goals in an uncertain environment, via repeated trials and noisy observations, is an important problem in cognitive science. We investigate this behavior in the context of a multi-armed bandit task. We compare human behavior to a variety of models that vary in their representational and computational complexity. Our results show that subjects' choices, on a trial-to-trial basis, are best captured by a "forgetful" Bayesian iterative learning model in combination with a partially myopic decision policy known as Knowledge Gradient. This model accounts for subjects' trial-by-trial choices better than a number of previously proposed models, including optimal Bayesian learning and risk minimization, epsilon-greedy, and win-stay-lose-shift. It has the added benefit of performing closer to the optimal Bayesian model than any of the other heuristic models of the same computational complexity (all of which are significantly less complex than the optimal model). These results advance the theoretical understanding of how humans negotiate the tension between exploration and exploitation in a noisy, imperfectly known environment.
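To make the combination of forgetful learning and a Knowledge Gradient policy concrete, here is a minimal sketch for a Bernoulli bandit. It is an illustration under stated assumptions, not the authors' implementation: the abstract gives no equations, so the discretized grid, the flat prior, the forgetting parameter `gamma`, and all function names (`observe`, `forget`, `kg_choice`) are hypothetical. Forgetting is modeled as mixing each belief back toward the prior, and the decision rule uses the commonly stated finite-horizon Knowledge Gradient form (current expected reward plus remaining trials times the expected one-step gain in the best mean). For simplicity, the look-ahead ignores forgetting.

```python
# Hypothetical sketch; the grid, flat prior, and function names are assumptions.
N_GRID = 50
GRID = [(i + 0.5) / N_GRID for i in range(N_GRID)]  # candidate reward rates in (0, 1)
PRIOR = [1.0 / N_GRID] * N_GRID                     # assumed flat prior


def mean(belief):
    """Expected reward rate under a belief (a list of probabilities over GRID)."""
    return sum(theta * p for theta, p in zip(GRID, belief))


def observe(belief, reward):
    """Bayes update for one Bernoulli outcome (reward is 1 or 0)."""
    weights = [(theta if reward else 1.0 - theta) * p
               for theta, p in zip(GRID, belief)]
    total = sum(weights)
    return [w / total for w in weights]


def forget(belief, gamma):
    """Mix the belief back toward the prior; gamma = 1 means no forgetting."""
    return [gamma * p + (1.0 - gamma) * p0 for p, p0 in zip(belief, PRIOR)]


def kg_choice(beliefs, trials_left):
    """Pick the arm maximizing: current mean + trials_left * one-step gain.

    Requires at least two arms. The look-ahead applies the Bayes update only
    (no forgetting), which is a simplifying assumption of this sketch.
    """
    means = [mean(b) for b in beliefs]
    best_now = max(means)
    scores = []
    for k, belief in enumerate(beliefs):
        best_other = max(means[:k] + means[k + 1:])
        p_reward = means[k]  # predictive probability of a reward on arm k
        expected_best = 0.0
        for reward, prob in ((1, p_reward), (0, 1.0 - p_reward)):
            new_mean = mean(observe(belief, reward))
            expected_best += prob * max(new_mean, best_other)
        scores.append(means[k] + trials_left * (expected_best - best_now))
    return max(range(len(beliefs)), key=lambda k: scores[k])
```

In a simulated session, one would call `observe` on the chosen arm after each outcome and then `forget` on every arm before the next choice; applying forgetting to unchosen arms is itself an assumption of this sketch. With no trials remaining the rule reduces to greedy choice, while a long remaining horizon favors arms whose outcomes could still change which arm looks best.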
Author Information
Shunan Zhang (UC San Diego)
Angela Yu (UC San Diego)
More from the Same Authors
- 2021 : Panel I: Human decisions
  Jennifer Trueblood · Alex Peysakhovich · Angela Yu · Ori Plonsky · Tal Yarkoni · Daniel Bjorkegren
- 2019 : Panel Discussion led by Grace Lindsay
  Grace Lindsay · Blake Richards · Doina Precup · Jacqueline Gottlieb · Jeff Clune · Jane Wang · Richard Sutton · Angela Yu · Ida Momennejad
- 2019 : Invited Talk #6: Features or Bugs: Synergistic Idiosyncrasies in Human Learning and Decision-Making
  Angela Yu
- 2018 Poster: Why so gloomy? A Bayesian explanation of human pessimism bias in the multi-armed bandit task
  Dalin Guo · Angela Yu
- 2018 Poster: Demystifying excessively volatile human learning: A Bayesian persistent prior and a neural approximation
  Chaitanya Ryali · Gautam Reddy · Angela Yu
- 2018 Poster: Beauty-in-averageness and its contextual modulations: A Bayesian statistical account
  Chaitanya Ryali · Angela Yu
- 2017 : Computational modeling of human face processing
  Angela Yu
- 2017 : Workshop overview
  Michael Mozer · Angela Yu · Brenden Lake
- 2017 Workshop: Cognitively Informed Artificial Intelligence: Insights From Natural Intelligence
  Michael Mozer · Brenden Lake · Angela Yu
- 2013 Poster: Context-sensitive active sensing in humans
  Sheeraz Ahmad · He Huang · Angela Yu
- 2012 Poster: Strategic Impatience in Go/NoGo versus Forced-Choice Decision-Making
  Pradeep Shenoy · Angela Yu
- 2012 Oral: Strategic Impatience in Go/NoGo versus Forced-Choice Decision-Making
  Pradeep Shenoy · Angela Yu
- 2010 Oral: A rational decision making framework for inhibitory control
  Pradeep Shenoy · Rajesh PN Rao · Angela Yu
- 2010 Poster: A rational decision making framework for inhibitory control
  Pradeep Shenoy · Rajesh PN Rao · Angela Yu
- 2008 Poster: Sequential effects: Superstition or rational behavior?
  Angela Yu · Jonathan D Cohen
- 2008 Spotlight: Sequential effects: Superstition or rational behavior?
  Angela Yu · Jonathan D Cohen
- 2007 Spotlight: Sequential Hypothesis Testing under Stochastic Deadlines
  Peter Frazier · Angela Yu
- 2007 Poster: Sequential Hypothesis Testing under Stochastic Deadlines
  Peter Frazier · Angela Yu
- 2006 Poster: Optimal Change-Detection and Spiking Neurons
  Angela Yu