Skip to main content
Have a personal or library account? Click to login
Learning and Choice in Mood Disorders: Searching for the Computational Parameters of Anhedonia Cover

Learning and Choice in Mood Disorders: Searching for the Computational Parameters of Anhedonia

Open Access
|Dec 2017

Figures & Tables

Figure 1. 

Schematic of the learning and choice models discussed in the article. a) Illustrative example of reinforcement learning models. b) Illustrative example of reaction time models (e.g., drift diffusion models). c) Illustrative example of models of economic choice under uncertainty.

Table 1. 

Reinforcement learning model parameters that could be altered in anhedonia

Construct Description Computational instantiation Evidence implicating Evidence exonerating Missing evidence
Value-guided behaviorCapacity of value representations to guide choiceValue (Equation 1) Most studies report broadly intact acquisition
Feedback insensitivity“Blunted” response to feedback, both positive and negativeReduced learning rate (Equation 2)Chase, Frank et al. (2010), Steele et al. (2007)Rothkirch, Tonn, Kohler, & Sterzer (2017)
Enhanced punishment sensitivityRelatively enhanced response to negative feedbackEnhanced learning rate if outcome is aversiveBeevers et al. (2013), Herzallah et al. (2013), Maddox et al. (2012), Murphy, Michael, Robbins, & Sahakian (2003), Taylor Tavares et al. (2008)Cavanagh, Bismark, Frank, & Allen (2011), Chase, Frank et al. (2010), Whitmer, Frank, & Gotlib (2012)
Reduced reward sensitivityRelatively reduced response to positive feedbackReduced learning rate if outcome is appetitiveBeevers et al. (2013), DelDonno et al. (2015), Herzallah et al. (2013), Kunisato et al. (2012), Maddox et al. (2012), O. J. Robinson et al. (2012), Treadway, Bossaller, Shelton, & Zald (2012)Cavanagh et al. (2011), Chase, Frank et al. (2010), Chase, Michael, Bullmore, Sahakian, & Robbins (2010), Whitmer et al. (2012)
Pavlovian biasInfluence of reward- or punishment-predictive stimuli on behaviorSee Equation 3 Bylsma, Morris, & Rottenberg (2008), Huys, Golzer et al. (2016), Radke, Guths, Andre, Muller, & de Bruijn (2014); see Mkrtchian, Aylward, Dayan, Roiser, & Robinson (2017) for anxiety
TemperatureStochastic choiceTemperature (Equation 4)Huys et al. (2012), Huys et al. (2013), Kunisato et al. (2012); for indirect evidence, see Blanco, Otto, Maddox, Beevers, & Love (2013), Clery-Melin et al. (2011); for trend level, see Chase et al. (2017)Chung et al. (2017), Rothkirch et al. (2017)
Reduced outcome magnitude sensitivityLinear or nonlinear scaling of utility across increasing expected value[Outcome*sensitivity] or [Outcome^sensitivity]Indirect evidence: Herzallah et al. (2013), Treadway et al. (2012)
Effort costsSuppression of responding by effort[Outcome value–effort cost]Hershenberg et al. (2016), Treadway et al. (2012), Yang et al. (2016), Yang et al. (2014)No simple increase in effort costs: Clery-Melin et al. (2011), Sherdell, Waugh, & Gotlib (2012)
Working memory/“model-based” learningRapid adaptation of behavior in response to feedbackVarious approaches, e.g., control choice in terms of previous outcome (Myers et al., 2016)N/AN/ALittle direct examination in MDD
Uncertainty-modulated learningIncreases or decreases in learning rate in response to uncertaintyModulation of learning rate (e.g., Equation 2) by stimulus/outcome uncertaintyN/AN/ALittle direct examination in MDD (but see Browning et al., 2015, on anxiety)

[i] Note. Here we define indirect evidence as suggestive that the construct might be significant, but this was not assessed directly via a modeling or other analytic strategy. To complete this table, combinations of the following terms were used in systematic searches: reward, model-based learning, Pavlovian, exploration, decision, choice, punishment learning, with anhedonia or major depression. The goal of the table is to provide an overview of salient exemplars of existing data from studies incorporating depressed, dysphoric, or euthymic individuals, which may be particularly relevant for the constructs listed.

Figure 2. 

Moderation of relationship between the risk aversion parameter (see Equation 5) and risk preference by temperature. High temperatures are red; low temperatures are blue. A low score on the risk aversion parameter amplifies the utility of small wins, leading to risk aversion, but this is only clearly manifest in behavior if the temperature is low. Likewise, a high score reduces the utility of small wins, leading to risk seeking, but again, only if the temperature is low.

Table 2. 

Exploring reward processing in the striatum

Study Groups Outcome magnitude Probability (%) Response contingent Task length Striatum differences Reported null findings
Hagele et al. (2015)AUD, SZ, MDD, BD (manic), ADHD, HC±€0.1, €0.6, €367Yes2 × 72 trialsRight VS: Increasing depression severity reduces reward anticipation vs. neutral
Stoy et al. (2012)MDD (before and during treatment), HC€0.1, €0.6, €367Yes2 × 72 trialsVS: HC > MDD, reward and loss anticipation vs. neutral—partially recovers after treatment
Knutson, Bhanji, Cooney, Atlas, & Gotlib (2008)Unmedicated MDD, HC± $0.1, $0.2, $1, $567Yes (individually calibrated RT threshold)2 × 90 trialsPutamen: HC > MDD, reward outcome vs. neutralVS: Reward anticipation
Admon et al. (2015)MDD, HCVariable: mean +$2.15, –$250No; instructed5 × 24 trialsCaudate: HC > MDD, reward and loss outcomes vs. neutral
Wacker, Dillon, & Pizzagalli (2009)Healthy individuals varying in anhedonic symptomsVariable: mean +$2.15, –$250No; instructed5 × 24 trialsVS: Increasing anhedonia reduces reward outcome vs. neutralVS: Reward anticipation
Pizzagalli et al. (2009)MDD, HCVariable: mean +$2.15, –$250No; instructed5 × 24 trialsPutamen: HC > MDD, reward anticipation vs. neutral; Caudate/VS: HC > MDD, reward outcome vs. neutral;VS: Reward anticipation
Smoski, Rittenberg, & Dichter (2011)MDD, HCMoney (+$1), IAPS pictures67Yes2 × 2 × 40 trialsPutamen: Anticipation Group × Reward Type interactionWidespread anticipation-related activation; little outcome-related activation
Arrondo et al. (2015)MDD, SZ, HCHigh (£1), low (£0.01)70 high win, 30 low winNo; instructed30 win, 30 neutral trialsVS: HC > MDD/SZ, reward anticipation; relationship of VS anticipation activation with anhedonia in SZ, not MDD
Dichter, Kozink, McClernon, & Smoski (2012)Remitted MDD, HC+$1 for wins67Yes20 potential win, 20 neutralCaudate: remitted MDD > HC, reward anticipation
Mori et al. (2016)Students with/without subthreshold depression±¥0, ¥20, ¥100, ¥500N.S.N.S.40 gain, 40 loss, 10 neutralDifferences not within striatumVS: Reward anticipation
Misaki, Suzuki, Savitz, Drevets, & Bodurka (2016)MDD, HC±$0.2, $166Yes (individually calibrated RT threshold)15 high win, 15 low win, 15 neutral, 15 high loss, 15 low lossLeft VS: HC > MDD during high win anticipationNo differences seen at low reward anticipation in left VS or low/high anticipation on right VS; no outcome- locked differences but overall activations not strong
Ubl et al. (2015)Remitted MDD, HCHigh (±€2), low (±€0.2) wins and losses50% (approx.)Yes (individually calibrated RT threshold)N.S.Differences not within striatum
Stringaris et al. (2015)Clinical, subthreshold depression, HC (adolescent)10, 2, 0 points66 (approx.)Yes (individually calibrated RT threshold)66 trialsVS: HC > clinical/ subthreshold depression, reward anticipation; reduced VS activation to reward anticipation also predicted transition to depression at 2-year follow-up and was related to symptoms of anhedonia. VS: Subthreshold depression > HC, positive outcomes Subthreshold depression and anhedonia > HC, negative outcomes

[i] Note. Table summarizing design and findings of studies of MDD or other depression-related cohorts that employed a reward-based version of the MID task. ADHD = attention-deficit hyperactivity disorder. AUD = alcohol use disorder. BD = bipolar disorder. N.S. = not stated. RT = reaction time. The contents of the table represent all the studies we were able to find using systematic searches for monetary incentive delay fMRI studies. A recent study of Admon et al. (2017) was not included, as it was focused on a dopaminergic drug manipulation, but it also found significant group (control > MDD) differences in the VS coupled to outcomes in the placebo condition.

Language: English
Submitted on: May 20, 2016
Accepted on: Jun 6, 2017
Published on: Dec 1, 2017
Published by: MIT Press
In partnership with: Paradigm Publishing Services
Publication frequency: 1 issue per year

© 2017 Oliver J. Robinson, Henry W. Chase, published by MIT Press
This work is licensed under the Creative Commons Attribution 4.0 License.