Anhedonic Traits Do Not Impair Performance in a 3-Arm Bandit Task

Arjun Ramaswamy; Yumeya Yamamori; Umesh Vivekananda; Vladimir Litvak; Jonathan P. Roiser

doi:10.5334/cpsy.135

Anhedonic Traits Do Not Impair Performance in a 3-Arm Bandit Task

Computational Psychiatry

Volume 10 (2026): Issue 1

By: Arjun Ramaswamy , Yumeya Yamamori , Umesh Vivekananda , Vladimir Litvak and Jonathan P. Roiser

Open Access

|Apr 2026

Abstract

Anhedonia, a transdiagnostic symptom marked by diminished reward sensitivity, is often linked to impairments in reinforcement learning (RL). Standard tasks (e.g., the 4-arm bandit) can place substantial demands on participants and may blur valuation with other processes. We therefore adapted a three-arm bandit (3AB) task from Seymour et al. (2012), incorporating design features intended to lessen task demands (fewer options; denser feedback) while enabling separate estimation of reward and punishment learning rates and sensitivities. In an online sample pre-screened for anhedonia (N = 206; 111 anhedonic, 95 non-anhedonic), hierarchical Bayesian modelling using a four-parameter specification showed no credible group differences in reward learning rate, punishment learning rate, reward sensitivity, or punishment sensitivity; Bayes factors favoured the null (BF₀₁ = 3.36–5.96). Model-agnostic win-stay/lose-shift strategies likewise showed no group differences (Welch’s tests, all p > .05). Posterior predictive checks indicated above-chance choice prediction: the model’s highest-probability action matched participants’ actual choices on 59.6% of trials (chance = 33%). Parameter recovery was excellent for valuation parameters (r = 0.96–0.97) and acceptable for learning rates (r = 0.67–0.85). Simulations generated from fitted parameters preserved individual-difference structure, with high correlations between observed and simulated win-stay (r = 0.89 anhedonic; 0.86 non-anhedonic) and moderate correlations for lose-shift (r = 0.62; 0.67), alongside small systematic mean-level biases (simulated win-stay lower by 3.5–4.9 percentage points; simulated lose-shift higher by 12.8–13.2 points). Model comparison showed that lapse-augmented variants achieved marginally better predictive fit, but group comparisons under both lapse models yielded overlapping posteriors with 95% HDIs including zero for all learning, sensitivity, and lapse parameters, indicating that the null findings were robust to inclusion of lapse terms. Non-anhedonic participants also responded more slowly on average than anhedonic participants, which we treat as exploratory. Together, these results suggest that in this 3AB task, anhedonia is not reliably associated with differences in core RL parameters or simple choice strategies, while providing a detailed characterisation of model performance and limitations in an online setting.

References

Ahn, W.-Y., Haines, N., & Zhang, L. (2017). Revealing neuro-computational mechanisms of reinforcement learning and decision-making with the hBayesDM package. Computational Psychiatry, 1, 24–57. 10.1162/CPSY_a_00002
Open DOI Search in Google Scholar Back to article
American Psychiatric Association. (2013). Diagnostic and Statistical Manual of Mental Disorders (5th ed.). American Psychiatric Publishing. 10.1176/appi.books.9780890425596
Open DOI Search in Google Scholar Back to article
Aylward, J., Valton, V., Ahn, W. Y., Bond, R. L., Dayan, P., Roiser, J. P., & Robinson, O. J. (2019). Altered learning under uncertainty in unmedicated mood and anxiety disorders. Nature human behaviour, 3(10), 1116–1123. 10.1038/s41562-019-0628-0
Open DOI Search in Google Scholar Back to article
Berridge, K. C., & Robinson, T. E. (2003). Parsing reward. Trends in Neurosciences, 26(9), 507–513. 10.1016/S0166-2236(03)00233-9
Open DOI Search in Google Scholar Back to article
Collins, A. G., & Frank, M. J. (2012). How much of reinforcement learning is working memory, not reinforcement learning? A behavioral, computational, and neurogenetic analysis. European Journal of Neuroscience, 35(7), 1024–1035. 10.1111/j.1460-9568.2011.07980.x
Open DOI Search in Google Scholar Back to article
Culbreth, A. J., Moran, E. K., & Barch, D. M. (2018). Effort-cost decision-making in psychosis and depression: Could a similar behavioral deficit arise from disparate psychological and neural mechanisms? Psychological Medicine, 48(6), 889–904. 10.1017/S0033291717002525
Open DOI Search in Google Scholar Back to article
Daw, N. D., Gershman, S. J., Seymour, B., Dayan, P., & Dolan, R. J. (2011). Model-based influences on humans’ choices and striatal prediction errors. Neuron, 69(6), 1204–1215. 10.1016/j.neuron.2011.02.027
Open DOI Search in Google Scholar Back to article
Daw, N. D., O’Doherty, J. P., Dayan, P., Seymour, B., & Dolan, R. J. (2006). Cortical substrates for exploratory decisions in humans. Nature, 441(7095), 876–879. 10.1038/nature04766
Open DOI Search in Google Scholar Back to article
Der-Avakian, A., & Markou, A. (2012). The neurobiology of anhedonia and other reward-related deficits. Trends in Neurosciences, 35(1), 68–77. 10.1016/j.tins.2011.11.005
Open DOI Search in Google Scholar Back to article
Eckstein, M. K., Wilbrecht, L., & Collins, A. G. (2021). What do reinforcement learning models measure? Interpreting model parameters in cognition and neuroscience. Current Opinion in Behavioral Sciences, 41, 128–137. 10.1016/j.cobeha.2021.06.004
Open DOI Search in Google Scholar Back to article
Eisenberg, I. W., Bissett, P. G., Enkavi, A. Z., Mazza, G. L., MacKinnon, D. P., Marsch, L. A., & Poldrack, R. A. (2019). Uncovering the structure of self-regulation through data-driven ontology discovery. Nature Communications, 10, 2319. 10.1038/s41467-019-10301-1
Open DOI Search in Google Scholar Back to article
Enkavi, A. Z., Eisenberg, I. W., Bissett, P. G., Mazza, G. L., MacKinnon, D. P., Marsch, L. A., & Poldrack, R. A. (2019). Large-scale analysis of test–retest reliabilities of self-regulation measures. Proceedings of the National Academy of Sciences of the United States of America, 116(12), 5472–5477. 10.1073/pnas.1818430116
Open DOI Search in Google Scholar Back to article
Garfield, J. B. B., Lubman, D. I., & Yücel, M. (2014). Anhedonia in substance use disorders: A systematic review of its nature, course and clinical correlates. Australian & New Zealand Journal of Psychiatry, 48(1), 36–51. 10.1177/0004867413508455
Open DOI Search in Google Scholar Back to article
Halahakoon, D. C., Kieslich, K., O’Driscoll, C., Nair, A., Lewis, G., & Roiser, J. P. (2020). Reward-processing behavior in depressed participants relative to healthy volunteers: A systematic review and meta-analysis. JAMA Psychiatry, 77(12), 1286–1295. 10.1001/jamapsychiatry.2020.2139
Open DOI Search in Google Scholar Back to article
Hall, A. F., Browning, M., & Huys, Q. J. M. (2024). The computational structure of consummatory anhedonia. Trends in Cognitive Sciences, 28(6), 541–553. 10.1016/j.tics.2024.01.006
Open DOI Search in Google Scholar Back to article
Harlé, K. M., Guo, D., Zhang, S., Paulus, M. P., & Yu, A. J. (2017). Anhedonia and anxiety underlying depressive symptomatology have distinct effects on reward-based decision-making. PLOS ONE, 12(10), e0186473. 10.1371/journal.pone.0186473
Open DOI Search in Google Scholar Back to article
Ho, Y.-C., Gau, S. S.-F., Wu, Y.-S., Chen, C.-H., Wang, J.-K., Lee, H.-C., … & Chang, H.-J. (2024). Determining cut-off values and predictors for the Snaith–Hamilton Pleasure Scale: Comparison between clinical and school settings. BJPsych Open, 10(3), e106. 10.1192/bjo.2024.35
Open DOI Search in Google Scholar Back to article
Husain, M., & Roiser, J. P. (2018). Neuroscience of apathy and anhedonia: A transdiagnostic approach. Nature Reviews Neuroscience, 19(8), 470–484. 10.1038/s41583-018-0029-9
Open DOI Search in Google Scholar Back to article
Huys, Q. J. M., Pizzagalli, D. A., Bogdan, R., & Dayan, P. (2013). Mapping anhedonia onto reinforcement learning: A behavioral meta-analysis. Biological Mood & Anxiety Disorders, 3, 12. 10.1186/2045-5380-3-12
Open DOI Search in Google Scholar Back to article
Kieslich, K., Valton, V., & Roiser, J. P. (2022). Pleasure, reward value, prediction error and anhedonia. In D. A. Pizzagalli (Ed.), Anhedonia: Preclinical, translational, and clinical integration (pp. 281–304). Current Topics in Behavioral Neurosciences (Vol. 58). Springer. 10.1007/7854_2021_295
Open DOI Search in Google Scholar Back to article
Kumar, P., Waiter, G., Ahearn, T., Milders, M., Reid, I., & Steele, J. D. (2008). Abnormal temporal-difference reward-learning signals in major depression. Brain, 131(8), 2084–2093. 10.1093/brain/awn136
Open DOI Search in Google Scholar Back to article
Liu, W. H., Wang, L. Z., Zhu, Y. H., Li, M. H., & Chan, R. C. K. (2012). Clinical utility of the Snaith-Hamilton-Pleasure scale in the Chinese settings. BMC Psychiatry, 12, 184. 10.1186/1471-244X-12-184
Open DOI Search in Google Scholar Back to article
Mittmann, G., Thomas, M. F., & Steiner-Hofbauer, V. (2025). The snaith-hamilton pleasure scale (SHAPS) and dimensional anhedonia rating scale (DARS) diverge in measuring anhedonia in a community sample of young adults. Psychiatry Research, 353, 116736. 10.1016/j.psychres.2025.116736
Open DOI Search in Google Scholar Back to article
Mkrtchian, A., Valton, V., & Roiser, J. P. (2023). Reliability of decision-making and reinforcement learning computational parameters. Computational Psychiatry, 7(1), 30–46. 10.5334/cpsy.86
Open DOI Search in Google Scholar Back to article
Nawijn, L., van Zuiden, M., Frijling, J. L., Koch, S. B. J., Veltman, D. J., & Olff, M. (2015). Reward functioning in PTSD: A systematic review exploring the mechanisms underlying anhedonia. Neuroscience & Biobehavioral Reviews, 51, 189–204. 10.1016/j.neubiorev.2015.01.019
Open DOI Search in Google Scholar Back to article
Niu, S., Yin, X., Pan, B., Chen, H., Dai, C., Tong, C., Chen, F., & Feng, X. (2024). Understanding comorbidity between non-suicidal self-injury and depressive symptoms in a clinical sample of adolescents: A network analysis. Neuropsychiatric Disease and Treatment, 20, 1–17. 10.2147/NDT.S443454
Open DOI Search in Google Scholar Back to article
Pike, A. C., & Robinson, O. J. (2022). Reinforcement learning in patients with mood and anxiety disorders vs control individuals: A systematic review and meta-analysis. JAMA Psychiatry, 79(4), 313–322. 10.1001/jamapsychiatry.2022.0051
Open DOI Search in Google Scholar Back to article
Rizvi, S. J., Pizzagalli, D. A., Sproule, B. A., & Kennedy, S. H. (2016). Assessing anhedonia in depression: Potentials and pitfalls. Neuroscience & Biobehavioral Reviews, 65, 21–35. 10.1016/j.neubiorev.2016.03.004
Open DOI Search in Google Scholar Back to article
Rizvi, S. J., Quilty, L. C., Sproule, B. A., Cyriac, A., Bagby, R. M., & Kennedy, S. H. (2015). Development and validation of the Dimensional Anhedonia Rating Scale (DARS) in a community sample and individuals with major depression. Psychiatry Research, 229(1–2), 109–119. 10.1016/j.psychres.2015.07.062
Open DOI Search in Google Scholar Back to article
Romera, I., Delgado-Cohen, H., Perez, T., Caballero, L., & Gilaberte, I. (2008). Factor analysis of the Zung self-rating depression scale in a large sample of patients with major depressive disorder in primary care. BMC Psychiatry, 8(1), 4. 10.1186/1471-244X-8-4
Open DOI Search in Google Scholar Back to article
Seymour, B., Daw, N. D., Roiser, J. P., Dayan, P., & Dolan, R. J. (2012). Serotonin selectively modulates reward value in human decision-making. Journal of Neuroscience, 32(17), 5833–5842. 10.1523/JNEUROSCI.0053-12.2012
Open DOI Search in Google Scholar Back to article
Snaith, R. P., Hamilton, M., Morley, S., Humayan, A., Hargreaves, D., & Trigwell, P. (1995). A scale for the assessment of hedonic tone: The Snaith–Hamilton Pleasure Scale. British Journal of Psychiatry, 167(1), 99–103. 10.1192/bjp.167.1.99
Open DOI Search in Google Scholar Back to article
Spitzer, R. L., Kroenke, K., Williams, J. B., & Löwe, B. (2006). A brief measure for assessing generalized anxiety disorder: The GAD-7. Archives of Internal Medicine, 166(10), 1092–1097. 10.1001/archinte.166.10.1092
Open DOI Search in Google Scholar Back to article
Sutton, R. S., & Barto, A. G. (2018). Reinforcement learning: An introduction (2nd ed.). MIT Press.
Search in Google Scholar Back to article
Treadway, M. T., Buckholtz, J. W., Schwartzman, A. N., Lambert, W. E., & Zald, D. H. (2009). Worth the ‘EEfRT’? The effort expenditure for rewards task as an objective measure of motivation and anhedonia. PLOS ONE, 4(8), e6598. 10.1371/journal.pone.0006598
Open DOI Search in Google Scholar Back to article
Treadway, M. T., & Zald, D. H. (2011). Reconsidering anhedonia in depression: Lessons from translational neuroscience. Neuroscience & Biobehavioral Reviews, 35(3), 537–555. 10.1016/j.neubiorev.2010.06.006
Open DOI Search in Google Scholar Back to article
Vehtari, A., Gelman, A., & Gabry, J. (2017). Practical Bayesian model evaluation using leave-one-out cross-validation and WAIC. Statistics and Computing, 27, 1413–1432. 10.1007/s11222-016-9696-4
Open DOI Search in Google Scholar Back to article
Wellan, S. A., Daniels, A., & Walter, H. (2021). State Anhedonia in Young Healthy Adults: Psychometric Properties of the German Dimensional Anhedonia Rating Scale (DARS) and Effects of the COVID-19 Pandemic. Frontiers in Psychology, 12. 10.3389/fpsyg.2021.682824
Open DOI Search in Google Scholar Back to article
Whitton, A. E., Treadway, M. T., & Pizzagalli, D. A. (2015). Reward processing dysfunction in major depression, bipolar disorder and schizophrenia. Current Opinion in Psychiatry, 28(1), 7–12. 10.1097/YCO.0000000000000122
Open DOI Search in Google Scholar Back to article
Wiecki, T. V., Sofer, I., & Frank, M. J. (2013). HDDM: Hierarchical Bayesian estimation of the Drift-Diffusion Model in Python. Frontiers in neuroinformatics, 7, 14. 10.3389/fninf.2013.00014
Open DOI Search in Google Scholar Back to article
Wilson, R. C., & Collins, A. G. (2019). Ten simple rules for the computational modeling of behavioral data. eLife, 8. 10.7554/eLife.49547
Open DOI Search in Google Scholar Back to article
Winer, E. S., Jordan, D. G., & Collins, A. C. (2019). Conceptualizing anhedonias and implications for depression treatments. Psychology Research and Behavior Management, 12, 325–335. 10.2147/PRBM.S159260
Open DOI Search in Google Scholar Back to article
Yan, X., Ebitz, R. B., Grissom, N., Darrow, D. P., & Herman, A. B. (2025). Distinct computational mechanisms of uncertainty processing explain opposing exploratory behaviors in anxiety and apathy. Biological Psychiatry: Cognitive Neuroscience and Neuroimaging. Advance online publication. 10.1016/j.bpsc.2025.01.005
Open DOI Search in Google Scholar Back to article
Zorowitz, S., Solis, J., Niv, Y., & Bennett, D. (2023). Inattentive responding can induce spurious associations between task behaviour and symptom measures. Nature human behaviour, 7(10), 1667–1681. 10.1038/s41562-023-01640-7
Open DOI Search in Google Scholar Back to article
Zung, W. W. (1965). A Self-Rating Depression Scale. Archives of general psychiatry, 12, 63–70. 10.1001/archpsyc.1965.01720310065008
Open DOI Search in Google Scholar Back to article

Articles in this issue

DOI: https://doi.org/10.5334/cpsy.135 | Journal eISSN: 2379-6227

Journal RSS Feed

Language: English

Submitted on: Feb 4, 2025

Accepted on: Mar 10, 2026

Published on: Apr 13, 2026

Published by: Ubiquity Press

In partnership with: Paradigm Publishing Services

Publication frequency: 1 issue per year

Keywords:

Anhedonia,

reinforcement learning,

3-arm bandit,

Hierarchical Bayesian modelling,

reward and punishment sensitivity,

Online behavioural study

© 2026 Arjun Ramaswamy, Yumeya Yamamori, Umesh Vivekananda, Vladimir Litvak, Jonathan P. Roiser, published by Ubiquity Press
This work is licensed under the Creative Commons Attribution 4.0 License.

Volume 10 (2026): Issue 1