0
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Neural and computational underpinnings of biased confidence in human reinforcement learning

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          While navigating a fundamentally uncertain world, humans and animals constantly evaluate the probability of their decisions, actions or statements being correct. When explicitly elicited, these confidence estimates typically correlates positively with neural activity in a ventromedial-prefrontal (VMPFC) network and negatively in a dorsolateral and dorsomedial prefrontal network. Here, combining fMRI with a reinforcement-learning paradigm, we leverage the fact that humans are more confident in their choices when seeking gains than avoiding losses to reveal a functional dissociation: whereas the dorsal prefrontal network correlates negatively with a condition-specific confidence signal, the VMPFC network positively encodes task-wide confidence signal incorporating the valence-induced bias. Challenging dominant neuro-computational models, we found that decision-related VMPFC activity better correlates with confidence than with option-values inferred from reinforcement-learning models. Altogether, these results identify the VMPFC as a key node in the neuro-computational architecture that builds global feeling-of-confidence signals from latent decision variables and contextual biases during reinforcement-learning.

          Abstract

          The mechanism of confidence formation in learning remains poorly understood. Here, the authors show that both dorsal and ventral prefrontal networks encode confidence, but only the ventral network incorporates the valence-induced bias.

          Related collections

          Most cited references88

          • Record: found
          • Abstract: found
          • Article: not found

          The neural basis of human error processing: reinforcement learning, dopamine, and the error-related negativity.

          The authors present a unified account of 2 neural systems concerned with the development and expression of adaptive behaviors: a mesencephalic dopamine system for reinforcement learning and a "generic" error-processing system associated with the anterior cingulate cortex. The existence of the error-processing system has been inferred from the error-related negativity (ERN), a component of the event-related brain potential elicited when human participants commit errors in reaction-time tasks. The authors propose that the ERN is generated when a negative reinforcement learning signal is conveyed to the anterior cingulate cortex via the mesencephalic dopamine system and that this signal is used by the anterior cingulate cortex to modify performance on the task at hand. They provide support for this proposal using both computational modeling and psychophysiological experimentation.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Normalization as a canonical neural computation.

            There is increasing evidence that the brain relies on a set of canonical neural computations, repeating them across brain regions and modalities to apply similar operations to different problems. A promising candidate for such a computation is normalization, in which the responses of neurons are divided by a common factor that typically includes the summed activity of a pool of neurons. Normalization was developed to explain responses in the primary visual cortex and is now thought to operate throughout the visual system, and in many other sensory modalities and brain regions. Normalization may underlie operations such as the representation of odours, the modulatory effects of visual attention, the encoding of value and the integration of multisensory information. Its presence in such a diversity of neural systems in multiple species, from invertebrates to mammals, suggests that it serves as a canonical neural computation.
              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Measuring utility by a single-response sequential method.

                Bookmark

                Author and article information

                Contributors
                chihchung.ting@uni-hamburg.de
                j.b.engelmann@uva.nl
                mael.lebreton@psemail.eu
                Journal
                Nat Commun
                Nat Commun
                Nature Communications
                Nature Publishing Group UK (London )
                2041-1723
                28 October 2023
                28 October 2023
                2023
                : 14
                : 6896
                Affiliations
                [1 ]General Psychology, Universität Hamburg, ( https://ror.org/00g30e956) Von-Melle-Park 11, 20146 Hamburg, Germany
                [2 ]CREED, Amsterdam School of Economics (ASE), Universiteit van Amsterdam, ( https://ror.org/04dkp9463) Roetersstraat 11, 1018 WB Amsterdam, the Netherlands
                [3 ]Swiss Center for Affective Science, Faculty of Psychology and Educational Sciences, University of Geneva, ( https://ror.org/01swzsf04) Chem. des Mines 9, 1202 Genève, Switzerland
                [4 ]GRID grid.440907.e, ISNI 0000 0004 1784 3645, Département d’Études Cognitives, École Normale Supérieure, , PSL Research University, ; 29 rue d’Ulm, 75230, Paris cedex 05, France
                [5 ]GRID grid.462870.f, ISNI 0000 0004 1808 0475, Laboratoire de Neurosciences Cognitives et Computationnelles, Institut National de la Santé et de la Recherche Médicale, ; 29 rue d’Ulm 75230, Paris cedex 05, France
                [6 ]The Tinbergen Institute, ( https://ror.org/054xxtt73) Gustav Mahlerplein 117, 1082 MS Amsterdam, the Netherlands
                [7 ]Economics of Human Behavior group, Paris-Jourdan Sciences Économiques UMR8545, Paris School of Economics, ( https://ror.org/01qtp1053) 48 Boulevard Jourdan, 75014 Paris, France
                Author information
                http://orcid.org/0000-0002-0620-128X
                http://orcid.org/0000-0003-2332-3104
                http://orcid.org/0000-0001-5768-6646
                http://orcid.org/0000-0001-6493-8792
                http://orcid.org/0000-0002-2071-4890
                Article
                42589
                10.1038/s41467-023-42589-5
                10613217
                37898640
                255896ab-7322-41de-b72c-6c6d3ea94b5d
                © The Author(s) 2023

                Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

                History
                : 8 March 2023
                : 16 October 2023
                Funding
                Funded by: GSSA, MOE Taiwan Scholarship (1081007012)
                Funded by: FundRef https://doi.org/10.13039/501100001711, Schweizerischer Nationalfonds zur Förderung der Wissenschaftlichen Forschung (Swiss National Science Foundation);
                Award ID: Ambizione PZ00P3 174127
                Award Recipient :
                Funded by: FundRef https://doi.org/10.13039/100010663, EC | EU Framework Programme for Research and Innovation H2020 | H2020 Priority Excellent Science | H2020 European Research Council (H2020 Excellent Science - European Research Council);
                Award ID: RaReMem-101043804
                Award Recipient :
                Funded by: FundRef https://doi.org/10.13039/501100001665, Agence Nationale de la Recherche (French National Research Agency);
                Award ID: ANR-21-CE23-0002-02
                Award ID: ANR-21-CE37-0008-01
                Award ID: ANR-21-CE28-0024-01
                Award Recipient :
                Categories
                Article
                Custom metadata
                © Springer Nature Limited 2023

                Uncategorized
                human behaviour,decision making,decision,learning algorithms
                Uncategorized
                human behaviour, decision making, decision, learning algorithms

                Comments

                Comment on this article