6
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: not found
      • Article: not found

      The credit assignment problem in cortico‐basal ganglia‐thalamic networks: A review, a problem and a possible solution

      Read this article at

      ScienceOpenPublisherPubMed
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          <p class="first" id="d1646855e117">The question of how cortico-basal ganglia-thalamic (CBGT) pathways use dopaminergic feedback signals to modify future decisions has challenged computational neuroscientists for decades. Reviewing the literature on computational representations of dopaminergic corticostriatal plasticity, we show how the field is converging on a normative, synaptic-level learning algorithm that elegantly captures both neurophysiological properties of CBGT circuits and behavioral dynamics during reinforcement learning. Unfortunately, the computational studies that have led to this normative algorithmic model have all relied on simplified circuits that use abstracted action-selection rules. As a result, the application of this corticostriatal plasticity algorithm to a full model of the CBGT pathways immediately fails because the spatiotemporal distance between integration (corticostriatal circuits), action selection (thalamocortical loops) and learning (nigrostriatal circuits) means that the network does not know which synapses should be reinforced to favor previously rewarding actions. We show how observations from neurophysiology, in particular the sustained activation of selected action representations, can provide a simple means of resolving this credit assignment problem in models of CBGT learning. Using a biologically realistic spiking model of the full CBGT circuit, we demonstrate how this solution can allow a network to learn to select optimal targets and to relearn action-outcome contingencies when the environment changes. This simple illustration highlights how the normative framework for corticostriatal plasticity can be expanded to capture macroscopic network dynamics during learning and decision-making. </p>

          Related collections

          Author and article information

          Contributors
          (View ORCID Profile)
          (View ORCID Profile)
          Journal
          European Journal of Neuroscience
          Eur J Neurosci
          Wiley
          0953-816X
          1460-9568
          May 04 2020
          Affiliations
          [1 ]Department of Mathematics Center for the Neural Basis of Cognition University of Pittsburgh Pittsburgh PA USA
          [2 ]Department de Matemàtiques i Informàtica Institute of Applied Computing and Community Code Universitat de les Illes Balears Palma Spain
          [3 ]Carnegie Mellon Neuroscience Institute Carnegie Mellon University Pittsburgh PA USA
          [4 ]Micron School of Materials Science and Engineering Boise State University Boise ID USA
          [5 ]Department of Psychology Center for the Neural Basis of Cognition Carnegie Mellon University Pittsburgh PA USA
          Article
          10.1111/ejn.14745
          32302439
          abe28012-e842-42bd-8be3-0295756b8b3d
          © 2020

          http://onlinelibrary.wiley.com/termsAndConditions#vor

          http://doi.wiley.com/10.1002/tdm_license_1.1

          History

          Comments

          Comment on this article