
Parallel Representation of Value-Based and Finite State-Based Strategies in the Ventral and Dorsal Striatum

Author

Listed:
  • Makoto Ito
  • Kenji Doya

Abstract

Previous theoretical studies of animal and human behavioral learning have focused on the dichotomy of the value-based strategy using action value functions to predict rewards and the model-based strategy using internal models to predict environmental states. However, animals and humans often take simple procedural behaviors, such as the “win-stay, lose-switch” strategy without explicit prediction of rewards or states. Here we consider another strategy, the finite state-based strategy, in which a subject selects an action depending on its discrete internal state and updates the state depending on the action chosen and the reward outcome. By analyzing choice behavior of rats in a free-choice task, we found that the finite state-based strategy fitted their behavioral choices more accurately than value-based and model-based strategies did. When fitted models were run autonomously with the same task, only the finite state-based strategy could reproduce the key feature of choice sequences. Analyses of neural activity recorded from the dorsolateral striatum (DLS), the dorsomedial striatum (DMS), and the ventral striatum (VS) identified significant fractions of neurons in all three subareas for which activities were correlated with individual states of the finite state-based strategy. The signal of internal states at the time of choice was found in DMS, and for clusters of states was found in VS. In addition, action values and state values of the value-based strategy were encoded in DMS and VS, respectively. These results suggest that both the value-based strategy and the finite state-based strategy are implemented in the striatum.

Author Summary: The neural mechanism of decision-making, a cognitive process to select one action among multiple possibilities, is a fundamental issue in neuroscience. Previous studies have revealed the roles of the cerebral cortex and the basal ganglia in decision-making, by assuming that subjects take a value-based reinforcement learning strategy, in which the expected reward for each action candidate is updated. However, animals and humans often use simple procedural strategies, such as “win-stay, lose-switch.” In this study, we consider a finite state-based strategy, in which a subject acts depending on its discrete internal state and updates the state based on reward feedback. We found that the finite state-based strategy could reproduce the choice behavior of rats in a binary choice task with higher accuracy than the value-based strategy. Interestingly, neuronal activity in the striatum, a crucial brain region for reward-based learning, encoded information regarding both strategies. These results suggest that both the value-based strategy and the finite state-based strategy are implemented in the striatum.
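The two strategies contrasted in the abstract can be sketched in a few lines of Python. This is only an illustrative sketch, not the authors' fitted models: the class names, the two-state "win-stay, lose-switch" table, the learning rate `alpha`, and the simple running-average value update are all assumptions made for the example. The finite state-based agent chooses from its discrete internal state and moves to a new state based on (state, action, reward); the value-based agent instead updates an expected reward for each action.

```python
import random
from dataclasses import dataclass, field

LEFT, RIGHT = 0, 1  # the two options in a binary choice task


@dataclass
class FiniteStateAgent:
    """Hypothetical finite state-based agent (illustrative, not the paper's model)."""
    policy: dict       # policy[state] = probability of choosing LEFT in that state
    transition: dict   # transition[(state, action, reward)] = next internal state
    state: int = 0

    def act(self, rng):
        # The action depends only on the current discrete internal state.
        return LEFT if rng.random() < self.policy[self.state] else RIGHT

    def update(self, action, reward):
        # The state is updated from the chosen action and the reward outcome.
        self.state = self.transition[(self.state, action, reward)]


def win_stay_lose_switch():
    """Win-stay, lose-switch as a two-state machine: state i always picks action i."""
    policy = {0: 1.0, 1: 0.0}
    transition = {}
    for state in (0, 1):
        for action in (LEFT, RIGHT):
            transition[(state, action, 1)] = action      # win: stay with this action
            transition[(state, action, 0)] = 1 - action  # lose: switch to the other
    return FiniteStateAgent(policy, transition)


@dataclass
class QLearningAgent:
    """Assumed value-based agent: keeps an expected reward per action."""
    alpha: float = 0.3                                   # assumed learning rate
    q: list = field(default_factory=lambda: [0.0, 0.0])  # action values

    def update(self, action, reward):
        # Move the chosen action's value toward the observed reward.
        self.q[action] += self.alpha * (reward - self.q[action])
```

A short usage example: after an unrewarded left choice, the win-stay, lose-switch agent moves to the state that chooses right, without ever storing a reward prediction, whereas the value-based agent only adjusts its estimate for the chosen action.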

Suggested Citation

  • Makoto Ito & Kenji Doya, 2015. "Parallel Representation of Value-Based and Finite State-Based Strategies in the Ventral and Dorsal Striatum," PLOS Computational Biology, Public Library of Science, vol. 11(11), pages 1-25, November.
  • Handle: RePEc:plo:pcbi00:1004540
    DOI: 10.1371/journal.pcbi.1004540

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1004540
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1004540&type=printable
    Download Restriction: no



    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Richard Freund & Marta Favara & Catherine Porter & Jere Behrman, 2024. "Social Protection and Foundational Cognitive Skills during Adolescence: Evidence from a Large Public Works Program," The World Bank Economic Review, World Bank, vol. 38(2), pages 296-318.
    2. Lisa Katharina Pendt & Iris Reuter & Hermann Müller, 2011. "Motor Skill Learning, Retention, and Control Deficits in Parkinson's Disease," PLOS ONE, Public Library of Science, vol. 6(7), pages 1-10, July.
    3. Francesco Ceccarelli & Lorenzo Ferrucci & Fabrizio Londei & Surabhi Ramawat & Emiliano Brunamonti & Aldo Genovesio, 2023. "Static and dynamic coding in distinct cell types during associative learning in the prefrontal cortex," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    4. Johannes Algermissen & Jennifer C. Swart & René Scheeringa & Roshan Cools & Hanneke E. M. den Ouden, 2024. "Prefrontal signals precede striatal signals for biased credit assignment in motivational learning biases," Nature Communications, Nature, vol. 15(1), pages 1-19, December.
    5. Naveen Sendhilnathan & Anna Ipata & Michael E. Goldberg, 2021. "Mid-lateral cerebellar complex spikes encode multiple independent reward-related signals during reinforcement learning," Nature Communications, Nature, vol. 12(1), pages 1-10, December.
