IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1006839.html
   My bibliography  Save this article

The gradient of the reinforcement landscape influences sensorimotor learning

Author

Listed:
  • Joshua G A Cashaback
  • Christopher K Lao
  • Dimitrios J Palidis
  • Susan K Coltman
  • Heather R McGregor
  • Paul L Gribble

Abstract

Consideration of previous successes and failures is essential to mastering a motor skill. Much of what we know about how humans and animals learn from such reinforcement feedback comes from experiments that involve sampling from a small number of discrete actions. Yet, it is less understood how we learn through reinforcement feedback when sampling from a continuous set of possible actions. Navigating a continuous set of possible actions likely requires using gradient information to maximize success. Here we addressed how humans adapt the aim of their hand when experiencing reinforcement feedback that was associated with a continuous set of possible actions. Specifically, we manipulated the change in the probability of reward given a change in motor action—the reinforcement gradient—to study its influence on learning. We found that participants learned faster when exposed to a steep gradient compared to a shallow gradient. Further, when initially positioned between a steep and a shallow gradient that rose in opposite directions, participants were more likely to ascend the steep gradient. We introduce a model that captures our results and several features of motor learning. Taken together, our work suggests that the sensorimotor system relies on temporally recent and spatially local gradient information to drive learning.Author summary: In recent years it has been shown that reinforcement feedback may also subserve our ability to acquire new motor skills. Here we address how the reinforcement gradient influences motor learning. We found that a steeper gradient increased both the rate and likelihood of learning. Moreover, while many mainstream theories posit that we build a full representation of the reinforcement landscape, both our data and model suggest that the sensorimotor system relies primarily on temporally recent and spatially local gradient information to drive learning. Our work provides new insights into how we sample from a continuous action-reward landscape to maximize success.

Suggested Citation

  • Joshua G A Cashaback & Christopher K Lao & Dimitrios J Palidis & Susan K Coltman & Heather R McGregor & Paul L Gribble, 2019. "The gradient of the reinforcement landscape influences sensorimotor learning," PLOS Computational Biology, Public Library of Science, vol. 15(3), pages 1-27, March.
  • Handle: RePEc:plo:pcbi00:1006839
    DOI: 10.1371/journal.pcbi.1006839
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1006839
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1006839&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1006839?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Tversky, Amos & Kahneman, Daniel, 1992. "Advances in Prospect Theory: Cumulative Representation of Uncertainty," Journal of Risk and Uncertainty, Springer, vol. 5(4), pages 297-323, October.
    2. Joshua G A Cashaback & Heather R McGregor & Ayman Mohatarem & Paul L Gribble, 2017. "Dissociating error-based and reinforcement-based loss functions during sensorimotor learning," PLOS Computational Biology, Public Library of Science, vol. 13(7), pages 1-28, July.
    3. Jonathan B Dingwell & Joby John & Joseph P Cusumano, 2010. "Do Humans Optimally Exploit Redundancy to Control Step Variability in Walking?," PLOS Computational Biology, Public Library of Science, vol. 6(7), pages 1-15, July.
    4. Paul L. Gribble & Stephen H. Scott, 2002. "Overlap of internal models in motor cortex for mechanical loads during reaching," Nature, Nature, vol. 417(6892), pages 938-941, June.
    5. Luigi Acerbi & Sethu Vijayakumar & Daniel M Wolpert, 2014. "On the Origins of Suboptimality in Human Probabilistic Inference," PLOS Computational Biology, Public Library of Science, vol. 10(6), pages 1-23, June.
    6. Xiuli Chen & Kieran Mohr & Joseph M Galea, 2017. "Predicting explorative motor learning using decision-making and motor noise," PLOS Computational Biology, Public Library of Science, vol. 13(4), pages 1-33, April.
    7. Ryan J. Tibshirani & Andrew Price & Jonathan Taylor, 2011. "A statistician plays darts," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 174(1), pages 213-226, January.
    8. Kang He & You Liang & Farnaz Abdollahi & Moria Fisher Bittmann & Konrad Kording & Kunlin Wei, 2016. "The Statistical Determinants of the Speed of Motor Learning," PLOS Computational Biology, Public Library of Science, vol. 12(9), pages 1-20, September.
    9. Joby John & Jonathan B Dingwell & Joseph P Cusumano, 2016. "Error Correction and the Structure of Inter-Trial Fluctuations in a Redundant Movement Task," PLOS Computational Biology, Public Library of Science, vol. 12(9), pages 1-30, September.
    10. Konrad P. Körding & Daniel M. Wolpert, 2004. "Bayesian integration in sensorimotor learning," Nature, Nature, vol. 427(6971), pages 244-247, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Nina M van Mastrigt & Jeroen B J Smeets & Katinka van der Kooij, 2020. "Quantifying exploration in reward-based motor learning," PLOS ONE, Public Library of Science, vol. 15(4), pages 1-14, April.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jingwei Sun & Jian Li & Hang Zhang, 2019. "Human representation of multimodal distributions as clusters of samples," PLOS Computational Biology, Public Library of Science, vol. 15(5), pages 1-29, May.
    2. Adam N Sanborn & Ulrik R Beierholm, 2016. "Fast and Accurate Learning When Making Discrete Numerical Estimates," PLOS Computational Biology, Public Library of Science, vol. 12(4), pages 1-28, April.
    3. Seth W. Egger & Stephen G. Lisberger, 2022. "Neural structure of a sensory decoder for motor control," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    4. Tim Genewein & Eduard Hez & Zeynab Razzaghpanah & Daniel A Braun, 2015. "Structure Learning in Bayesian Sensorimotor Integration," PLOS Computational Biology, Public Library of Science, vol. 11(8), pages 1-27, August.
    5. Luigi Acerbi & Sethu Vijayakumar & Daniel M Wolpert, 2014. "On the Origins of Suboptimality in Human Probabilistic Inference," PLOS Computational Biology, Public Library of Science, vol. 10(6), pages 1-23, June.
    6. Jordi Grau-Moya & Pedro A Ortega & Daniel A Braun, 2016. "Decision-Making under Ambiguity Is Modulated by Visual Framing, but Not by Motor vs. Non-Motor Context. Experiments and an Information-Theoretic Ambiguity Model," PLOS ONE, Public Library of Science, vol. 11(4), pages 1-21, April.
    7. Jonathan B Dingwell & Joseph P Cusumano, 2019. "Humans use multi-objective control to regulate lateral foot placement when walking," PLOS Computational Biology, Public Library of Science, vol. 15(3), pages 1-28, March.
    8. Nina M van Mastrigt & Jeroen B J Smeets & Katinka van der Kooij, 2020. "Quantifying exploration in reward-based motor learning," PLOS ONE, Public Library of Science, vol. 15(4), pages 1-14, April.
    9. Luigi Acerbi & Sethu Vijayakumar & Daniel M Wolpert, 2017. "Target Uncertainty Mediates Sensorimotor Error Correction," PLOS ONE, Public Library of Science, vol. 12(1), pages 1-21, January.
    10. Daniel Blustein & Ahmed Shehata & Kevin Englehart & Jonathon Sensinger, 2018. "Conventional analysis of trial-by-trial adaptation is biased: Empirical and theoretical support using a Bayesian estimator," PLOS Computational Biology, Public Library of Science, vol. 14(12), pages 1-15, December.
    11. Hang Zhang & Nathaniel D Daw & Laurence T Maloney, 2013. "Testing Whether Humans Have an Accurate Model of Their Own Motor Uncertainty in a Speeded Reaching Task," PLOS Computational Biology, Public Library of Science, vol. 9(5), pages 1-11, May.
    12. Todd E Hudson & Laurence T Maloney & Michael S Landy, 2008. "Optimal Compensation for Temporal Uncertainty in Movement Planning," PLOS Computational Biology, Public Library of Science, vol. 4(7), pages 1-9, July.
    13. William T Adler & Wei Ji Ma, 2018. "Comparing Bayesian and non-Bayesian accounts of human confidence reports," PLOS Computational Biology, Public Library of Science, vol. 14(11), pages 1-34, November.
    14. Oliver Linton & Esfandiar Maasoumi & Yoon-Jae Wang, 2002. "Consistent testing for stochastic dominance: a subsampling approach," CeMMAP working papers 03/02, Institute for Fiscal Studies.
    15. van den Bergh, J.C.J.M. & Botzen, W.J.W., 2015. "Monetary valuation of the social cost of CO2 emissions: A critical survey," Ecological Economics, Elsevier, vol. 114(C), pages 33-46.
    16. Heiko Karle & Georg Kirchsteiger & Martin Peitz, 2015. "Loss Aversion and Consumption Choice: Theory and Experimental Evidence," American Economic Journal: Microeconomics, American Economic Association, vol. 7(2), pages 101-120, May.
    17. Shoji, Isao & Kanehiro, Sumei, 2016. "Disposition effect as a behavioral trading activity elicited by investors' different risk preferences," International Review of Financial Analysis, Elsevier, vol. 46(C), pages 104-112.
    18. Muhammad Kashif & Thomas Leirvik, 2022. "The MAX Effect in an Oil Exporting Country: The Case of Norway," JRFM, MDPI, vol. 15(4), pages 1-16, March.
    19. Jonathan Meng & Feng Fu, 2020. "Understanding Gambling Behavior and Risk Attitudes Using Cryptocurrency-based Casino Blockchain Data," Papers 2008.05653, arXiv.org, revised Aug 2020.
    20. Daniel Fonseca Costa & Francisval Carvalho & Bruno César Moreira & José Willer Prado, 2017. "Bibliometric analysis on the association between behavioral finance and decision making with cognitive biases such as overconfidence, anchoring effect and confirmation bias," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(3), pages 1775-1799, June.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1006839. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.