IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1008552.html
   My bibliography  Save this article

Model based planners reflect on their model-free propensities

Author

Listed:
  • Rani Moran
  • Mehdi Keramati
  • Raymond J Dolan

Abstract

Dual-reinforcement learning theory proposes behaviour is under the tutelage of a retrospective, value-caching, model-free (MF) system and a prospective-planning, model-based (MB), system. This architecture raises a question as to the degree to which, when devising a plan, a MB controller takes account of influences from its MF counterpart. We present evidence that such a sophisticated self-reflective MB planner incorporates an anticipation of the influences its own MF-proclivities exerts on the execution of its planned future actions. Using a novel bandit task, wherein subjects were periodically allowed to design their environment, we show that reward-assignments were constructed in a manner consistent with a MB system taking account of its MF propensities. Thus, in the task participants assigned higher rewards to bandits that were momentarily associated with stronger MF tendencies. Our findings have implications for a range of decision making domains that includes drug abuse, pre-commitment, and the tension between short and long-term decision horizons in economics.Author summary: Extant theories of model-based (MB) planning usually ignore whether one’s model-free (MF) tendencies cohere, or conflict, with one’s desired goal-directed actions. From the perspective of these theories, one’s MF disposition to abstinence or indulgent alcohol-drinking is irrelevant for a planner who intends to avoid alcohol consumption at a forthcoming party. However, MF tendencies can hijack one’s actions such that plans consistent with MF tendencies are often adhered to, whereas plans that conflict with MF tendencies are prone to be disregarded. Here, we show that planners consider whether their MF proclivities facilitate or hinder goal-pursuit. We show that people are disposed to design a task-environment in such a way that renders MB goals consistent (rather than conflicting) with their MF dispositions.

Suggested Citation

  • Rani Moran & Mehdi Keramati & Raymond J Dolan, 2021. "Model based planners reflect on their model-free propensities," PLOS Computational Biology, Public Library of Science, vol. 17(1), pages 1-28, January.
  • Handle: RePEc:plo:pcbi00:1008552
    DOI: 10.1371/journal.pcbi.1008552
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1008552
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1008552&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1008552?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Loewenstein, George & O'Donoghue, Ted, 2004. "Animal Spirits: Affective and Deliberative Processes in Economic Behavior," Working Papers 04-14, Cornell University, Center for Analytic Economics.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Alexander L. Brown & Zhikang Eric Chua & Colin F. Camerer, 2009. "Learning and Visceral Temptation in Dynamic Saving Experiments," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 124(1), pages 197-231.
    2. Robinson, Angela & Covey, Judith & Spencer, Anne & Loomes, Graham, 2010. "Are some deaths worse than others? The effect of 'labelling' on people's perceptions," Journal of Economic Psychology, Elsevier, vol. 31(3), pages 444-455, June.
    3. Grenet, Julien & He, YingHua & Kübler, Dorothea, 2022. "Preference Discovery in University Admissions: The Case for Dynamic Multioffer Mechanisms," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 130(6), pages 1-1.
    4. Khwaja, Ahmed & Silverman, Dan & Sloan, Frank, 2007. "Time preference, time discounting, and smoking decisions," Journal of Health Economics, Elsevier, vol. 26(5), pages 927-949, September.
    5. Cobb-Clark, Deborah A. & Dahmann, Sarah C. & Kamhöfer, Daniel A. & Schildberg-Hörisch, Hannah, 2023. "Self-control and unhealthy body weight: The role of impulsivity and restraint," Economics & Human Biology, Elsevier, vol. 50(C).
    6. Laurette Dubé & Antoine Bechara & Ulf Böckenholt & Asim Ansari & Alain Dagher & Mark Daniel & Wayne DeSarbo & Lesley Fellows & Ross Hammond & Terry Huang & Scott Huettel & Yan Kestens & Bärbel Knäuper, 2009. "Towards a brain-to-society systems model of individual choice," Marketing Letters, Springer, vol. 20(1), pages 105-106, March.
    7. Nicolas Brisset & Dorian Jullien, 2019. "Models as Speech Acts: A Restatement and a new Case Study," GREDEG Working Papers 2019-09, Groupe de REcherche en Droit, Economie, Gestion (GREDEG CNRS), Université Côte d'Azur, France.
    8. Daniel J. Benjamin & Sebastian A. Brown & Jesse M. Shapiro, 2013. "Who Is ‘Behavioral’? Cognitive Ability And Anomalous Preferences," Journal of the European Economic Association, European Economic Association, vol. 11(6), pages 1231-1255, December.
    9. Recalde, María P. & Riedl, Arno & Vesterlund, Lise, 2018. "Error-prone inference from response time: The case of intuitive generosity in public-good games," Journal of Public Economics, Elsevier, vol. 160(C), pages 132-147.
    10. Stephan Meier & Charles Sprenger, 2007. "Impatience and credit behavior: evidence from a field experiment," Working Papers 07-3, Federal Reserve Bank of Boston.
    11. Zhou, Li & Turvey, Calum & Hu, Wuyang & Ying, Ruiyao, 2015. "Fear and Trust: How Risk Perceptions of Avian Influenza Affect the Demand for Chicken," 2015 AAEA & WAEA Joint Annual Meeting, July 26-28, San Francisco, California 202077, Agricultural and Applied Economics Association.
    12. Bracha, Anat & Brown, Donald J., 2012. "Affective decision making: A theory of optimism bias," Games and Economic Behavior, Elsevier, vol. 75(1), pages 67-80.
    13. Ola Andersson & Håkan J. Holm & Jean-Robert Tyran & Erik Wengström, 2016. "Deciding for Others Reduces Loss Aversion," Management Science, INFORMS, vol. 62(1), pages 29-36, January.
    14. Isabelle Brocas & Juan D. Carrillo, 2008. "The Brain as a Hierarchical Organization," American Economic Review, American Economic Association, vol. 98(4), pages 1312-1346, September.
    15. Drew Fudenberg, 2006. "Advancing Beyond Advances in Behavioral Economics," Journal of Economic Literature, American Economic Association, vol. 44(3), pages 694-711, September.
    16. McLeish, Kendra N & Oxoby, Robert J, 2006. "Measuring Impatience: Elicited Discount Rates and the Barratt Impulsiveness Scale," MPRA Paper 1524, University Library of Munich, Germany.
    17. D.Dragone, 2005. "Incoerenza Dinamica ed Autocontrollo: Proposta per un'Analisi Interdisciplinare," Working Papers 549, Dipartimento Scienze Economiche, Universita' di Bologna.
    18. Emre Ozdenoren & Stephen W. Salant & Dan Silverman, 2012. "Willpower And The Optimal Control Of Visceral Urges," Journal of the European Economic Association, European Economic Association, vol. 10(2), pages 342-368, April.
    19. Drew Fudenberg & David K. Levine, 2011. "Risk, Delay, and Convex Self-Control Costs," American Economic Journal: Microeconomics, American Economic Association, vol. 3(3), pages 34-68, August.
    20. George Ainslie, 2012. "Pure hyperbolic discount curves predict “eyes open” self-control," Theory and Decision, Springer, vol. 73(1), pages 3-34, July.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1008552. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.