Reward-predictive representations generalize across tasks in reinforcement learning
Author
Abstract
Suggested Citation
DOI: 10.1371/journal.pcbi.1008317
Download full text from publisher
References listed on IDEAS
- Nicholas T Franklin & Michael J Frank, 2018. "Compositional clustering in task structure learning," PLOS Computational Biology, Public Library of Science, vol. 14(4), pages 1-25, April.
- I. Momennejad & E. M. Russek & J. H. Cheong & M. M. Botvinick & N. D. Daw & S. J. Gershman, 2017. "The successor representation in human reinforcement learning," Nature Human Behaviour, Nature, vol. 1(9), pages 680-692, September.
- Teh, Yee Whye & Jordan, Michael I. & Beal, Matthew J. & Blei, David M., 2006. "Hierarchical Dirichlet Processes," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 1566-1581, December.
- Volodymyr Mnih & Koray Kavukcuoglu & David Silver & Andrei A. Rusu & Joel Veness & Marc G. Bellemare & Alex Graves & Martin Riedmiller & Andreas K. Fidjeland & Georg Ostrovski & Stig Petersen & Charle, 2015. "Human-level control through deep reinforcement learning," Nature, Nature, vol. 518(7540), pages 529-533, February.
- Nicky J. Welton & Howard H. Z. Thom, 2015. "Value of Information," Medical Decision Making, , vol. 35(5), pages 564-566, July.
- Evan M Russek & Ida Momennejad & Matthew M Botvinick & Samuel J Gershman & Nathaniel D Daw, 2017. "Predictive representations can link model-based reinforcement learning to model-free mechanisms," PLOS Computational Biology, Public Library of Science, vol. 13(9), pages 1-35, September.
Most related items
These are the items that most often cite the same works as this one and are cited by the same works as this one.- Amirhosein Mosavi & Yaser Faghan & Pedram Ghamisi & Puhong Duan & Sina Faizollahzadeh Ardabili & Ely Salwana & Shahab S. Band, 2020. "Comprehensive Review of Deep Reinforcement Learning Methods and Applications in Economics," Mathematics, MDPI, vol. 8(10), pages 1-42, September.
- Jaron T Colas & Wolfgang M Pauli & Tobias Larsen & J Michael Tyszka & John P O’Doherty, 2017. "Distinct prediction errors in mesostriatal circuits of the human brain mediate learning about the values of both states and actions: evidence from high-resolution fMRI," PLOS Computational Biology, Public Library of Science, vol. 13(10), pages 1-32, October.
- Momchil S Tomov & Samyukta Yagati & Agni Kumar & Wanqian Yang & Samuel J Gershman, 2020. "Discovery of hierarchical representations for efficient planning," PLOS Computational Biology, Public Library of Science, vol. 16(4), pages 1-42, April.
- Liu, Hui & Yu, Chengqing & Wu, Haiping & Duan, Zhu & Yan, Guangxi, 2020. "A new hybrid ensemble deep reinforcement learning model for wind speed short term forecasting," Energy, Elsevier, vol. 202(C).
- Ruohan Zhang & Shun Zhang & Matthew H Tong & Yuchen Cui & Constantin A Rothkopf & Dana H Ballard & Mary M Hayhoe, 2018. "Modeling sensory-motor decisions in natural behavior," PLOS Computational Biology, Public Library of Science, vol. 14(10), pages 1-22, October.
- Vincenzo Varriale & Antonello Cammarano & Francesca Michelino & Mauro Caputo, 2021. "Sustainable Supply Chains with Blockchain, IoT and RFID: A Simulation on Order Management," Sustainability, MDPI, vol. 13(11), pages 1-23, June.
- Valeria Costantini & Francesco Crespi & Giovanni Marin & Elena Paglialunga, 2016. "Eco-innovation, sustainable supply chains and environmental performance in European industries," LEM Papers Series 2016/19, Laboratory of Economics and Management (LEM), Sant'Anna School of Advanced Studies, Pisa, Italy.
- Lee, Alice J. & Ames, Daniel R., 2017. "“I can’t pay more” versus “It’s not worth more”: Divergent effects of constraint and disparagement rationales in negotiations," Organizational Behavior and Human Decision Processes, Elsevier, vol. 141(C), pages 16-28.
- Hussain, Hadia & Murtaza, Murtaza & Ajmal, Areeb & Ahmed, Afreen & Khan, Muhammad Ovais Khalid, 2020. "A study on the effects of social media advertisement on consumer’s attitude and customer response," MPRA Paper 104675, University Library of Munich, Germany.
- A. G. Fatullayev & Nizami A. Gasilov & Şahin Emrah Amrahov, 2019. "Numerical solution of linear inhomogeneous fuzzy delay differential equations," Fuzzy Optimization and Decision Making, Springer, vol. 18(3), pages 315-326, September.
- Cyril Chalendard, 2015.
"Use of internal information, external information acquisition and customs underreporting,"
Working Papers
halshs-01179445, HAL.
- Cyril CHALENDARD, 2015. "Use of Internal Information, External Information Acquisition and Customs Underreporting," Working Papers 201522, CERDI.
- Arun Advani & William Elming & Jonathan Shaw, 2023.
"The Dynamic Effects of Tax Audits,"
The Review of Economics and Statistics, MIT Press, vol. 105(3), pages 545-561, May.
- Arun Advani & William Elming & Jonathan Shaw, 2017. "The dynamic effects of tax audits," IFS Working Papers W17/24, Institute for Fiscal Studies.
- Advani, Arun & Elming, William & Shaw, Jonathan, 2019. "The Dynamic Effects of Tax Audits," CAGE Online Working Paper Series 414, Competitive Advantage in the Global Economy (CAGE).
- Advani, Arun & Elming, William & Shaw, Jonathan, 2019. "The Dynamic Effects of Tax Audits," The Warwick Economics Research Paper Series (TWERPS) 1198, University of Warwick, Department of Economics.
- Philippe Aghion & Ufuk Akcigit & Matthieu Lequien & Stefanie Stantcheva, 2017.
"Tax simplicity and heterogeneous learning,"
CEP Discussion Papers
dp1516, Centre for Economic Performance, LSE.
- P. Aghion & U. Akcigit & M. Lequien & S. Stantcheva, 2018. "Tax Simplicity and Heterogeneous Learning," Working papers 665, Banque de France.
- Aghion, Philippe & Akcigit, Ufuk & Lequien, Matthieu & Stantcheva, Stefanie, 2017. "Tax simplicity and heterogeneous learning," LSE Research Online Documents on Economics 86613, London School of Economics and Political Science, LSE Library.
- Stantcheva, Stefanie & Aghion, Philippe & Lequien, Matthieu & Akcigit, Ufuk, 2017. "Tax Simplicity and Heterogeneous Learning," CEPR Discussion Papers 12471, C.E.P.R. Discussion Papers.
- Tulika Saha & Sriparna Saha & Pushpak Bhattacharyya, 2020. "Towards sentiment aided dialogue policy learning for multi-intent conversations using hierarchical reinforcement learning," PLOS ONE, Public Library of Science, vol. 15(7), pages 1-28, July.
- Marie Bjørneby & Annette Alstadsæter & Kjetil Telle, 2018.
"Collusive tax evasion by employers and employees. Evidence from a randomized fi eld experiment in Norway,"
Discussion Papers
891, Statistics Norway, Research Department.
- Marie Bjørneby & Annette Alstadsæter & Kjetil Telle, 2018. "Collusive Tax Evasion by Employers and Employees: Evidence from a Randomized Field Experiment in Norway," CESifo Working Paper Series 7381, CESifo.
- Chuangen Gao & Shuyang Gu & Jiguo Yu & Hai Du & Weili Wu, 2022. "Adaptive seeding for profit maximization in social networks," Journal of Global Optimization, Springer, vol. 82(2), pages 413-432, February.
- Koessler, Frederic & Laclau, Marie & Renault, Jérôme & Tomala, Tristan, 2022.
"Long information design,"
Theoretical Economics, Econometric Society, vol. 17(2), May.
- Frédéric Koessler & Marie Laclau & Jérôme Renault & Tristan Tomala, 2021. "Long Information Design," PSE Working Papers halshs-02400053, HAL.
- Frédéric Koessler & Marie Laclau & Jerôme Renault & Tristan Tomala, 2022. "Long information design," PSE-Ecole d'économie de Paris (Postprint) hal-03700394, HAL.
- Koessler, Frédéric & Laclau, Marie & Renault, Jérôme & Tomala, Tristan, 2022. "Long information design," TSE Working Papers 22-1341, Toulouse School of Economics (TSE).
- Marie Laclau & Frédéric Koessler & Jérôme Renault & Tristan Tomala, 2022. "Long Information Design," Post-Print halshs-03342880, HAL.
- Marie Laclau & Frédéric Koessler & Jérôme Renault & Tristan Tomala, 2022. "Long Information Design," PSE-Ecole d'économie de Paris (Postprint) halshs-03342880, HAL.
- Frédéric Koessler & Marie Laclau & Jerôme Renault & Tristan Tomala, 2022. "Long information design," Post-Print hal-03700394, HAL.
- Frédéric Koessler & Marie Laclau & Jérôme Renault & Tristan Tomala, 2021. "Long Information Design," Working Papers halshs-02400053, HAL.
- Frédéric Koessler & Marie Laclau & Jérôme Renault & Tristan Tomala, 2022. "Long Information Design," Post-Print halshs-02400053, HAL.
- Frédéric Koessler & Marie Laclau & Jérôme Renault & Tristan Tomala, 2022. "Long Information Design," PSE-Ecole d'économie de Paris (Postprint) halshs-02400053, HAL.
- Jamal El-Den & Pratap Adikhari & Pratap Adikhari, 2017. "Social media in the service of social entrepreneurship: Identifying factors for better services," Journal of Advances in Humanities and Social Sciences, Dr. Yi-Hsing Hsieh, vol. 3(2), pages 105-114.
- Michelle Dietzen & Haoran Zhai & Olivia Lucas & Oriol Pich & Christopher Barrington & Wei-Ting Lu & Sophia Ward & Yanping Guo & Robert E. Hynds & Simone Zaccaria & Charles Swanton & Nicholas McGranaha, 2024. "Replication timing alterations are associated with mutation acquisition during breast and lung cancer evolution," Nature Communications, Nature, vol. 15(1), pages 1-23, December.
- Annette Alstadsæter & Wojciech Kopczuk & Kjetil Telle, 2019.
"Social networks and tax avoidance: evidence from a well-defined Norwegian tax shelter,"
International Tax and Public Finance, Springer;International Institute of Public Finance, vol. 26(6), pages 1291-1328, December.
- Annette Alstadsæter & Wojciech Kopczuk & Kjetil Telle, 2018. "Social Networks and Tax Avoidance: Evidence from a Well-Defined Norwegian Tax Shelter," NBER Working Papers 25191, National Bureau of Economic Research, Inc.
- Annette Alstadsæter & Wojciech Kopczuk & Kjetil Telle, 2018. "Social networks and tax avoidance. Evidence from a well-defined Norwegian tax shelter," Discussion Papers 886, Statistics Norway, Research Department.
- Kopczuk, Wojciech & Alstadsæter, Annette & Telle, Kjetil, 2018. "Social networks and tax avoidance: Evidence from a well-defined Norwegian tax shelter," CEPR Discussion Papers 13251, C.E.P.R. Discussion Papers.
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1008317. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .
Please note that corrections may take a couple of weeks to filter through the various RePEc services.