To switch or not to switch? Balanced policy switching in offline reinforcement learning
Author
Abstract
Suggested Citation
Download full text from publisher
References listed on IDEAS
- Tore Nilssen, 1992.
"Two Kinds of Consumer Switching Costs,"
RAND Journal of Economics, The RAND Corporation, vol. 23(4), pages 579-589, Winter.
- Nilssen, T., 1990. "Two Kinds of Consumer Switching Costs," Papers 12-90, Norwegian School of Economics and Business Administration-.
- Lynn M. LoPucki & Joseph W. Doherty, 2004. "The Determinants of Professional Fees in Large Bankruptcy Reorganization Cases," Journal of Empirical Legal Studies, John Wiley & Sons, vol. 1(1), pages 111-141, March.
- David Silver & Julian Schrittwieser & Karen Simonyan & Ioannis Antonoglou & Aja Huang & Arthur Guez & Thomas Hubert & Lucas Baker & Matthew Lai & Adrian Bolton & Yutian Chen & Timothy Lillicrap & Fan , 2017. "Mastering the game of Go without human knowledge," Nature, Nature, vol. 550(7676), pages 354-359, October.
Most related items
These are the items that most often cite the same works as this one and are cited by the same works as this one.- Annabi, Amira & Breton, Michèle & François, Pascal, 2012. "Resolution of financial distress under Chapter 11," Journal of Economic Dynamics and Control, Elsevier, vol. 36(12), pages 1867-1887.
- Lam, W., 2015. "Switching Costs in Two-sided Markets," LIDAM Discussion Papers CORE 2015024, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
- Arturo Bris & Alan Schwartz & Ivo Welch, 2005.
"Who Should Pay for Bankruptcy Costs?,"
The Journal of Legal Studies, University of Chicago Press, vol. 34(2), pages 295-341, June.
- Ivo Welch & Arturo Bris & Alan Schwartz, 2003. "Who Should Pay for Bankruptcy Costs?," Yale School of Management Working Papers ysm365, Yale School of Management, revised 01 Sep 2004.
- Ivo Welch & Arturo Bris & Alan Schwartz, 2003. "Who Should Pay for Bankruptcy Costs?," Yale School of Management Working Papers ysm365, Yale School of Management, revised 01 Sep 2004.
- Daníelsson, Jón & Macrae, Robert & Uthemann, Andreas, 2022.
"Artificial intelligence and systemic risk,"
Journal of Banking & Finance, Elsevier, vol. 140(C).
- Danielsson, Jon & Macrae, Robert & Uthemann, Andreas, 2022. "Artificial intelligence and systemic risk," LSE Research Online Documents on Economics 111601, London School of Economics and Political Science, LSE Library.
- Zhang, Xi & Wang, Qin & Bi, Xiaowen & Li, Donghong & Liu, Dong & Yu, Yuanjin & Tse, Chi Kong, 2024. "Mitigating cascading failure in power grids with deep reinforcement learning-based remedial actions," Reliability Engineering and System Safety, Elsevier, vol. 250(C).
- Thamayanthi Chellathurai, 2017. "Probability Density Of Recovery Rate Given Default Of A Firm’S Debt And Its Constituent Tranches," International Journal of Theoretical and Applied Finance (IJTAF), World Scientific Publishing Co. Pte. Ltd., vol. 20(04), pages 1-34, June.
- Stefano Colombo, 2018. "Behavior‐ and characteristic‐based price discrimination," Journal of Economics & Management Strategy, Wiley Blackwell, vol. 27(2), pages 237-250, June.
- Adnan Jafar & Alessandra Kobayati & Michael A. Tsoukas & Ahmad Haidar, 2024. "Personalized insulin dosing using reinforcement learning for high-fat meals and aerobic exercises in type 1 diabetes: a proof-of-concept trial," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
- Andrei Hagiu & Julian Wright, 2023. "Data‐enabled learning, network effects, and competitive advantage," RAND Journal of Economics, RAND Corporation, vol. 54(4), pages 638-667, December.
- Yang, Zhengzhi & Zheng, Lei & Perc, Matjaž & Li, Yumeng, 2024. "Interaction state Q-learning promotes cooperation in the spatial prisoner's dilemma game," Applied Mathematics and Computation, Elsevier, vol. 463(C).
- Rohan Pitchford & Mark L. J. Wright, 2012.
"Holdouts in Sovereign Debt Restructuring: A Theory of Negotiation in a Weak Contractual Environment,"
The Review of Economic Studies, Review of Economic Studies Ltd, vol. 79(2), pages 812-837.
- Rohan Pitchford & Mark L. J. Wright, 2008. "Holdouts In Sovereign Debt Restructuring: A Theory Of Negotiation In A Weak Contractual Environment," CAMA Working Papers 2008-37, Centre for Applied Macroeconomic Analysis, Crawford School of Public Policy, The Australian National University.
- Rohan Pitchford & Mark L. J. Wright, 2010. "Holdouts in Sovereign Debt Restructuring: A Theory of Negotiation in a Weak Contractual Environment," NBER Working Papers 16632, National Bureau of Economic Research, Inc.
- Zhang, Yihao & Chai, Zhaojie & Lykotrafitis, George, 2021. "Deep reinforcement learning with a particle dynamics environment applied to emergency evacuation of a room with obstacles," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 571(C).
- Ruqu Wang & Quan Wen, 1998. "Strategic Invasion in Markets with Switching Costs," Journal of Economics & Management Strategy, Wiley Blackwell, vol. 7(4), pages 521-549, December.
- Keller, Alexander & Dahm, Ken, 2019. "Integral equations and machine learning," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 161(C), pages 2-12.
- Nogata, Daisuke, 2022. "Determinants of household switching between natural gas suppliers: Evidence from Japan," Utilities Policy, Elsevier, vol. 76(C).
- Canhoto, Ana Isabel & Clear, Fintan, 2020. "Artificial intelligence and machine learning as business tools: A framework for diagnosing value destruction potential," Business Horizons, Elsevier, vol. 63(2), pages 183-193.
- Zhaobin Mo & Xuan Di & Rongye Shi, 2023. "Robust Data Sampling in Machine Learning: A Game-Theoretic Framework for Training and Validation Data Selection," Games, MDPI, vol. 14(1), pages 1-13, January.
- Bouckaert, J.M.C. & Degryse, H.A., 2002.
"Softening Competition by Enhancing entry : An Example from the Banking Industry,"
Other publications TiSEM
1cf58bbb-25a9-4e6e-a11f-8, Tilburg University, School of Economics and Management.
- Bouckaert, J.M.C. & Degryse, H.A., 2002. "Softening Competition by Enhancing entry : An Example from the Banking Industry," Discussion Paper 2002-86, Tilburg University, Center for Economic Research.
- Jan Bouckaert & Hans Degryse, 2002. "Softening Competition by Enhancing Entry: An Example from the Banking Industry," CESifo Working Paper Series 782, CESifo.
- Jan Bouckaert & Hans Degryse, 2002. "Softening Competition by Enhancing Entry: An Example from the Banking Industry," CSEF Working Papers 85, Centre for Studies in Economics and Finance (CSEF), University of Naples, Italy.
- Yang, Kaiyuan & Huang, Houjing & Vandans, Olafs & Murali, Adithya & Tian, Fujia & Yap, Roland H.C. & Dai, Liang, 2023. "Applying deep reinforcement learning to the HP model for protein structure prediction," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 609(C).
- Gravelle, Hugh & Masiero, Giuliano, 2000.
"Quality incentives in a regulated market with imperfect information and switching costs: capitation in general practice,"
Journal of Health Economics, Elsevier, vol. 19(6), pages 1067-1088, November.
- Hugh Gravelle & Giuliano Masiero, "undated". "Quality incentives in a regulated market with imperfect information and switching costs: capitation in general practice," Discussion Papers 00/18, Department of Economics, University of York.
More about this item
JEL classification:
- C1 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General
NEP fields
This paper has been announced in the following NEP Reports:- NEP-BIG-2024-08-26 (Big Data)
- NEP-CMP-2024-08-26 (Computational Economics)
Statistics
Access and download statisticsCorrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ehl:lserod:124144. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: LSERO Manager (email available below). General contact details of provider: https://edirc.repec.org/data/lsepsuk.html .
Please note that corrections may take a couple of weeks to filter through the various RePEc services.