Undiscounted Bandit Games

Author

Listed:

Rady, Sven
Keller, R Godfrey

Abstract

We analyze undiscounted continuous-time games of strategic experimentation with two-armed bandits. The risky arm generates payoffs according to a Lévy process with an unknown average payoff per unit of time which nature draws from an arbitrary finite set. Observing all actions and realized payoffs, players use Markov strategies with the common posterior belief about the unknown parameter as the state variable. We show that the unique symmetric Markov perfect equilibrium can be computed in a simple closed form involving only the payoff of the safe arm, the expected current payoff of the risky arm, and the expected full-information payoff, given the current belief. In particular, the equilibrium does not depend on the precise specification of the payoff-generating processes.
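The three quantities named in the abstract can be illustrated with a minimal sketch. It assumes a finite set of possible average payoffs `mus` for the risky arm, a belief vector `belief` over that set, and a safe payoff `s`; the function name and arguments are hypothetical. It also assumes the usual reading of "expected full-information payoff", namely that a player who knew the true parameter would choose the better arm in each state. The sketch does not reproduce the paper's closed-form equilibrium, which the abstract does not state.

```python
from typing import Sequence, Tuple


def belief_summaries(
    s: float, mus: Sequence[float], belief: Sequence[float]
) -> Tuple[float, float, float]:
    """Return (safe payoff, expected current risky payoff, expected full-information payoff).

    Hypothetical helper: `mus` are the possible average payoffs of the risky
    arm and `belief` is the current posterior over them (same order).
    """
    if len(mus) != len(belief):
        raise ValueError("mus and belief must have the same length")
    if any(p < 0 for p in belief) or abs(sum(belief) - 1.0) > 1e-9:
        raise ValueError("belief must be a probability vector")

    # Expected current payoff of the risky arm under the belief.
    m = sum(p * mu for p, mu in zip(belief, mus))

    # Assumed interpretation: with full information the better arm is
    # chosen in each state, so the payoff in state i is max(s, mu_i).
    f = sum(p * max(s, mu) for p, mu in zip(belief, mus))

    return s, m, f


# Example: safe payoff 1, risky average payoff either 0 or 2, equally likely.
safe, current, full_info = belief_summaries(1.0, [0.0, 2.0], [0.5, 0.5])
```

In this example the risky arm's expected current payoff equals the safe payoff, while the expected full-information payoff is strictly higher; the gap reflects the value of learning the true parameter.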

Suggested Citation

Rady, Sven & Keller, R Godfrey, 2019. "Undiscounted Bandit Games," CEPR Discussion Papers 14046, C.E.P.R. Discussion Papers.

Handle: RePEc:cpr:ceprdp:14046

Other versions of this item:

Keller, Godfrey & Rady, Sven, 2020. "Undiscounted bandit games," Games and Economic Behavior, Elsevier, vol. 124(C), pages 43-61.

Godfrey Keller & Sven Rady, 2019. "Undiscounted Bandit Games," CRC TR 224 Discussion Paper Series crctr224_2019_130, University of Bonn and University of Mannheim, Germany.
Godfrey Keller & Sven Rady, 2019. "Undiscounted Bandit Games," Papers 1909.13323, arXiv.org, revised Aug 2020.
Godfrey Keller & Sven Rady, 2019. "Undiscounted Bandit Games," Economics Series Working Papers 882, University of Oxford, Department of Economics.
Godfrey Keller & Sven Rady, 2020. "Undiscounted Bandit Games," CRC TR 224 Discussion Paper Series crctr224_2020_130v2, University of Bonn and University of Mannheim, Germany.
Keller, Godfrey & Rady, Sven, 2015. "Undiscounted Bandit Games," Discussion Paper Series of SFB/TR 15 Governance and the Efficiency of Economic Systems 520, Free University of Berlin, Humboldt University of Berlin, University of Bonn, University of Mannheim, University of Munich.

References listed on IDEAS

Asaf Cohen & Eilon Solan, 2013. "Bandit Problems with Lévy Processes," Mathematics of Operations Research, INFORMS, vol. 38(1), pages 92-107, February.
Godfrey Keller & Sven Rady, 1999. "Optimal Experimentation in a Changing Environment," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 66(3), pages 475-507.
- Godfrey Keller & Sven Rady, 1997. "Optimal Experimentation in a Changing Environment," STICERD - Theoretical Economics Paper Series 333, Suntory and Toyota International Centres for Economics and Related Disciplines, LSE.
- Godfrey Keller & Sven Rady, 1998. "Optimal Experimentation in a Changing Environment," Game Theory and Information 9801001, University Library of Munich, Germany.
Bergemann, Dirk & Valimaki, Juuso, 2002. "Entry and Vertical Differentiation," Journal of Economic Theory, Elsevier, vol. 106(1), pages 91-125, September.
- Dirk Bergemann & Juuso Valimaki, 2000. "Entry and Vertical Differentiation," Cowles Foundation Discussion Papers 1277, Cowles Foundation for Research in Economics, Yale University.
- Dirk Bergemann & Juuso Valimaki, 2001. "Entry and Vertical Differentiation," Cowles Foundation Discussion Papers 1302, Cowles Foundation for Research in Economics, Yale University.
Godfrey Keller & Sven Rady & Martin Cripps, 2005. "Strategic Experimentation with Exponential Bandits," Econometrica, Econometric Society, vol. 73(1), pages 39-68, January.
- Godfrey Keller & Martin Cripps & Sven Rady, 2003. "Strategic Experimentation with Exponential Bandits," Economics Series Working Papers 143, University of Oxford, Department of Economics.
- Cripps, Martin & Keller, Godfrey & Rady, Sven, 2003. "Strategic Experimentation with Exponential Bandits," Discussion Papers in Economics 4, University of Munich, Department of Economics.
- Rady, Sven & Cripps, Martin William & Keller, R Godfrey, 2003. "Strategic Experimentation with Exponential Bandits," CEPR Discussion Papers 3814, C.E.P.R. Discussion Papers.
Patrick Bolton & Christopher Harris, 1999. "Strategic Experimentation," Econometrica, Econometric Society, vol. 67(2), pages 349-374, March.
Dirk Bergemann & Juuso Valimaki, 1997. "Market Diffusion with Two-Sided Learning," RAND Journal of Economics, The RAND Corporation, vol. 28(4), pages 773-795, Winter.
- Dirk Bergemann & Juuso Valimaki, 1996. "Market Diffusion with Two-Sided Learning," Cowles Foundation Discussion Papers 1138, Cowles Foundation for Research in Economics, Yale University.
Ke, T. Tony & Villas-Boas, J. Miguel, 2019. "Optimal learning before choice," Journal of Economic Theory, Elsevier, vol. 180(C), pages 383-437.
Keller, Godfrey & Rady, Sven, 2010. "Strategic experimentation with Poisson bandits," Theoretical Economics, Econometric Society, vol. 5(2), May.
- Sven Rady & Godfrey Keller, 2007. "Strategic Experimentation with Poisson Bandits," 2007 Meeting Papers 332, Society for Economic Dynamics.
- Rady, Sven & Keller, R Godfrey, 2009. "Strategic Experimentation with Poisson Bandits," CEPR Discussion Papers 7270, C.E.P.R. Discussion Papers.
- Keller, Godfrey & Rady, Sven, 2009. "Strategic Experimentation with Poisson Bandits," Discussion Paper Series of SFB/TR 15 Governance and the Efficiency of Economic Systems 260, Free University of Berlin, Humboldt University of Berlin, University of Bonn, University of Mannheim, University of Munich.
- Keller, Godfrey & Rady, Sven, 2009. "Strategic Experimentation with Poisson Bandits," Discussion Papers in Economics 10575, University of Munich, Department of Economics.
Martin Peitz & Sven Rady & Piers Trepper, 2017. "Experimentation in Two-Sided Markets," Journal of the European Economic Association, European Economic Association, vol. 15(1), pages 128-172.
- Peitz, Martin & Rady, Sven & Trepper, Piers, 2011. "Experimentation in Two-Sided Markets," Discussion Paper Series of SFB/TR 15 Governance and the Efficiency of Economic Systems 365, Free University of Berlin, Humboldt University of Berlin, University of Bonn, University of Mannheim, University of Munich.
- Peitz, Martin & Rady, Sven & Trepper, Piers, 2017. "Experimentation in Two-Sided Markets," Munich Reprints in Economics 55039, University of Munich, Department of Economics.
- Peitz, Martin & Rady, Sven & Trepper, Piers, 2013. "Experimentation in Two-Sided Markets," Working Papers 13-03, University of Mannheim, Department of Economics.
- Rady, Sven & Peitz, Martin & Trepper, Piers, 2011. "Experimentation in Two-Sided Markets," CEPR Discussion Papers 8670, C.E.P.R. Discussion Papers.
- Martin Peitz & Sven Rady & Piers Trepper, 2015. "Experimentation in Two-Sided Markets," CESifo Working Paper Series 5346, CESifo.
Christopher Harris, 1993. "Generalized Solutions of Stochastic Differential Games in One Dimension," Papers 0044, Boston University - Industry Studies Programme.
- Harris, C., 1993. "Generalized Solutions of Stochastic Differential Games in One Dimension," Papers 44, Boston University - Industry Studies Programme.
Pietro Veronesi, 2000. "How Does Information Quality Affect Stock Returns?," Journal of Finance, American Finance Association, vol. 55(2), pages 807-837, April.
Alessandro Bonatti, 2011. "Menu Pricing and Learning," American Economic Journal: Microeconomics, American Economic Association, vol. 3(3), pages 124-163, August.
Dutta, Prajit K., 1991. "What do discounted optima converge to?: A theory of discount rate asymptotics in economic models," Journal of Economic Theory, Elsevier, vol. 55(1), pages 64-94, October.
Moscarini, Giuseppe & Squintani, Francesco, 2010. "Competitive experimentation with private information: The survivor's curse," Journal of Economic Theory, Elsevier, vol. 145(2), pages 639-660, March.
Jovanovic, Boyan, 1979. "Job Matching and the Theory of Turnover," Journal of Political Economy, University of Chicago Press, vol. 87(5), pages 972-990, October.
- Thomas Sargent, "undated". "Matlab code for Jovanovic's matching model," QM&RBC Codes 24, Quantitative Macroeconomics & Real Business Cycles.
Keller, Godfrey & Rady, Sven, 2003. "Price Dispersion and Learning in a Dynamic Differentiated-Goods Duopoly," RAND Journal of Economics, The RAND Corporation, vol. 34(1), pages 138-165, Spring.
- Keller, Godfrey & Rady, Sven, 2001. "Price Dispersion and Learning in a Dynamic Differentiated-Goods Duopoly," Discussion Papers in Economics 21, University of Munich, Department of Economics.
- Rady, Sven & Keller, R Godfrey, 2001. "Price Dispersion and Learning in a Dynamic Differentiated-Goods Duopoly," CEPR Discussion Papers 2919, C.E.P.R. Discussion Papers.
Dutta, P.K., 1991. "What Do Discounted Optima Converge To? A Theory of Discount Rate Asymptotics in Economic Models," RCER Working Papers 264, University of Rochester - Center for Economic Research (RCER).

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Weng, Xi, 2015. "Dynamic pricing in the presence of individual learning," Journal of Economic Theory, Elsevier, vol. 155(C), pages 262-299.
Martin Peitz & Sven Rady & Piers Trepper, 2017. "Experimentation in Two-Sided Markets," Journal of the European Economic Association, European Economic Association, vol. 15(1), pages 128-172.
- Peitz, Martin & Rady, Sven & Trepper, Piers, 2011. "Experimentation in Two-Sided Markets," Discussion Paper Series of SFB/TR 15 Governance and the Efficiency of Economic Systems 365, Free University of Berlin, Humboldt University of Berlin, University of Bonn, University of Mannheim, University of Munich.
- Peitz, Martin & Rady, Sven & Trepper, Piers, 2017. "Experimentation in Two-Sided Markets," Munich Reprints in Economics 55039, University of Munich, Department of Economics.
- Peitz, Martin & Rady, Sven & Trepper, Piers, 2013. "Experimentation in Two-Sided Markets," Working Papers 13-03, University of Mannheim, Department of Economics.
- Rady, Sven & Peitz, Martin & Trepper, Piers, 2011. "Experimentation in Two-Sided Markets," CEPR Discussion Papers 8670, C.E.P.R. Discussion Papers.
- Martin Peitz & Sven Rady & Piers Trepper, 2015. "Experimentation in Two-Sided Markets," CESifo Working Paper Series 5346, CESifo.
Alessandro Bonatti, 2008. "Continuous-Time Screening Contracts," 2008 Meeting Papers 493, Society for Economic Dynamics.
Keller, Godfrey & Rady, Sven, 2015. "Breakdowns," Theoretical Economics, Econometric Society, vol. 10(1), January.
- Keller, Godfrey & Rady, Sven, 2012. "Breakdowns," Discussion Paper Series of SFB/TR 15 Governance and the Efficiency of Economic Systems 396, Free University of Berlin, Humboldt University of Berlin, University of Bonn, University of Mannheim, University of Munich.
- Godfrey Keller & Sven Rady, 2013. "Breakdowns," Levine's Working Paper Archive 786969000000000635, David K. Levine.
Décamps, Jean-Paul & Mariotti, Thomas & Villeneuve, Stéphane, 2000. "Investment Timing under Incomplete Information," IDEI Working Papers 115, Institut d'Économie Industrielle (IDEI), Toulouse, revised Apr 2004.
- Décamps, Jean-Paul & Mariotti, Thomas & Villeneuve, Stephane, 2003. "Investment timing under incomplete information," LSE Research Online Documents on Economics 19325, London School of Economics and Political Science, LSE Library.
- Jean-Paul Decamps & Thomas Mariotti & Stephane Villeneuve, 2003. "Investment Timing under Incomplete Information," STICERD - Theoretical Economics Paper Series 444, Suntory and Toyota International Centres for Economics and Related Disciplines, LSE.
Rosenberg, Dinah & Salomon, Antoine & Vieille, Nicolas, 2013. "On games of strategic experimentation," Games and Economic Behavior, Elsevier, vol. 82(C), pages 31-51.
- Dinah Rosenberg & Antoine Salomon & Nicolas Vieille, 2010. "On Games of Strategic Experimentation," Working Papers hal-00579613, HAL.
- Rosenberg, Dinah & Salomon, Antoine & Vieille, Nicolas, 2013. "On Games of Strategic Experimentation," HEC Research Papers Series 1008, HEC Paris.
Lizzeri, Alessandro & Shmaya, Eran & Yariv, Leeat, 2024. "Disentangling Exploration from Exploitation," CEPR Discussion Papers 19058, C.E.P.R. Discussion Papers.
- Alessandro Lizzeri & Eran Shmaya & Leeat Yariv, 2024. "Disentangling Exploration from Exploitation," Working Papers 334, Princeton University, Department of Economics, Center for Economic Policy Studies.
- Alessandro Lizzeri & Eran Shmaya & Leeat Yariv, 2024. "Disentangling Exploration from Exploitation," Papers 2404.19116, arXiv.org.
- Alessandro Lizzeri & Eran Shmaya & Leeat Yariv, 2024. "Disentangling Exploration from Exploitation," NBER Working Papers 32424, National Bureau of Economic Research, Inc.
Bloch, Francis & Fabrizi, Simona & Lippert, Steffen, 2022. "Hiding and herding in market entry," Journal of Economic Theory, Elsevier, vol. 206(C).
- Francis Bloch & Simona Fabrizi & Steffen Lippert, 2022. "Hiding and herding in market entry," PSE-Ecole d'économie de Paris (Postprint) halshs-03956373, HAL.
- Francis Bloch & Simona Fabrizi & Steffen Lippert, 2022. "Hiding and herding in market entry," Post-Print halshs-03956373, HAL.
Dinah Rosenberg & Eilon Solan & Nicolas Vieille, 2007. "Social Learning in One-Arm Bandit Problems," Econometrica, Econometric Society, vol. 75(6), pages 1591-1611, November.
- Dinah Rosenberg & Eilon Solan & Nicolas Vieille, 2004. "Social Learning in One-Arm Bandit Problems," Discussion Papers 1396, Northwestern University, Center for Mathematical Studies in Economics and Management Science.
- Nicolas Vieille & Dinah Rosenberg & Eilon Solan, 2007. "Social Learning in One-Arm Bandit Problems," Post-Print hal-00464609, HAL.
Jan Eeckhout & Xi Weng, 2022. "Assortative Learning," Economica, London School of Economics and Political Science, vol. 89(355), pages 647-688, July.
- Xi Weng & Jan Eeckhout, 2010. "Assortative Learning," 2010 Meeting Papers 356, Society for Economic Dynamics.
Farzad Pourbabaee, 2024. "Reputation, learning and project choice in frictional economies," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 78(4), pages 1075-1115, December.
Wagner, Peter A. & Klein, Nicolas, 2022. "Strategic investment and learning with private information," Journal of Economic Theory, Elsevier, vol. 204(C).
- Nicolas KLEIN & Peter WAGNER, 2018. "Strategic Investment and Learning with Private Information," Cahiers de recherche 13-2018, Centre interuniversitaire de recherche en économie quantitative, CIREQ.
- KLEIN, Nicolas & WAGNER, Peter, 2018. "Strategic investment and learning with private information," Cahiers de recherche 2018-10, Universite de Montreal, Departement de sciences economiques.
Eeckhout, Jan & Weng, Xi, 2015. "Common value experimentation," Journal of Economic Theory, Elsevier, vol. 160(C), pages 317-339.
Godfrey Keller & Sven Rady, 1998. "Market Experimentation in a Dynamic Differentiated-Goods Duopoly," Game Theory and Information 9810001, University Library of Munich, Germany, revised 20 Aug 1999.
- Godfrey Keller & Sven Rady, 1999. "Market Experimentation in a Dynamic Differentiated-Goods Duopoly," STICERD - Theoretical Economics Paper Series 369, Suntory and Toyota International Centres for Economics and Related Disciplines, LSE.
- Keller, Godfrey & Rady, Sven, 1999. "Market experimentation in a dynamic differentiated-goods duopoly," LSE Research Online Documents on Economics 19346, London School of Economics and Political Science, LSE Library.
Boyarchenko, Svetlana, 2021. "Inefficiency of sponsored research," Journal of Mathematical Economics, Elsevier, vol. 95(C).
Axel Anderson & Luís M. B. Cabral, 2007. "Go for broke or play it safe? Dynamic competition with choice of variance," RAND Journal of Economics, RAND Corporation, vol. 38(3), pages 593-609, September.
- Cabral, Luis & Anderson, Axel, 2004. "Go For Broke or Play it Safe? Dynamic Competition with Choice of Variance," CEPR Discussion Papers 4249, C.E.P.R. Discussion Papers.
Strulovici, Bruno & Szydlowski, Martin, 2015. "On the smoothness of value functions and the existence of optimal strategies in diffusion models," Journal of Economic Theory, Elsevier, vol. 159(PB), pages 1016-1055.
Roland Fryer & Philipp Harms, 2018. "Two-Armed Restless Bandits with Imperfect Information: Stochastic Control and Indexability," Mathematics of Operations Research, INFORMS, vol. 43(2), pages 399-427, May.
Dinah Rosenberg & Eilon Solan & Nicolas Vieille, 2004. "Timing Games with Informational Externalities," Levine's Working Paper Archive 122247000000000704, David K. Levine.
Bonatti, Alessandro & Hörner, Johannes, 2017. "Learning to disagree in a game of experimentation," Journal of Economic Theory, Elsevier, vol. 169(C), pages 234-269.
- Alessandro Bonatti & Johannes Horner, 2015. "Learning to Disagree in a Game of Experimentation," Cowles Foundation Discussion Papers 1991, Cowles Foundation for Research in Economics, Yale University.
- Bonatti, Alessandro & Hörner, Johannes, 2017. "Learning to Disagree in a Game of Experimentation," TSE Working Papers 17-791, Toulouse School of Economics (TSE).

More about this item

Keywords

Strategic experimentation; Two-armed bandit; Strong long-run average criterion; Markov perfect equilibrium; HJB equation; Viscosity solution

JEL classification:

C73 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Stochastic and Dynamic Games; Evolutionary Games
D83 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Search; Learning; Information and Knowledge; Communication; Belief; Unawareness

NEP fields

This paper has been announced in the following NEP Reports:

NEP-ORE-2019-10-21 (Operations Research)
