IDEAS home Printed from https://ideas.repec.org/p/tse/wpaper/124603.html
   My bibliography  Save this paper

Overcoming Free-Riding in Bandit Games

Author

Listed:
  • Hörner, Johannes
  • Klein, Nicolas
  • Rady, Sven

Abstract

This paper considers a class of experimentation games with L´evy bandits encompassing those of Bolton and Harris (1999) and Keller, Rady and Cripps (2005). Its main result is that efficient (perfect Bayesian) equilibria exist whenever players’ payoffs have a diffusion component. Hence, the trade-offs emphasized in the literature do not rely on the intrinsic nature of bandit models but on the commonly adopted solution concept (MPE). This is not an artifact of continuous time: we prove that such equilibria arise as limits of equilibria in the discretetime game. Furthermore, it suffices to relax the solution concept to strongly symmetric equilibrium.

Suggested Citation

  • Hörner, Johannes & Klein, Nicolas & Rady, Sven, 2020. "Overcoming Free-Riding in Bandit Games," TSE Working Papers 20-1132, Toulouse School of Economics (TSE).
  • Handle: RePEc:tse:wpaper:124603
    as

    Download full text from publisher

    File URL: https://www.tse-fr.eu/sites/default/files/TSE/documents/doc/wp/2020/wp_tse_1132.pdf
    File Function: Full Text
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Johannes Hörner & Larry Samuelson, 2013. "Incentives for experimenting agents," RAND Journal of Economics, RAND Corporation, vol. 44(4), pages 632-663, December.
    2. Drew Fudenberg & David K. Levine & Satoru Takahashi, 2008. "Perfect public equilibrium when players are patient," World Scientific Book Chapters, in: Drew Fudenberg & David K Levine (ed.), A Long-Run Collaboration On Long-Run Games, chapter 16, pages 345-367, World Scientific Publishing Co. Pte. Ltd..
    3. Bruno Biais & Thomas Mariotti & Guillaume Plantin & Jean-Charles Rochet, 2007. "Dynamic Security Design: Convergence to Continuous Time and Asset Pricing Implications," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 74(2), pages 345-390.
    4. Godfrey Keller & Sven Rady & Martin Cripps, 2005. "Strategic Experimentation with Exponential Bandits," Econometrica, Econometric Society, vol. 73(1), pages 39-68, January.
    5. Tomasz Sadzik & Ennio Stacchetti, 2015. "Agency Models With Frequent Actions," Econometrica, Econometric Society, vol. 83, pages 193-237, January.
    6. Simon, Leo K & Stinchcombe, Maxwell B, 1995. "Equilibrium Refinement for Infinite Normal-Form Games," Econometrica, Econometric Society, vol. 63(6), pages 1421-1443, November.
    7. , & ,, 2010. "Strategic experimentation with Poisson bandits," Theoretical Economics, Econometric Society, vol. 5(2), May.
    8. Heidhues, Paul & Rady, Sven & Strack, Philipp, 2015. "Strategic experimentation with private payoffs," Journal of Economic Theory, Elsevier, vol. 159(PA), pages 531-551.
    9. Dutta Prajit K., 1995. "A Folk Theorem for Stochastic Games," Journal of Economic Theory, Elsevier, vol. 66(1), pages 1-32, June.
    10. Abrea Dilip & Pearce David & Stacchetti Ennio, 1993. "Renegotiation and Symmetry in Repeated Games," Journal of Economic Theory, Elsevier, vol. 60(2), pages 217-240, August.
    11. Avinash K. Dixit & Robert S. Pindyck, 1994. "Investment under Uncertainty," Economics Books, Princeton University Press, edition 1, number 5474.
    12. repec:cwl:cwldpp:1726rrr is not listed on IDEAS
    13. repec:cwl:cwldpp:1726rr is not listed on IDEAS
    14. Bergin, James & MacLeod, W Bentley, 1993. "Continuous Time Repeated Games," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 34(1), pages 21-37, February.
    15. Abreu, Dilip & Pearce, David & Stacchetti, Ennio, 1986. "Optimal cartel equilibria with imperfect monitoring," Journal of Economic Theory, Elsevier, vol. 39(1), pages 251-269, June.
    16. Abreu, Dilip, 1986. "Extremal equilibria of oligopolistic supergames," Journal of Economic Theory, Elsevier, vol. 39(1), pages 191-225, June.
    17. Johannes Hörner & Takuo Sugaya & Satoru Takahashi & Nicolas Vieille, 2011. "Recursive Methods in Discounted Stochastic Games: An Algorithm for δ→ 1 and a Folk Theorem," Econometrica, Econometric Society, vol. 79(4), pages 1277-1318, July.
    18. Drew Fudenberg & David K. Levine, 2009. "Repeated Games with Frequent Signals," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 124(1), pages 233-265.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Doruk Cetemen & Can Urgun & Leeat Yariv, 2023. "Collective Progress: Dynamics of Exit Waves," Journal of Political Economy, University of Chicago Press, vol. 131(9), pages 2402-2450.
    2. Hwang, Ilwoo, 2023. "Policy experimentation with repeated elections," Games and Economic Behavior, Elsevier, vol. 142(C), pages 623-644.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sven Rady & Nicolas Klein & Johannes Horner, 2013. "Strongly Symmetric Equilibria in Bandit Games," 2013 Meeting Papers 1107, Society for Economic Dynamics.
    2. Weng, Xi, 2015. "Dynamic pricing in the presence of individual learning," Journal of Economic Theory, Elsevier, vol. 155(C), pages 262-299.
    3. , & ,, 2015. "A folk theorem for stochastic games with infrequent state changes," Theoretical Economics, Econometric Society, vol. 10(1), January.
    4. Keller, Godfrey & Novák, Vladimír & Willems, Tim, 2019. "A note on optimal experimentation under risk aversion," Journal of Economic Theory, Elsevier, vol. 179(C), pages 476-487.
    5. Kaustav Das, 2014. "Strategic Experimentation with Competition and Private Arrival of Information," Discussion Papers 1404, University of Exeter, Department of Economics.
    6. Johannes Hörner & Larry Samuelson, 2013. "Incentives for experimenting agents," RAND Journal of Economics, RAND Corporation, vol. 44(4), pages 632-663, December.
    7. Fudenberg, Drew & Ishii, Yuhta & Kominers, Scott Duke, 2014. "Delayed-response strategies in repeated games with observation lags," Journal of Economic Theory, Elsevier, vol. 150(C), pages 487-514.
    8. Georgiadis, George, 2017. "Deadlines and infrequent monitoring in the dynamic provision of public goods," Journal of Public Economics, Elsevier, vol. 152(C), pages 1-12.
    9. Heidhues, Paul & Rady, Sven & Strack, Philipp, 2015. "Strategic experimentation with private payoffs," Journal of Economic Theory, Elsevier, vol. 159(PA), pages 531-551.
    10. Rodivilov, Alexander, 2022. "Monitoring innovation," Games and Economic Behavior, Elsevier, vol. 135(C), pages 297-326.
    11. Kimmo Berg, 2016. "Elementary Subpaths in Discounted Stochastic Games," Dynamic Games and Applications, Springer, vol. 6(3), pages 304-323, September.
    12. Forand, Jean Guillaume, 2015. "Keeping your options open," Journal of Economic Dynamics and Control, Elsevier, vol. 53(C), pages 47-68.
    13. Xie, Yinxi & Xie, Yang, 2017. "Machiavellian experimentation," Journal of Comparative Economics, Elsevier, vol. 45(4), pages 685-711.
    14. Kaustav Das & Nicolas Klein & Katharina Schmid, 2020. "Strategic experimentation with asymmetric players," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 69(4), pages 1147-1175, June.
    15. Alessandro Lizzeri & Eran Shmaya & Leeat Yariv, 2024. "Disentangling Exploration from Exploitation," NBER Working Papers 32424, National Bureau of Economic Research, Inc.
    16. Klein, Nicolas, 2013. "Strategic learning in teams," Games and Economic Behavior, Elsevier, vol. 82(C), pages 636-657.
    17. Mira Frick & Yuhta Ishii, 2015. "Innovation Adoption by Forward-Looking Social Learners," Cowles Foundation Discussion Papers 1877, Cowles Foundation for Research in Economics, Yale University.
    18. Keller, Godfrey & Rady, Sven, 2015. "Breakdowns," Theoretical Economics, Econometric Society, vol. 10(1), January.
    19. Song, Yangbo & Zhao, Mofei, 2021. "Dynamic R&D competition under uncertainty and strategic disclosure," Journal of Economic Behavior & Organization, Elsevier, vol. 181(C), pages 169-210.
    20. Catherine Bobtcheff & Raphaël Levy, 2017. "More Haste, Less Speed? Signaling through Investment Timing," American Economic Journal: Microeconomics, American Economic Association, vol. 9(3), pages 148-186, August.

    More about this item

    Keywords

    Two-Armed Bandit; Bayesian Learning; Strategic Experimentation; Strongly Symmetric Equilibrium.;
    All these keywords.

    JEL classification:

    • C73 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Stochastic and Dynamic Games; Evolutionary Games
    • D83 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Search; Learning; Information and Knowledge; Communication; Belief; Unawareness

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:tse:wpaper:124603. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: the person in charge (email available below). General contact details of provider: https://edirc.repec.org/data/tsetofr.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.