IDEAS home Printed from https://ideas.repec.org/p/ecm/wc2000/0878.html
   My bibliography  Save this paper

Strategic Experimentation: The Case of the Poisson Bandits

Author

Listed:
  • Martin Cripps

    (University of Warwick)

  • Godfrey Keller

    (London School of Economics)

  • Sven Rady

    (University of Munich)

Abstract

This paper studies a game of strategic experimentation in which the players learn from the experiments of others as well as their own. We first establish the efficient benchmark where the players co-ordinate in order to maximise joint expected payoffs, and then show that, because of free-riding, the strategic problem leads to inefficiently low levels of experimentation in any equilibrium when the players use stationary Markovian strategies. Efficiency can be approximately retrieved provided that the players adopt strategies which slow down the rate at which information is acquired; this is achieved by their taking periodic breaks from experimenting, which get progressively longer. In the public information case (actions and experimental outcomes are both observable), we exhibit a class of non-stationary equilibria in which the $\varepsilon$-efficient amount of experimentation is performed, but only in infinite time. In the private information case (only actions are observable, not outcomes), the breaks have two additional effects: not only do they enable the players to finesse the inference problem, but also they serve to signal their experimental outcome to the other player. We describe an equilibrium with similar non-stationary strategies in which the $\varepsilon$-efficient amount of experimentation is again performed in infinite time, but with a faster rate of information acquisition. The equilibrium rate of information acquisition is slower in the former case because the short-run temptation to free-ride on information acquisition is greater when information is public.

Suggested Citation

  • Martin Cripps & Godfrey Keller & Sven Rady, 2000. "Strategic Experimentation: The Case of the Poisson Bandits," Econometric Society World Congress 2000 Contributed Papers 0878, Econometric Society.
  • Handle: RePEc:ecm:wc2000:0878
    as

    Download full text from publisher

    File URL: http://fmwww.bc.edu/RePEc/es2000/0878.pdf
    File Function: main text
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Leslie M. Marx & Steven A. Matthews, 2000. "Dynamic Voluntary Contribution to a Public Project," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 67(2), pages 327-358.
    2. Bergemann, Dirk & Hege, Ulrich, 1998. "Venture capital financing, moral hazard, and learning," Journal of Banking & Finance, Elsevier, vol. 22(6-8), pages 703-735, August.
    3. David A. Malueg & Shunichi O. Tsutsui, 1997. "Dynamic R&D Competition with Learning," RAND Journal of Economics, The RAND Corporation, vol. 28(4), pages 751-772, Winter.
    4. Dirk Bergemann & Ulrigh Hege, 2005. "The Financing of Innovation: Learning and Stopping," RAND Journal of Economics, The RAND Corporation, vol. 36(4), pages 719-752, Winter.
    5. Patrick Bolton & Christopher Harris, 1999. "Strategic Experimentation," Econometrica, Econometric Society, vol. 67(2), pages 349-374, March.
    6. Rothschild, Michael, 1974. "A two-armed bandit theory of market pricing," Journal of Economic Theory, Elsevier, vol. 9(2), pages 185-202, October.
    7. Anat R. Admati & Motty Perry, 1991. "Joint Projects without Commitment," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 58(2), pages 259-276.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Edward Cartwright & Myrna Wooders, 2009. "On equilibrium in pure strategies in games with many players," International Journal of Game Theory, Springer;Game Theory Society, vol. 38(1), pages 137-153, March.
    2. Lukach, R. & Plasmans, J.E.J., 2002. "Measuring Knowledge Spillovers using Patent Citations : Evidence from the Belgian Firm's Data," Other publications TiSEM d78bf59a-e0ff-4451-86b9-1, Tilburg University, School of Economics and Management.
    3. Dinah Rosenberg & Eilon Solan & Nicolas Vieille, 2004. "Timing Games with Informational Externalities," Levine's Working Paper Archive 122247000000000704, David K. Levine.
    4. Bøg, Martin, 2006. "Whom to Observe?," MPRA Paper 8773, University Library of Munich, Germany, revised 14 May 2008.
    5. Dinah Rosenberg & Eilon Solan & Nicolas Vieille, 2007. "Social Learning in One-Arm Bandit Problems," Econometrica, Econometric Society, vol. 75(6), pages 1591-1611, November.
    6. Krähmer, Daniel, 2003. "Learning and self-confidence in contests [Lernen und Selbstvertrauen in Wettkämpfen]," Discussion Papers, Research Unit: Market Processes and Governance SP II 2003-10, WZB Berlin Social Science Center.
    7. Wälde, Klaus, 2001. "Capital accumulation in a model of growth and creative destruction," Dresden Discussion Paper Series in Economics 09/01, Technische Universität Dresden, Faculty of Business and Economics, Department of Economics.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Godfrey Keller & Sven Rady & Martin Cripps, 2005. "Strategic Experimentation with Exponential Bandits," Econometrica, Econometric Society, vol. 73(1), pages 39-68, January.
    2. Heidhues, Paul & Rady, Sven & Strack, Philipp, 2015. "Strategic experimentation with private payoffs," Journal of Economic Theory, Elsevier, vol. 159(PA), pages 531-551.
    3. Johannes Hörner & Larry Samuelson, 2013. "Incentives for experimenting agents," RAND Journal of Economics, RAND Corporation, vol. 44(4), pages 632-663, December.
    4. Sorensen, Morten, 2007. "Learning by Investing: Evidence from Venture Capital," SIFR Research Report Series 53, Institute for Financial Research.
    5. Chen, Yi, 2020. "A revision game of experimentation on a common threshold," Journal of Economic Theory, Elsevier, vol. 186(C).
    6. Chen, Chia-Hui & Ishida, Junichiro, 2018. "Hierarchical experimentation," Journal of Economic Theory, Elsevier, vol. 177(C), pages 365-404.
    7. , & ,, 2010. "Strategic experimentation with Poisson bandits," Theoretical Economics, Econometric Society, vol. 5(2), May.
    8. Nicolas Klein & Tymofiy Mylovanov, 2011. "Should the Flatterers be Avoided?," 2011 Meeting Papers 1273, Society for Economic Dynamics.
    9. Doruk Cetemen & Can Urgun & Leeat Yariv, 2023. "Collective Progress: Dynamics of Exit Waves," Journal of Political Economy, University of Chicago Press, vol. 131(9), pages 2402-2450.
    10. Besanko, David & Tong, Jian & Wu, Jianjun, 2016. "Subsidizing research programs with "if" and "when" uncertainty in the face of severe informational constraints," Discussion Paper Series In Economics And Econometrics 1605, Economics Division, School of Social Sciences, University of Southampton.
    11. Georgiadis, George, 2017. "Deadlines and infrequent monitoring in the dynamic provision of public goods," Journal of Public Economics, Elsevier, vol. 152(C), pages 1-12.
    12. Khalil, Fahad & Lawarree, Jacques & Rodivilov, Alexander, 2020. "Learning from failures: Optimal contracts for experimentation and production," Journal of Economic Theory, Elsevier, vol. 190(C).
    13. Arthur Charpentier & Romuald Élie & Carl Remlinger, 2023. "Reinforcement Learning in Economics and Finance," Computational Economics, Springer;Society for Computational Economics, vol. 62(1), pages 425-462, June.
    14. Rosenberg, Dinah & Salomon, Antoine & Vieille, Nicolas, 2013. "On games of strategic experimentation," Games and Economic Behavior, Elsevier, vol. 82(C), pages 31-51.
    15. Forand, Jean Guillaume, 2015. "Keeping your options open," Journal of Economic Dynamics and Control, Elsevier, vol. 53(C), pages 47-68.
    16. Alessandro Lizzeri & Eran Shmaya & Leeat Yariv, 2024. "Disentangling Exploration from Exploitation," NBER Working Papers 32424, National Bureau of Economic Research, Inc.
    17. Bruno Strulovici, 2010. "Learning While Voting: Determinants of Collective Experimentation," Econometrica, Econometric Society, vol. 78(3), pages 933-971, May.
    18. Alessandro Bonatti & Johannes Horner, 2011. "Collaborating," American Economic Review, American Economic Association, vol. 101(2), pages 632-663, April.
    19. Arthur Charpentier & Romuald Elie & Carl Remlinger, 2020. "Reinforcement Learning in Economics and Finance," Papers 2003.10014, arXiv.org.
    20. Gomes, Renato & Gottlieb, Daniel & Maestri, Lucas, 2016. "Experimentation and project selection: Screening and learning," Games and Economic Behavior, Elsevier, vol. 96(C), pages 145-169.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ecm:wc2000:0878. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Christopher F. Baum (email available below). General contact details of provider: https://edirc.repec.org/data/essssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.