Printed from https://ideas.repec.org/p/arx/papers/2211.15331.html

On the Emergence of Cooperation in the Repeated Prisoner's Dilemma

Author

Listed:
  • Maximilian Schaefer

Abstract

Using simulations between pairs of $\epsilon$-greedy q-learners with one-period memory, this article demonstrates that the potential function of the stochastic replicator dynamics (Foster and Young, 1990) makes it possible to predict the emergence of error-proof cooperative strategies from the underlying parameters of the repeated prisoner's dilemma. The observed cooperation rates between q-learners are related to the ratio between the kinetic energy exerted by the polar attractors of the replicator dynamics under the grim trigger strategy. The frontier separating the parameter space conducive to cooperation from the parameter space dominated by defection can be found by setting the kinetic energy ratio equal to a critical value, which is a function of the discount factor, $f(\delta) = \delta/(1-\delta)$, multiplied by a correction term that accounts for the effect of the algorithms' exploration probability. The gradient at the frontier increases with the distance between the game parameters and the hyperplane that characterizes the incentive compatibility constraint for cooperation under grim trigger. Building on literature from the neurosciences, which suggests that reinforcement learning is useful for understanding human behavior in risky environments, the article further explores the extent to which the frontier derived for q-learners also explains the emergence of cooperation between humans. Using metadata from laboratory experiments that analyze human choices in the infinitely repeated prisoner's dilemma, the cooperation rates between humans are compared to those observed between q-learners under similar conditions. The correlation coefficients between the cooperation rates observed for humans and those observed for q-learners are consistently above $0.8$. The frontier derived from the simulations between q-learners is also found to predict the emergence of cooperation between humans.
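
To make the setup in the abstract concrete, the following is a minimal sketch of two $\epsilon$-greedy q-learners with one-period memory playing the repeated prisoner's dilemma. It is not the article's code. The payoff values, the learning rate `alpha`, the zero initialization of the Q-tables, the tie-breaking rule, and all function names are assumptions made for illustration. The sketch computes only the critical value $f(\delta) = \delta/(1-\delta)$ from the abstract and the textbook incentive compatibility condition for grim trigger; the kinetic energy ratio and the exploration correction term are not reproduced here.

```python
import random

# Hypothetical stage-game payoffs with T > R > P > S (not taken from the paper).
T, R, P, S = 5.0, 3.0, 1.0, 0.0
C, D = 0, 1  # action labels: cooperate, defect


def payoff(own, other):
    """Stage payoff to a player choosing `own` against `other`."""
    if own == C:
        return R if other == C else S
    return T if other == C else P


def critical_value(delta):
    """f(delta) = delta / (1 - delta), as given in the abstract."""
    return delta / (1.0 - delta)


def grim_trigger_ic(delta):
    """Textbook check: cooperating forever is worth at least a one-shot
    deviation followed by mutual defection forever."""
    return R / (1.0 - delta) >= T + delta * P / (1.0 - delta)


def simulate(delta=0.9, alpha=0.1, epsilon=0.05, periods=20000, seed=0):
    """Return the share of cooperative choices over all periods."""
    rng = random.Random(seed)
    # q[i][state][action]; the state is last period's (own, other) action pair.
    q = [[[0.0, 0.0] for _ in range(4)] for _ in range(2)]
    prev = (rng.randrange(2), rng.randrange(2))  # assumed random initial state
    coop = 0
    for _ in range(periods):
        # Each player encodes the state from its own perspective.
        states = (2 * prev[0] + prev[1], 2 * prev[1] + prev[0])
        acts = []
        for i in range(2):
            if rng.random() < epsilon:
                acts.append(rng.randrange(2))  # explore
            else:
                row = q[i][states[i]]
                acts.append(C if row[C] >= row[D] else D)  # ties go to C (assumption)
        rewards = (payoff(acts[0], acts[1]), payoff(acts[1], acts[0]))
        nxt = (2 * acts[0] + acts[1], 2 * acts[1] + acts[0])
        for i in range(2):
            # Standard Q-learning update with discount factor delta.
            target = rewards[i] + delta * max(q[i][nxt[i]])
            q[i][states[i]][acts[i]] += alpha * (target - q[i][states[i]][acts[i]])
        coop += acts.count(C)
        prev = (acts[0], acts[1])
    return coop / (2 * periods)
```

Calling `simulate(delta=0.9)` returns a cooperation rate between 0 and 1 for one simulated pair. Under these assumed payoffs, `grim_trigger_ic(delta)` holds for $\delta \geq (T-R)/(T-P) = 0.5$, so sweeping `delta` and the payoffs is one way to explore how cooperation rates vary around that constraint.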

Suggested Citation

  • Maximilian Schaefer, 2022. "On the Emergence of Cooperation in the Repeated Prisoner's Dilemma," Papers 2211.15331, arXiv.org, revised Feb 2023.
  • Handle: RePEc:arx:papers:2211.15331

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2211.15331
    File Function: Latest version
    Download Restriction: no