IDEAS home Printed from https://ideas.repec.org/a/sae/jocore/v46y2002i5p629-653.html
   My bibliography  Save this article

Stochastic Collusion and the Power Law of Learning

Author

Listed:
  • ANDREAS FLACHE

    (Department of Sociology University of Groningen, the Netherlands)

  • MICHAEL W. MACY

    (Department of Sociology Cornell University)

Abstract

Concerns about models of cultural adaptation as analogs of genetic selection have led cognitive game theorists to explore learning-theoretic specifications. Two prominent examples, the Bush-Mosteller stochastic learning model and the Roth-Erev payoff-matching model, are aligned and integrated as special cases of a general reinforcement learning model. Both models predict stochastic collusion as a backward-looking solution to the problem of cooperation in social dilemmas based on a random walk into a self-reinforcing cooperative equilibrium. The integration uncovers hidden assumptions that constrain the generality of the theoretical derivations. Specifically, Roth and Erev assume a “power law of learning†—the curious but plausible tendency for learning to diminish with success and intensify with failure. Computer simulation is used to explore the effects on stochastic collusion in three social dilemma games. The analysis shows how the integration of alternative models can uncover underlying principles and lead to a more general theory.

Suggested Citation

  • Andreas Flache & Michael W. Macy, 2002. "Stochastic Collusion and the Power Law of Learning," Journal of Conflict Resolution, Peace Science Society (International), vol. 46(5), pages 629-653, October.
  • Handle: RePEc:sae:jocore:v:46:y:2002:i:5:p:629-653
    DOI: 10.1177/002200202236167
    as

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/002200202236167
    Download Restriction: no

    File URL: https://libkey.io/10.1177/002200202236167?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Fudenberg, Drew & Levine, David, 1998. "Learning in games," European Economic Review, Elsevier, vol. 42(3-5), pages 631-639, May.
    2. Erev, Ido & Roth, Alvin E, 1998. "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria," American Economic Review, American Economic Association, vol. 88(4), pages 848-881, September.
    3. Dawes, Robyn M & Thaler, Richard H, 1988. "Anomalies: Cooperation," Journal of Economic Perspectives, American Economic Association, vol. 2(3), pages 187-197, Summer.
    4. Weibull, Jorgen W., 1998. "Evolution, rationality and equilibrium in games," European Economic Review, Elsevier, vol. 42(3-5), pages 641-649, May.
    5. Roth, Alvin E. & Erev, Ido, 1995. "Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term," Games and Economic Behavior, Elsevier, vol. 8(1), pages 164-212.
    6. Richard J. Herrnstein & Drazen Prelec, 1991. "Melioration: A Theory of Distributed Choice," Journal of Economic Perspectives, American Economic Association, vol. 5(3), pages 137-156, Summer.
    7. Karl H. Schlag & Gregory B. Pollock, 1999. "Social Roles As An Effective Learning Mechanism," Rationality and Society, , vol. 11(4), pages 371-397, November.
    8. Drew Fudenberg & David K. Levine, 1998. "The Theory of Learning in Games," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262061945, December.
    9. Michael D. Cohen & Rick L. Riolo & Robert Axelrod, 2001. "The Role Of Social Structure In The Maintenance Of Cooperative Regimes," Rationality and Society, , vol. 13(1), pages 5-32, February.
    10. Erev, Ido & Bereby-Meyer, Yoella & Roth, Alvin E., 1999. "The effect of adding a constant to all payoffs: experimental investigation, and implications for reinforcement learning models," Journal of Economic Behavior & Organization, Elsevier, vol. 39(1), pages 111-128, May.
    11. Michael W. Macy, 1989. "Walking out of Social Traps," Rationality and Society, , vol. 1(2), pages 197-219, October.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. José Manuel Galán & Luis R. Izquierdo & Segismundo S. Izquierdo & José Ignacio Santos & Ricardo del Olmo & Adolfo López-Paredes & Bruce Edmonds, 2009. "Errors and Artefacts in Agent-Based Modelling," Journal of Artificial Societies and Social Simulation, Journal of Artificial Societies and Social Simulation, vol. 12(1), pages 1-1.
    2. Luis R. Izquierdo & J. Gareth Polhill, 2006. "Is Your Model Susceptible to Floating-Point Errors?," Journal of Artificial Societies and Social Simulation, Journal of Artificial Societies and Social Simulation, vol. 9(4), pages 1-4.
    3. Segismundo S. Izquierdo & Luis R. Izquierdo & Nicholas M. Gotts, 2008. "Reinforcement Learning Dynamics in Social Dilemmas," Journal of Artificial Societies and Social Simulation, Journal of Artificial Societies and Social Simulation, vol. 11(2), pages 1-1.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ido Erev & Eyal Ert & Alvin E. Roth, 2010. "A Choice Prediction Competition for Market Entry Games: An Introduction," Games, MDPI, vol. 1(2), pages 1-20, May.
    2. Ido Erev & Alvin Roth & Robert Slonim & Greg Barron, 2007. "Learning and equilibrium as useful approximations: Accuracy of prediction on randomly selected constant sum games," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 33(1), pages 29-51, October.
    3. Walter Gutjahr, 2006. "Interaction dynamics of two reinforcement learners," Central European Journal of Operations Research, Springer;Slovak Society for Operations Research;Hungarian Operational Research Society;Czech Society for Operations Research;Österr. Gesellschaft für Operations Research (ÖGOR);Slovenian Society Informatika - Section for Operational Research;Croatian Operational Research Society, vol. 14(1), pages 59-86, February.
    4. Faison P. Gibson, 2002. "Is It Better to Forget? Stimulus-Response, Prediction, and the Weight of Past Experience in a Fast-Paced Bargaining Task," Computational and Mathematical Organization Theory, Springer, vol. 8(1), pages 31-47, May.
    5. Duffy, John, 2006. "Agent-Based Models and Human Subject Experiments," Handbook of Computational Economics, in: Leigh Tesfatsion & Kenneth L. Judd (ed.), Handbook of Computational Economics, edition 1, volume 2, chapter 19, pages 949-1011, Elsevier.
    6. Ianni, A., 2002. "Reinforcement learning and the power law of practice: some analytical results," Discussion Paper Series In Economics And Econometrics 203, Economics Division, School of Social Sciences, University of Southampton.
    7. Anthony Ziegelmeyer & Frédéric Koessler & Kene Boun My & Laurent Denant-Boèmont, 2008. "Road Traffic Congestion and Public Information: An Experimental Investigation," Journal of Transport Economics and Policy, University of Bath, vol. 42(1), pages 43-82, January.
    8. DeJong, D.V. & Blume, A. & Neumann, G., 1998. "Learning in Sender-Receiver Games," Other publications TiSEM 4a8b4f46-f30b-4ad2-bb0c-1, Tilburg University, School of Economics and Management.
    9. Jean-François Laslier & Bernard Walliser, 2015. "Stubborn learning," Theory and Decision, Springer, vol. 79(1), pages 51-93, July.
    10. Arifovic, Jasmina & Karaivanov, Alexander, 2010. "Learning by doing vs. learning from others in a principal-agent model," Journal of Economic Dynamics and Control, Elsevier, vol. 34(10), pages 1967-1992, October.
    11. Dürsch, Peter & Kolb, Albert & Oechssler, Jörg & Schipper, Burkhard C., 2005. "Rage Against the Machines: How Subjects Learn to Play Against Computers," Discussion Paper Series of SFB/TR 15 Governance and the Efficiency of Economic Systems 63, Free University of Berlin, Humboldt University of Berlin, University of Bonn, University of Mannheim, University of Munich.
    12. Martino Banchio & Giacomo Mantegazza, 2022. "Artificial Intelligence and Spontaneous Collusion," Papers 2202.05946, arXiv.org, revised Sep 2023.
    13. Gary Charness & Dan Levin, 2003. "Bayesian Updating vs. Reinforcement and Affect: A Laboratory Study," Levine's Bibliography 666156000000000180, UCLA Department of Economics.
    14. Nick Feltovich, 2000. "Reinforcement-Based vs. Belief-Based Learning Models in Experimental Asymmetric-Information," Econometrica, Econometric Society, vol. 68(3), pages 605-642, May.
    15. Gary Charness & Dan Levin, 2005. "When Optimal Choices Feel Wrong: A Laboratory Study of Bayesian Updating, Complexity, and Affect," American Economic Review, American Economic Association, vol. 95(4), pages 1300-1309, September.
    16. Ying-Fang Kao & Ragupathy Venkatachalam, 2021. "Human and Machine Learning," Computational Economics, Springer;Society for Computational Economics, vol. 57(3), pages 889-909, March.
    17. Blume, Andreas & Gneezy, Uri, 2000. "An Experimental Investigation of Optimal Learning in Coordination Games," Journal of Economic Theory, Elsevier, vol. 90(1), pages 161-172, January.
    18. Ho, Teck H. & Camerer, Colin F. & Chong, Juin-Kuan, 2007. "Self-tuning experience weighted attraction learning in games," Journal of Economic Theory, Elsevier, vol. 133(1), pages 177-198, March.
    19. Martin G. Kocher & Matthias Sutter, 2005. "The Decision Maker Matters: Individual Versus Group Behaviour in Experimental Beauty-Contest Games," Economic Journal, Royal Economic Society, vol. 115(500), pages 200-223, January.
    20. Albert Banal-Estañol & Augusto Rupérez-Micola, 2010. "Are agent-based simulations robust? The wholesale electricity trading case," Economics Working Papers 1214, Department of Economics and Business, Universitat Pompeu Fabra.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sae:jocore:v:46:y:2002:i:5:p:629-653. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: SAGE Publications (email available below). General contact details of provider: http://pss.la.psu.edu/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.