IDEAS home Printed from https://ideas.repec.org/p/hal/wpaper/hal-00817201.html
   My bibliography  Save this paper

Learning in a Black Box

Author

Listed:
  • Heinrich H. Nax

    (PSE - Paris-Jourdan Sciences Economiques - ENS-PSL - École normale supérieure - Paris - PSL - Université Paris Sciences et Lettres - INRA - Institut National de la Recherche Agronomique - EHESS - École des hautes études en sciences sociales - ENPC - École des Ponts ParisTech - CNRS - Centre National de la Recherche Scientifique, PSE - Paris School of Economics - UP1 - Université Paris 1 Panthéon-Sorbonne - ENS-PSL - École normale supérieure - Paris - PSL - Université Paris Sciences et Lettres - EHESS - École des hautes études en sciences sociales - ENPC - École des Ponts ParisTech - CNRS - Centre National de la Recherche Scientifique - INRAE - Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement)

  • Maxwell N. Burton-Chellew

    (Department of Zoology [Oxford] - University of Oxford)

  • Stuart A. West

    (Department of Zoology [Oxford] - University of Oxford)

  • H. Peyton Young

    (Department of Economics - University of Oxford)

Abstract

Many interactive environments can be represented as games, but they are so large and complex that individual players are in the dark about what others are doing and how their own payo s are a ected. This paper analyzes learning behavior in such 'black box' environments, where players' only source of information is their own history of actions taken and payoff s received. Speci fically we study repeated public goods games, where players must decide how much to contribute at each stage, but they do not know how much others have contributed or how others' contributions a effect their own payoff s. We identify two key features of the players' learning dynamics. First, if a player's realized payoff increases he is less inclined to change his strategy, whereas if his realized payo ff decreases he is more inclined to change his strategy. Second, if increasing his own contribution results in higher payoff s he will tend to increase his contribution still further, whereas the reverse holds if an increase in contribution leads to lower payo ffs. These two e ffects are clearly present when players have no information about the game; moreover they are still present even when players have full information. Convergence to Nash equilibrium occurs at about the same rate in both situations.

Suggested Citation

  • Heinrich H. Nax & Maxwell N. Burton-Chellew & Stuart A. West & H. Peyton Young, 2013. "Learning in a Black Box," Working Papers hal-00817201, HAL.
  • Handle: RePEc:hal:wpaper:hal-00817201
    Note: View the original document on HAL open archive server: https://pjse.hal.science/hal-00817201v2
    as

    Download full text from publisher

    File URL: https://pjse.hal.science/hal-00817201v2/document
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Andreoni, James, 1988. "Why free ride? : Strategies and learning in public goods experiments," Journal of Public Economics, Elsevier, vol. 37(3), pages 291-304, December.
    2. Erev, Ido & Roth, Alvin E, 1998. "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria," American Economic Review, American Economic Association, vol. 88(4), pages 848-881, September.
    3. Sergiu Hart & Andreu Mas-Colell, 2013. "Stochastic Uncoupled Dynamics And Nash Equilibrium," World Scientific Book Chapters, in: Simple Adaptive Strategies From Regret-Matching to Uncoupled Dynamics, chapter 8, pages 165-189, World Scientific Publishing Co. Pte. Ltd..
    4. R. Mark Isaac & James M. Walker, 1988. "Group Size Effects in Public Goods Provision: The Voluntary Contributions Mechanism," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 103(1), pages 179-199.
    5. Selten, Reinhard & Stoecker, Rolf, 1986. "End behavior in sequences of finite Prisoner's Dilemma supergames A learning theory approach," Journal of Economic Behavior & Organization, Elsevier, vol. 7(1), pages 47-70, March.
    6. Roth, Alvin E. & Erev, Ido, 1995. "Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term," Games and Economic Behavior, Elsevier, vol. 8(1), pages 164-212.
    7. , P. & , Peyton, 2006. "Regret testing: learning to play Nash equilibrium without knowing you have an opponent," Theoretical Economics, Econometric Society, vol. 1(3), pages 341-367, September.
    8. Friedman, Daniel & Huck, Steffen & Oprea, Ryan & Weidenholzer, Simon, 2015. "From imitation to collusion: Long-run learning in a low-information environment," Journal of Economic Theory, Elsevier, vol. 155(C), pages 185-205.
    9. Ralph-C. Bayer & Elke Renner & Rupert Sausgruber, 2013. "Confusion and learning in the voluntary contributions game," Experimental Economics, Springer;Economic Science Association, vol. 16(4), pages 478-496, December.
    10. Huck, Steffen & Jehiel, Philippe & Rutter, Tom, 2011. "Feedback spillover and analogy-based expectations: A multi-game experiment," Games and Economic Behavior, Elsevier, vol. 71(2), pages 351-365, March.
    11. Sergiu Hart & Andreu Mas-Colell, 2013. "Uncoupled Dynamics Do Not Lead To Nash Equilibrium," World Scientific Book Chapters, in: Simple Adaptive Strategies From Regret-Matching to Uncoupled Dynamics, chapter 7, pages 153-163, World Scientific Publishing Co. Pte. Ltd..
    12. Larry G. Epstein, 1999. "A Definition of Uncertainty Aversion," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 66(3), pages 579-608.
    13. Oechssler, Jorg & Schipper, Burkhard, 2003. "Can you guess the game you are playing?," Games and Economic Behavior, Elsevier, vol. 43(1), pages 137-152, April.
    14. Germano, Fabrizio & Lugosi, Gabor, 2007. "Global Nash convergence of Foster and Young's regret testing," Games and Economic Behavior, Elsevier, vol. 60(1), pages 135-154, July.
    15. Erev, Ido & Rapoport, Amnon, 1998. "Coordination, "Magic," and Reinforcement Learning in a Market Entry Game," Games and Economic Behavior, Elsevier, vol. 23(2), pages 146-175, May.
    16. Ananish Chaudhuri, 2011. "Sustaining cooperation in laboratory public goods experiments: a selective survey of the literature," Experimental Economics, Springer;Economic Science Association, vol. 14(1), pages 47-83, March.
    17. V. P. Crawford, 2014. "Boundedly rational versus optimization-based models of strategic thinking and learning in games," Voprosy Ekonomiki, NP Voprosy Ekonomiki, issue 5.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jean-François Laslier & Bernard Walliser, 2015. "Stubborn learning," Theory and Decision, Springer, vol. 79(1), pages 51-93, July.
    2. Mäs, Michael & Nax, Heinrich H., 2016. "A behavioral study of “noise” in coordination games," Journal of Economic Theory, Elsevier, vol. 162(C), pages 195-208.
    3. Heinrich H. Nax, 2016. "When is Market the Benchmark? Reinforcement Evidence from Repurchase Decisions," Economics Series Working Papers 781, University of Oxford, Department of Economics.
    4. Nax, Heinrich H. & Murphy, Ryan O. & Helbing, Dirk, 2014. "Stability and welfare of 'merit-based' group-matching mechanisms in voluntary contribution game," LSE Research Online Documents on Economics 65444, London School of Economics and Political Science, LSE Library.
    5. Mäs, Michael & Nax, Heinrich H., 2016. "A behavioral study of “noise” in coordination games," LSE Research Online Documents on Economics 65422, London School of Economics and Political Science, LSE Library.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Nax, Heinrich H. & Burton-Chellew, Maxwell N. & West, Stuart A. & Young, H. Peyton, 2016. "Learning in a black box," Journal of Economic Behavior & Organization, Elsevier, vol. 127(C), pages 1-15.
    2. H Peyton Young & H.H. Nax & M.N. Burton-Chellew & S.A. Westor, 2013. "Learning in a Black Box: Trial-and-Error in Voluntary Contribuitons Games," Economics Series Working Papers 653, University of Oxford, Department of Economics.
    3. Nax, Heinrich H. & Burton-Chellew, Maxwell N. & West, Stuart A. & Young, H. Peyton, 2016. "Learning in a black box," LSE Research Online Documents on Economics 68714, London School of Economics and Political Science, LSE Library.
    4. Heinrich H. Nax & Maxwell N. Burton-Chellew & Stuart A. West & H. Peyton Young, 2013. "Learning in a Black Box," PSE Working Papers hal-00817201, HAL.
    5. Nax, Heinrich H., 2015. "Equity dynamics in bargaining without information exchange," LSE Research Online Documents on Economics 65426, London School of Economics and Political Science, LSE Library.
    6. Heinrich Nax, 2015. "Equity dynamics in bargaining without information exchange," Journal of Evolutionary Economics, Springer, vol. 25(5), pages 1011-1026, November.
    7. Jonathan Newton, 2018. "Evolutionary Game Theory: A Renaissance," Games, MDPI, vol. 9(2), pages 1-67, May.
    8. Jean-François Laslier & Bernard Walliser, 2015. "Stubborn learning," Theory and Decision, Springer, vol. 79(1), pages 51-93, July.
    9. Mäs, Michael & Nax, Heinrich H., 2016. "A behavioral study of “noise” in coordination games," LSE Research Online Documents on Economics 65422, London School of Economics and Political Science, LSE Library.
    10. Mäs, Michael & Nax, Heinrich H., 2016. "A behavioral study of “noise” in coordination games," Journal of Economic Theory, Elsevier, vol. 162(C), pages 195-208.
    11. Heinrich Nax & Bary Pradelski, 2015. "Evolutionary dynamics and equitable core selection in assignment games," International Journal of Game Theory, Springer;Game Theory Society, vol. 44(4), pages 903-932, November.
    12. Nax, Heinrich H. & Pradelski, Bary S. R., 2015. "Evolutionary dynamics and equitable core selection in assignment games," LSE Research Online Documents on Economics 65428, London School of Economics and Political Science, LSE Library.
    13. Yakov Babichenko, 2010. "Completely Uncoupled Dynamics and Nash Equilibria," Discussion Paper Series dp529, The Federmann Center for the Study of Rationality, the Hebrew University, Jerusalem.
    14. Young, H. Peyton, 2009. "Learning by trial and error," Games and Economic Behavior, Elsevier, vol. 65(2), pages 626-643, March.
    15. Burkhard C. Schipper, 2022. "Strategic Teaching and Learning in Games," American Economic Journal: Microeconomics, American Economic Association, vol. 14(3), pages 321-352, August.
    16. Arifovic, Jasmina & Ledyard, John, 2012. "Individual evolutionary learning, other-regarding preferences, and the voluntary contributions mechanism," Journal of Public Economics, Elsevier, vol. 96(9-10), pages 808-823.
    17. Marden, Jason R. & Shamma, Jeff S., 2015. "Game Theory and Distributed Control****Supported AFOSR/MURI projects #FA9550-09-1-0538 and #FA9530-12-1-0359 and ONR projects #N00014-09-1-0751 and #N0014-12-1-0643," Handbook of Game Theory with Economic Applications,, Elsevier.
    18. Ho, Teck H. & Camerer, Colin F. & Chong, Juin-Kuan, 2007. "Self-tuning experience weighted attraction learning in games," Journal of Economic Theory, Elsevier, vol. 133(1), pages 177-198, March.
    19. Nax, Heinrich H. & Murphy, Ryan O. & Helbing, Dirk, 2014. "Stability and welfare of 'merit-based' group-matching mechanisms in voluntary contribution game," LSE Research Online Documents on Economics 65444, London School of Economics and Political Science, LSE Library.
    20. Cox, Caleb A. & Stoddard, Brock, 2018. "Strategic thinking in public goods games with teams," Journal of Public Economics, Elsevier, vol. 161(C), pages 31-43.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hal:wpaper:hal-00817201. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: CCSD (email available below). General contact details of provider: https://hal.archives-ouvertes.fr/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.