A Theoretical Analysis of Cooperative Behavior in Multi-Agent Q-learning
Author
Abstract
Suggested Citation
Download full text from publisher
References listed on IDEAS
- Kandori Michihiro & Rob Rafael, 1995.
"Evolution of Equilibria in the Long Run: A General Theory and Applications,"
Journal of Economic Theory, Elsevier, vol. 65(2), pages 383-414, April.
- M. Kandori & R. Rob, 2010. "Evolution of Equilibria in the Long Run: A General Theory and Applications," Levine's Working Paper Archive 502, David K. Levine.
- Christina Fang & Steven Orla Kimbrough & Stefano Pace & Annapurna Valluri & Zhiqiang Zheng, 2002. "On Adaptive Emergence of Trust Behavior in the Game of Stag Hunt," Group Decision and Negotiation, Springer, vol. 11(6), pages 449-467, November.
Citations
Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
Cited by:
- Waltman, Ludo & Kaymak, Uzay, 2008. "Q-learning agents in a Cournot oligopoly model," Journal of Economic Dynamics and Control, Elsevier, vol. 32(10), pages 3275-3293, October.
Most related items
These are the items that most often cite the same works as this one and are cited by the same works as this one.- Sanjeev Goyal & Fernando Vega-Redondo, 2000.
"Learning, Network Formation and Coordination,"
Econometric Society World Congress 2000 Contributed Papers
0113, Econometric Society.
- Sanjeev Goyal & Fernando Vega-Redondo, 2000. "Learning, Network Formation and Coordination," Tinbergen Institute Discussion Papers 00-093/1, Tinbergen Institute.
- Goyal, S. & Vega-Redondo, F., 2000. "Learning, Network Formation and Coordination," Econometric Institute Research Papers EI 9954-/A, Erasmus University Rotterdam, Erasmus School of Economics (ESE), Econometric Institute.
- Fernando Vega Redondo & Sanjeev Goyal, 2001. "Learning, Network Formation And Coordination," Working Papers. Serie AD 2001-19, Instituto Valenciano de Investigaciones Económicas, S.A. (Ivie).
- Hofbauer, Josef & Sorger, Gerhard, 1999.
"Perfect Foresight and Equilibrium Selection in Symmetric Potential Games,"
Journal of Economic Theory, Elsevier, vol. 85(1), pages 1-23, March.
- Josef HOFBAUER & Gerhard SORGER, 1998. "Perfect Foresight and Equilibrium Selection in Symmetric Potential Games," Vienna Economics Papers vie9802, University of Vienna, Department of Economics.
- Carlos Alós-Ferrer & Georg Kirchsteiger & Markus Walzl, 2010.
"On the Evolution of Market Institutions: The Platform Design Paradox,"
Economic Journal, Royal Economic Society, vol. 120(543), pages 215-243, March.
- Kirchsteiger, Georg & Alos-Ferrer, Carlos & Walzl, Markus, 2006. "On the Evolution of Market Institutions: The Platform Design Paradox," CEPR Discussion Papers 5538, C.E.P.R. Discussion Papers.
- Alos-Ferrer, C. & Kirchsteiger, G. & Walzl, M., 2006. "On the evolution of market institutions: the platform design paradox," Research Memorandum 004, Maastricht University, Maastricht Research School of Economics of Technology and Organization (METEOR).
- Carlos Alós-Ferrer & Georg Kirchsteiger & Markus Walzl, 2007. "On the Evolution of Market Institutions: The Platform Design Paradox," CESifo Working Paper Series 2012, CESifo.
- Carlos Alós-Ferrer & Georg Kirchsteiger & Markus Walzl, 2010. "On the Evolution of Market Institutions: The Platform Design Paradox," ULB Institutional Repository 2013/149586, ULB -- Universite Libre de Bruxelles.
- , & , & ,, 2008.
"Monotone methods for equilibrium selection under perfect foresight dynamics,"
Theoretical Economics, Econometric Society, vol. 3(2), June.
- Oyama, Daisuke & Takahashi, Satoru & Hofbauer, Josef, 2003. "Monotone Methods for Equilibrium Selection under Perfect Foresight Dynamics," MPRA Paper 6721, University Library of Munich, Germany.
- Josef Hofbauer & Daisuke Oyama & Satoru Takahashi, 2004. "Monotone Methods for Equilibrium Selection under Perfect Foresight Dynamics," Econometric Society 2004 North American Winter Meetings 339, Econometric Society.
- Daisuke Oyama & Satoru Takahashi & Josef Hofbauer, 2003. "Monotone Methods for Equilibrium Selection under Perfect Foresight Dynamics," Levine's Bibliography 666156000000000420, UCLA Department of Economics.
- Deisuke Oyama & Satoru Takahashi & Josef Hofbauer, 2003. "Monotone Methods for Equilibrium Selection under Perfect Foresight Dynamics," Vienna Economics Papers vie0318, University of Vienna, Department of Economics.
- Jehiel, Philippe, 1998. "Learning to Play Limited Forecast Equilibria," Games and Economic Behavior, Elsevier, vol. 22(2), pages 274-298, February.
- Hehenkamp, Burkhard & Kaarbøe, Oddvar M., 2004.
"Equilibrium selection in supermodular games with mean payoff technologies,"
Working Papers in Economics
08/04, University of Bergen, Department of Economics.
- Burkhard Hehenkamp & Oddvar Kaarbøe, 2004. "Equilibrium Selection in Supermodular Games with Mean Payoff Technologies," Discussion Papers in Economics 04_05, University of Dortmund, Department of Economics.
- Oechssler, Jorg, 1997.
"An Evolutionary Interpretation of Mixed-Strategy Equilibria,"
Games and Economic Behavior, Elsevier, vol. 21(1-2), pages 203-237, October.
- Joerg Oechssler, 1994. "An Evolutionary Interpretation Of Mixed-Strategy Equilibria," Game Theory and Information 9404001, University Library of Munich, Germany.
- Ennio Bilancini & Leonardo Boncinelli, 2020.
"The evolution of conventions under condition-dependent mistakes,"
Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 69(2), pages 497-521, March.
- Ennio Bilancini & Leonardo Boncinelli, 2016. "The Evolution of Conventions under Condition-Dependent Mistakes," Working Papers - Economics wp2016_11.rdf, Universita' degli Studi di Firenze, Dipartimento di Scienze per l'Economia e l'Impresa.
- Maruta, Toshimasa, 1997.
"On the Relationship between Risk-Dominance and Stochastic Stability,"
Games and Economic Behavior, Elsevier, vol. 19(2), pages 221-234, May.
- Toshimasa Maruta, 1995. "On the Relationship Between Risk-Dominance and Stochastic Stability," Discussion Papers 1122, Northwestern University, Center for Mathematical Studies in Economics and Management Science.
- Fudenberg, Drew & Imhof, Lorens A., 2006.
"Imitation processes with small mutations,"
Journal of Economic Theory, Elsevier, vol. 131(1), pages 251-262, November.
- Drew Fudenberg & Lorens A. Imhof, 2004. "Imitation Processes with Small Mutations," Harvard Institute of Economic Research Working Papers 2050, Harvard - Institute of Economic Research.
- Fudenberg, Drew & Imhof, Lorens, 2006. "Imitation Processes with Small Mutations," Scholarly Articles 3190369, Harvard University Department of Economics.
- Hsiao-Chi Chen & Yunshyong Chow & Li-Chau Wu, 2013. "Imitation, local interaction, and coordination," International Journal of Game Theory, Springer;Game Theory Society, vol. 42(4), pages 1041-1057, November.
- Tanaka, Yasuhito, 2001. "Evolution to equilibrium in an asymmetric oligopoly with differentiated goods," International Journal of Industrial Organization, Elsevier, vol. 19(9), pages 1423-1440, November.
- Khan, Abhimanyu, 2022. "Expected utility versus cumulative prospect theory in an evolutionary model of bargaining," Journal of Economic Dynamics and Control, Elsevier, vol. 137(C).
- Ianni, Antonella, 2000. "Learning correlated equilibria in potential games," Discussion Paper Series In Economics And Econometrics 0012, Economics Division, School of Social Sciences, University of Southampton.
- Alos-Ferrer, Carlos & Weidenholzer, Simon, 2007. "Partial bandwagon effects and local interactions," Games and Economic Behavior, Elsevier, vol. 61(2), pages 179-197, November.
- Kukushkin, Nikolai S., 2015.
"Cournot tatonnement and potentials,"
Journal of Mathematical Economics, Elsevier, vol. 59(C), pages 117-127.
- Kukushkin, Nikolai S., 2012. "Cournot tatonnement and potentials," MPRA Paper 43188, University Library of Munich, Germany.
- Banerjee, Abhijit & Weibull, Jorgen W., 2000.
"Neutrally Stable Outcomes in Cheap-Talk Coordination Games,"
Games and Economic Behavior, Elsevier, vol. 32(1), pages 1-24, July.
- Abhijit Banerjee & Jörgen W. Weibull, "undated". "Neutrally Stable Outcomes in Cheap Talk Coordination Games," ELSE working papers 012, ESRC Centre on Economics Learning and Social Evolution.
- Ana Mauleon & Nils Roehl & Vincent Vannetelbosch, 2014.
"Constitutions and Social Networks,"
Working Papers CIE
74, Paderborn University, CIE Center for International Economics.
- Mauleon, Ana & Roehl, Nils & Vannetelbosch, Vincent, 2015. "Constitutions and Social Networks," Climate Change and Sustainable Development 206451, Fondazione Eni Enrico Mattei (FEEM).
- Ana Mauleon & Nils Roehl & Vincent Vannetelbosch, 2015. "Constitutions and Social Networks," Working Papers 2015.59, Fondazione Eni Enrico Mattei.
- MAULEON, Ana & ROEHL, Nils & VANNETELBOSCH, Vincent, 2014. "Constitutions and social networks," LIDAM Discussion Papers CORE 2014003, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
- Ana Mauleon & Nils Roehl & Vincent Vannetelbosch, 2014. "Constitutions and Social Networks," Working Papers Dissertations 02, Paderborn University, Faculty of Business Administration and Economics.
- Oddvar M. Kaarbøe & Alexander F. Tieman, 0000.
"Equilibrium Selection in Games with Macroeconomic Complementarities,"
Tinbergen Institute Discussion Papers
99-096/1, Tinbergen Institute.
- Kaarboe, O.M. & Tieman, A.F., 2000. "Equilibrium Selection in Games with Macroeconomic Complementarities," Norway; Department of Economics, University of Bergen 2199, Department of Economics, University of Bergen.
- Dawid, Herbert, 2000. "On the emergence of exchange and mediation in a production economy," Journal of Economic Behavior & Organization, Elsevier, vol. 41(1), pages 27-53, January.
More about this item
Keywords
Cooperation; Multi-Agent Q-Learning; Multi-Agent Reinforcement Learning; Nash Equilibrium; Prisoner’s Dilemma;All these keywords.
JEL classification:
- C51 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Model Construction and Estimation
- L15 - Industrial Organization - - Market Structure, Firm Strategy, and Market Performance - - - Information and Product Quality
- M - Business Administration and Business Economics; Marketing; Accounting; Personnel Economics
- O32 - Economic Development, Innovation, Technological Change, and Growth - - Innovation; Research and Development; Technological Change; Intellectual Property Rights - - - Management of Technological Innovation and R&D
NEP fields
This paper has been announced in the following NEP Reports:- NEP-CBE-2006-02-19 (Cognitive and Behavioural Economics)
- NEP-MIC-2006-02-19 (Microeconomics)
Statistics
Access and download statisticsCorrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ems:eureri:7323. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: RePub (email available below). General contact details of provider: https://edirc.repec.org/data/erimanl.html .
Please note that corrections may take a couple of weeks to filter through the various RePEc services.