If multi-agent learning is the answer, what is the question?
References listed on IDEAS
- Fudenberg, Drew & Levine, David, 1998. "Learning in games," European Economic Review, Elsevier, vol. 42(3-5), pages 631-639, May.
- Drew Fudenberg & David K. Levine, 1998. "Learning in Games," Levine's Working Paper Archive 2222, David K. Levine.
- Foster, Dean P. & Vohra, Rakesh, 1999. "Regret in the On-Line Decision Problem," Games and Economic Behavior, Elsevier, vol. 29(1-2), pages 7-35, October.
- Fudenberg, Drew & Levine, David K., 1995. "Consistency and cautious fictitious play," Journal of Economic Dynamics and Control, Elsevier, vol. 19(5-7), pages 1065-1089.
- Fudenberg, Drew & Levine, David, 1995. "Consistency and Cautious Fictitious Play," Scholarly Articles 3198694, Harvard University Department of Economics.
- Drew Fudenberg & David K. Levine, 1996. "Consistency and Cautious Fictitious Play," Levine's Working Paper Archive 470, David K. Levine.
- Sergiu Hart & Andreu Mas-Colell, 2013. "A Simple Adaptive Procedure Leading To Correlated Equilibrium," World Scientific Book Chapters, in: Simple Adaptive Strategies From Regret-Matching to Uncoupled Dynamics, chapter 2, pages 17-46, World Scientific Publishing Co. Pte. Ltd.
- Sergiu Hart & Andreu Mas-Colell, 2000. "A Simple Adaptive Procedure Leading to Correlated Equilibrium," Econometrica, Econometric Society, vol. 68(5), pages 1127-1150, September.
- Sergiu Hart & Andreu Mas-Colell, 1996. "A simple adaptive procedure leading to correlated equilibrium," Economics Working Papers 200, Department of Economics and Business, Universitat Pompeu Fabra, revised Dec 1996.
- S. Hart & A. Mas-Colell, 2010. "A Simple Adaptive Procedure Leading to Correlated Equilibrium," Levine's Working Paper Archive 572, David K. Levine.
- Sergiu Hart & Andreu Mas-Colell, 1997. "A Simple Adaptive Procedure Leading to Correlated Equilibrium," Game Theory and Information 9703006, University Library of Munich, Germany, revised 25 Nov 1997.
- Young, H. Peyton, 2004. "Strategic Learning and its Limits," OUP Catalogue, Oxford University Press, number 9780199269181.
- Richard Bellman, 1957. "On a Dynamic Programming Approach to the Caterer Problem--I," Management Science, INFORMS, vol. 3(3), pages 270-278, April.
- Jehiel, Philippe & Samet, Dov, 2005. "Learning to play games in extensive form by valuation," Journal of Economic Theory, Elsevier, vol. 124(2), pages 129-148, October.
- Philippe Jehiel & Dov Samet, 2001. "Learning to play games in extensive form by valuation," Game Theory and Information 0012001, University Library of Munich, Germany.
- Philippe Jehiel & Dov Samet, 2010. "Learning to play games in extensive form by valuation," Levine's Working Paper Archive 391749000000000040, David K. Levine.
- Philippe Jehiel & Dov Samet, 2001. "Learning To Play Games In Extensive Form By Valuation," Levine's Working Paper Archive 391749000000000010, David K. Levine.
- Philippe Jehiel & Dov Samet, 2001. "Learning To Play Games In Extensive Form By Valuation," NajEcon Working Paper Reviews 391749000000000010, www.najecon.org.
- Philippe Jehiel & Dov Samet, 2010. "Learning To Play Games In Extensive Form By Valuation," Levine's Working Paper Archive 391749000000000034, David K. Levine.
- Philippe Jehiel & Dov Samet, 2005. "Learning to play games in extensive form by valuation," Post-Print halshs-00754057, HAL.
- Nachbar, J H, 1990. "'Evolutionary' Selection Dynamics in Games: Convergence and Limit Properties," International Journal of Game Theory, Springer;Game Theory Society, vol. 19(1), pages 59-89.
- Shie Mannor & Nahum Shimkin, 2003. "The Empirical Bayes Envelope and Regret Minimization in Competitive Markov Decision Processes," Mathematics of Operations Research, INFORMS, vol. 28(2), pages 327-345, May.
- Erev, Ido & Roth, Alvin E, 1998. "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria," American Economic Review, American Economic Association, vol. 88(4), pages 848-881, September.
- Drew Fudenberg & David K. Levine, 1998. "The Theory of Learning in Games," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262061945, April.
- Drew Fudenberg & David K. Levine, 1996. "The Theory of Learning in Games," Levine's Working Paper Archive 624, David K. Levine.
- Shih-Fen Cheng & Evan Leung & Kevin M. Lochner & Kevin O'Malley & Daniel M. Reeves & L. Julian Schvartzman & Michael P. Wellman, 2003. "Walverine: A Walrasian Trading Agent," Computational Economics 0302003, University Library of Munich, Germany.
- Arrow, Kenneth J, 1986. "Rationality of Self and Others in an Economic System," The Journal of Business, University of Chicago Press, vol. 59(4), pages 385-399, October.
Citations
Cited by:
- Scott E. Page, 2008. "Uncertainty, Difficulty, and Complexity," Journal of Theoretical Politics, vol. 20(2), pages 115-149, April.
- H. Peyton Young, 2007. "The Possible and the Impossible in Multi-Agent Learning," Economics Series Working Papers 304, University of Oxford, Department of Economics.
- Russell Golman & Scott Page, 2009. "General Blotto: games of allocative strategic mismatch," Public Choice, Springer, vol. 138(3), pages 279-299, March.
Most related items
These are the items that most often cite the same works as this one and are cited by the same works as this one.
- Germano, Fabrizio & Lugosi, Gabor, 2007. "Global Nash convergence of Foster and Young's regret testing," Games and Economic Behavior, Elsevier, vol. 60(1), pages 135-154, July.
- Fabrizio Germano & Gábor Lugosi, 2004. "Global Nash convergence of Foster and Young's regret testing," Economics Working Papers 788, Department of Economics and Business, Universitat Pompeu Fabra.
- Dean P Foster & Peyton Young, 2006. "Regret Testing Leads to Nash Equilibrium," Levine's Working Paper Archive 784828000000000676, David K. Levine.
- Benaïm, Michel & Hofbauer, Josef & Hopkins, Ed, 2009. "Learning in games with unstable equilibria," Journal of Economic Theory, Elsevier, vol. 144(4), pages 1694-1709, July.
- Ed Hopkins & Josef Hofbauer & Michel Benaim, 2005. "Learning in Games with Unstable Equilibria," Edinburgh School of Economics Discussion Paper Series 135, Edinburgh School of Economics, University of Edinburgh.
- Michel Benaim & Josef Hofbauer & Ed Hopkins, 2006. "Learning in Games with Unstable Equilibria," Levine's Bibliography 321307000000000547, UCLA Department of Economics.
- Michel Benaim & Josef Hofbauer & Ed Hopkins, 2005. "Learning in Games with Unstable Equilibria," Levine's Bibliography 784828000000000609, UCLA Department of Economics.
- Emerson Melo, 2021. "Learning in Random Utility Models Via Online Decision Problems," Papers 2112.10993, arXiv.org, revised Aug 2022.
- Hofbauer, Josef & Sandholm, William H., 2009. "Stable games and their dynamics," Journal of Economic Theory, Elsevier, vol. 144(4), pages 1665-1693, July.
- Rene Saran & Roberto Serrano, 2012. "Regret Matching with Finite Memory," Dynamic Games and Applications, Springer, vol. 2(1), pages 160-175, March.
- Rene Saran & Roberto Serrano, 2010. "Regret Matching with Finite Memory," Levine's Working Paper Archive 661465000000000078, David K. Levine.
- Rene Saran & Roberto Serrano, 2010. "Regret matching with finite memory," Working Papers 2010-10, Instituto Madrileño de Estudios Avanzados (IMDEA) Ciencias Sociales.
- Saran, R.R.S. & Serrano, R., 2010. "Regret matching with finite memory," Research Memorandum 033, Maastricht University, Maastricht Research School of Economics of Technology and Organization (METEOR).
- Rene Saran & Roberto Serrano, 2010. "Regret Matching with Finite Memory," Working Papers 2010-10, Brown University, Department of Economics.
- Michel Benaïm & Josef Hofbauer & Sylvain Sorin, 2006. "Stochastic Approximations and Differential Inclusions, Part II: Applications," Mathematics of Operations Research, INFORMS, vol. 31(4), pages 673-695, November.
- Michel Benaïm & Josef Hofbauer & Sylvain Sorin, 2005. "Stochastic Approximations and Differential Inclusions; Part II: Applications," Working Papers hal-00242974, HAL.
- Ying-Fang Kao & Ragupathy Venkatachalam, 2021. "Human and Machine Learning," Computational Economics, Springer;Society for Computational Economics, vol. 57(3), pages 889-909, March.
- Jehiel, Philippe & Samet, Dov, 2005. "Learning to play games in extensive form by valuation," Journal of Economic Theory, Elsevier, vol. 124(2), pages 129-148, October.
- Philippe Jehiel & Dov Samet, 2001. "Learning to play games in extensive form by valuation," Game Theory and Information 0012001, University Library of Munich, Germany.
- Philippe Jehiel & Dov Samet, 2010. "Learning to play games in extensive form by valuation," Levine's Working Paper Archive 391749000000000040, David K. Levine.
- Philippe Jehiel & Dov Samet, 2001. "Learning To Play Games In Extensive Form By Valuation," Levine's Working Paper Archive 391749000000000010, David K. Levine.
- Philippe Jehiel & Dov Samet, 2001. "Learning To Play Games In Extensive Form By Valuation," NajEcon Working Paper Reviews 391749000000000010, www.najecon.org.
- Philippe Jehiel & Dov Samet, 2010. "Learning To Play Games In Extensive Form By Valuation," Levine's Working Paper Archive 391749000000000034, David K. Levine.
- Philippe Jehiel & Dov Samet, 2005. "Learning to play games in extensive form by valuation," Post-Print halshs-00754057, HAL.
- Vivaldo M. Mendes & Diana A. Mendes & Orlando Gomes, 2008. "Learning to Play Nash in Deterministic Uncoupled Dynamics," Working Papers Series 1 ercwp1808, ISCTE-IUL, Business Research Unit (BRU-IUL).
- Eric Friedman & Scott Shenker & Amy Greenwald, 1998. "Learning in Networks Contexts: Experimental Results from Simulations," Departmental Working Papers 199825, Rutgers University, Department of Economics.
- Chernov, G. & Susin, I., 2019. "Models of learning in games: An overview," Journal of the New Economic Association, New Economic Association, vol. 44(4), pages 77-125.
- Sergiu Hart & Andreu Mas-Colell, 2013. "A General Class Of Adaptive Strategies," World Scientific Book Chapters, in: Simple Adaptive Strategies From Regret-Matching to Uncoupled Dynamics, chapter 3, pages 47-76, World Scientific Publishing Co. Pte. Ltd.
- Hart, Sergiu & Mas-Colell, Andreu, 2001. "A General Class of Adaptive Strategies," Journal of Economic Theory, Elsevier, vol. 98(1), pages 26-54, May.
- Sergiu Hart & Andreu Mas-Colell, 1999. "A general class of adaptive strategies," Economics Working Papers 373, Department of Economics and Business, Universitat Pompeu Fabra.
- Sergiu Hart & Andreu Mas-Colell, 1999. "A General Class of Adaptive Strategies," Game Theory and Information 9904001, University Library of Munich, Germany, revised 23 Mar 2000.
- Nicolò Cesa-Bianchi & Gábor Lugosi & Gilles Stoltz, 2006. "Regret Minimization Under Partial Monitoring," Mathematics of Operations Research, INFORMS, vol. 31(3), pages 562-580, August.
- Jim Engle-Warnick & Ed Hopkins, 2006. "A Simple Test of Learning Theory," Levine's Bibliography 321307000000000724, UCLA Department of Economics.
- Jim Engle-Warnick & Ed Hopkins, 2006. "A Simple Test of Learning Theory," CIRANO Working Papers 2006s-30, CIRANO.
- Jim Engle-Warnick & Ed Hopkins, 2006. "A Simple Test of Learning Theory," Edinburgh School of Economics Discussion Paper Series 153, Edinburgh School of Economics, University of Edinburgh.
- Emerson Melo, 2021. "Learning In Random Utility Models Via Online Decision Problems," CAEPR Working Papers 2022-003 Classification-D, Center for Applied Economics and Policy Research, Department of Economics, Indiana University Bloomington.
- Panayotis Mertikopoulos & William H. Sandholm, 2016. "Learning in Games via Reinforcement and Regularization," Mathematics of Operations Research, INFORMS, vol. 41(4), pages 1297-1324, November.
- Fudenberg, Drew & Levine, David K., 1999. "Conditional Universal Consistency," Games and Economic Behavior, Elsevier, vol. 29(1-2), pages 104-130, October.
- Drew Fudenberg & David K. Levine, 1997. "Conditional Universal Consistency," Levine's Working Paper Archive 471, David K. Levine.
- Fudenberg, Drew & Levine, David, 1999. "Conditional Universal Consistency," Scholarly Articles 3204826, Harvard University Department of Economics.
- Ianni, A., 2002. "Reinforcement learning and the power law of practice: some analytical results," Discussion Paper Series In Economics And Econometrics 203, Economics Division, School of Social Sciences, University of Southampton.
- Erhao Xie, 2019. "Monetary Payoff and Utility Function in Adaptive Learning Models," Staff Working Papers 19-50, Bank of Canada.
More about this item
NEP fields
This paper has been announced in the following NEP Reports:
- NEP-CBE-2006-03-05 (Cognitive and Behavioural Economics)
- NEP-GTH-2006-03-05 (Game Theory)
Statistics
Access and download statistics
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:cla:levarc:122247000000001156. See general information about how to correct material in RePEc.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: David K. Levine (email available below). General contact details of provider: http://www.dklevine.com/ .