Single-leader-multiple-follower games with boundedly rational agents

My bibliography Save this article

Single-leader-multiple-follower games with boundedly rational agents

Author

Listed:

Tharakunnel, Kurian
Bhattacharyya, Siddhartha

Registered:

Abstract

This paper studies a class of hierarchical games called single-leader-multiple-follower games (SLMFGs) that have important applications in economics and engineering. We consider such games in the context of boundedly rational agents that are limited in the information and computational power they may possess. Agents in our SLMFG are modeled as adaptive learners that use simple reinforcement learning schemes to learn their optimal behavior. The proposed learning approach is illustrated using a well-studied problem in economics. It is shown that with a patiently learning leader the repeated plays of the game result in approximate equilibrium outcomes.

Suggested Citation

Tharakunnel, Kurian & Bhattacharyya, Siddhartha, 2009. "Single-leader-multiple-follower games with boundedly rational agents," Journal of Economic Dynamics and Control, Elsevier, vol. 33(8), pages 1593-1603, August.

Handle: RePEc:eee:dyncon:v:33:y:2009:i:8:p:1593-1603

Download full text from publisher

As the access to this document is restricted, you may want to search for a different version of it.

References listed on IDEAS

Fudenberg, Drew & Levine, David, 1998. "Learning in games," European Economic Review, Elsevier, vol. 42(3-5), pages 631-639, May.
- Drew Fudenberg & David K. Levine, 1998. "Learning in Games," Levine's Working Paper Archive 2222, David K. Levine.
Kalai, Ehud & Ledyard, John O., 1998. "Repeated Implementation," Journal of Economic Theory, Elsevier, vol. 83(2), pages 308-317, December.
- Kalai, Ehud & Ledyard, John, 1997. "Repeated Implementation," Working Papers 1027, California Institute of Technology, Division of the Humanities and Social Sciences.
- Ehud Kalai & John O. Ledyard, 1997. "Repeated Implementation," Discussion Papers 1205, Northwestern University, Center for Mathematical Studies in Economics and Management Science.
Kutschinski, Erich & Uthmann, Thomas & Polani, Daniel, 2003. "Learning competitive pricing strategies by multi-agent reinforcement learning," Journal of Economic Dynamics and Control, Elsevier, vol. 27(11), pages 2207-2218.
William A. Brock & Cars H. Hommes, 1997. "A Rational Route to Randomness," Econometrica, Econometric Society, vol. 65(5), pages 1059-1096, September.
- Brock, W.A. & Hommes, C.H., 1995. "Rational Routes to Randomness," Working papers 9506, Wisconsin Madison - Social Systems.
- Brock, W.A., 1995. "A Rational Route to Randomness," Working papers 9530, Wisconsin Madison - Social Systems.
- William A. Brock & Cars H. Hommes, 1995. "Rational Routes to Randomness," Working Papers 95-03-029, Santa Fe Institute.
- Brock, W.A. & Hommes, C.H., 1996. "A Rational Route to Randomness," Working papers 9530r, Wisconsin Madison - Social Systems.
William A. Brock & Cars H. Hommes, 2001. "A Rational Route to Randomness," Chapters, in: W. D. Dechert (ed.), Growth Theory, Nonlinear Dynamics and Economic Modelling, chapter 16, pages 402-438, Edward Elgar Publishing.
- William A. Brock & Cars H. Hommes, 1997. "A Rational Route to Randomness," Econometrica, Econometric Society, vol. 65(5), pages 1059-1096, September.
- Brock, W.A. & Hommes, C.H., 1995. "Rational Routes to Randomness," Working papers 9506, Wisconsin Madison - Social Systems.
- William A. Brock & Cars H. Hommes, 1995. "Rational Routes to Randomness," Working Papers 95-03-029, Santa Fe Institute.
- Brock, W.A. & Hommes, C.H., 1996. "A Rational Route to Randomness," Working papers 9530r, Wisconsin Madison - Social Systems.
Vallee, Thomas & Basar, Tamer, 1999. "Off-Line Computation of Stackelberg Solutions with the Genetic Algorithm," Computational Economics, Springer;Society for Computational Economics, vol. 13(3), pages 201-209, June.
- Thomas Vallée & Tamer Başar, 1999. "Off-line computation of Stackelberg solutions with the genetic algorithm," Post-Print hal-03193665, HAL.
Alemdar, Nedim M. & Sirakaya, Sibel, 2003. "On-line computation of Stackelberg equilibria with synchronous parallel genetic algorithms," Journal of Economic Dynamics and Control, Elsevier, vol. 27(8), pages 1503-1515, June.
Radner, Roy, 1985. "Repeated Principal-Agent Games with Discounting," Econometrica, Econometric Society, vol. 53(5), pages 1173-1198, September.
- R. Radner, 1998. "Repeated Principal Agent Games With Discounting," Levine's Working Paper Archive 618, David K. Levine.
Erev, Ido & Roth, Alvin E, 1998. "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria," American Economic Review, American Economic Association, vol. 88(4), pages 848-881, September.
Rudolf Vetschera, 2003. "Experimentation and Learning in Repeated Cooperation," Computational and Mathematical Organization Theory, Springer, vol. 9(1), pages 37-60, May.
Drew Fudenberg & David K. Levine, 1998. "The Theory of Learning in Games," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262061945, December.
- Drew Fudenberg & David K. Levine, 1996. "The Theory of Learning in Games," Levine's Working Paper Archive 624, David K. Levine.
Kutschinski, Erich & Uthmann, Thomas & Polani, Daniel, 2003. "Learning competitive pricing strategies by multi-agent reinforcement learning," Journal of Economic Dynamics and Control, Elsevier, vol. 27(11-12), pages 2207-2218, September.
Waltman, Ludo & Kaymak, Uzay, 2008. "Q-learning agents in a Cournot oligopoly model," Journal of Economic Dynamics and Control, Elsevier, vol. 32(10), pages 3275-3293, October.
Groves, Theodore, 1973. "Incentives in Teams," Econometrica, Econometric Society, vol. 41(4), pages 617-631, July.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Junyi Xu, 2021. "Reinforcement Learning in a Cournot Oligopoly Model," Computational Economics, Springer;Society for Computational Economics, vol. 58(4), pages 1001-1024, December.
Guoling Wang & Miao Wang & Hui Yang & Guanghui Yang & Chun Wang, 2024. "Existence of $$\alpha $$ α -Robust Weak Nash Equilibria for Leader–Follower Population Games with Fuzzy Parameters," Journal of Optimization Theory and Applications, Springer, vol. 203(3), pages 2739-2758, December.
Grigory Belyavsky & Natalya Danilova & Guennady Ougolnitsky, 2018. "A Markovian Mechanism of Proportional Resource Allocation in the Incentive Model as a Dynamic Stochastic Inverse Stackelberg Game," Mathematics, MDPI, vol. 6(8), pages 1-10, July.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Hanaki, Nobuyuki & Ishikawa, Ryuichiro & Akiyama, Eizo, 2009. "Learning games," Journal of Economic Dynamics and Control, Elsevier, vol. 33(10), pages 1739-1756, October.
Mauersberger, Felix, 2019. "Thompson Sampling: Endogenously Random Behavior in Games and Markets," VfS Annual Conference 2019 (Leipzig): 30 Years after the Fall of the Berlin Wall - Democracy and Market Economy 203600, Verein für Socialpolitik / German Economic Association.
Waltman, Ludo & Kaymak, Uzay, 2008. "Q-learning agents in a Cournot oligopoly model," Journal of Economic Dynamics and Control, Elsevier, vol. 32(10), pages 3275-3293, October.
Pangallo, Marco & Sanders, James B.T. & Galla, Tobias & Farmer, J. Doyne, 2022. "Towards a taxonomy of learning dynamics in 2 × 2 games," Games and Economic Behavior, Elsevier, vol. 132(C), pages 1-21.
- Marco Pangallo & James Sanders & Tobias Galla & Doyne Farmer, 2017. "Towards a taxonomy of learning dynamics in 2 x 2 games," Papers 1701.09043, arXiv.org, revised Sep 2021.
Duffy, John, 2006. "Agent-Based Models and Human Subject Experiments," Handbook of Computational Economics, in: Leigh Tesfatsion & Kenneth L. Judd (ed.), Handbook of Computational Economics, edition 1, volume 2, chapter 19, pages 949-1011, Elsevier.
- John Duffy, 2004. "Agent-Based Models and Human Subject Experiments," Computational Economics 0412001, University Library of Munich, Germany.
Sonnemans, Joep & Hommes, Cars & Tuinstra, Jan & van de Velden, Henk, 2004. "The instability of a heterogeneous cobweb economy: a strategy experiment on expectation formation," Journal of Economic Behavior & Organization, Elsevier, vol. 54(4), pages 453-481, August.
- Sonnemans, J. & Hommes, C.H. & Tuinstra, J. & van de Velden, H., 1999. "The Instability of a Heterogeneous Cobweb economy: a Strategy Experiment on Expectation Formation," CeNDEF Working Papers 99-06, Universiteit van Amsterdam, Center for Nonlinear Dynamics in Economics and Finance.
Arthur Charpentier & Romuald Élie & Carl Remlinger, 2023. "Reinforcement Learning in Economics and Finance," Computational Economics, Springer;Society for Computational Economics, vol. 62(1), pages 425-462, June.
Mele, Antonio & Molnár, Krisztina & Santoro, Sergio, 2020. "On the perils of stabilizing prices when agents are learning," Journal of Monetary Economics, Elsevier, vol. 115(C), pages 339-353.
- Mele, Antonio & Molnar, Krisztina & Santoro, Sergio, 2014. "On the perils of stabilizing prices when agents are learning," Discussion Paper Series in Economics 1/2015, Norwegian School of Economics, Department of Economics.
- Antonio Mele & Krisztina Molnar & Sergio Santoro, 2015. "On the perils of stabilizing prices when agents are learning," School of Economics Discussion Papers 0215, School of Economics, University of Surrey.
- Mele, Antonio & Molnar, Krisztina & Santoro, Sergio, 2018. "On the perils of stabilizing prices when agents are learning," Discussion Paper Series in Economics 22/2018, Norwegian School of Economics, Department of Economics.
- Antonio Mele & Krisztina Molnár & Sergio Santoro, 2015. "On the Perils of Stabilizing Prices when Agents are Learning," CESifo Working Paper Series 5173, CESifo.
Anufriev, Mikhail & Kopányi, Dávid & Tuinstra, Jan, 2013. "Learning cycles in Bertrand competition with differentiated commodities and competing learning rules," Journal of Economic Dynamics and Control, Elsevier, vol. 37(12), pages 2562-2581.
- Anufriev, M. & Tuinstra, J. & Kopányi, D., 2012. "Learning Cycles in Bertrand Competition with Differentiated Commodities and Competing Learning Rules," CeNDEF Working Papers 12-05, Universiteit van Amsterdam, Center for Nonlinear Dynamics in Economics and Finance.
- Mikhail Anufriev & D?ï¿½vid Kop?ï¿½nyiz & Jan Tuinstra, 2013. "Learning Cycles in Bertrand Competition with Differentiated Commodities and Competing Learning Rules," Working Paper Series 8, Economics Discipline Group, UTS Business School, University of Technology, Sydney.
De Grauwe, Paul & Markiewicz, Agnieszka, 2013. "Learning to forecast the exchange rate: Two competing approaches," Journal of International Money and Finance, Elsevier, vol. 32(C), pages 42-76.
- Paul De Grauwe & Agnieszka Markiewicz, 2006. "Learning to Forecast the Exchange Rate: Two Competing Approaches," CESifo Working Paper Series 1717, CESifo.
- Paul De Grauwe & Agnieszka Markiewicz, 2006. "Learning to Forecast the Exchange Rate: Two Competing Approaches," Computing in Economics and Finance 2006 367, Society for Computational Economics.
Adriaan R. Soetevent, 2006. "Empirics of the Identification of Social Interactions; An Evaluation of the Approaches and Their Results," Journal of Economic Surveys, Wiley Blackwell, vol. 20(2), pages 193-228, April.
Michele Berardi, 2021. "Discrete beliefs space and equilibrium: a cautionary note," Journal of Evolutionary Economics, Springer, vol. 31(2), pages 505-532, April.
- Michele Berardi, 2018. "Discrete beliefs space and equilibrium: a cautionary note," Centre for Growth and Business Cycle Research Discussion Paper Series 242, Economics, The University of Manchester.
Troy Tassier, 2013. "Handbook of Research on Complexity, by J. Barkley Rosser, Jr. and Edward Elgar," Eastern Economic Journal, Palgrave Macmillan;Eastern Economic Association, vol. 39(1), pages 132-133.
Bischi, Gian Italo & Kopel, Michael, 2001. "Equilibrium selection in a nonlinear duopoly game with adaptive expectations," Journal of Economic Behavior & Organization, Elsevier, vol. 46(1), pages 73-100, September.
Michele Berardi, 2011. "Heterogeneous sunspots solutions under learning and replicator dynamics," Centre for Growth and Business Cycle Research Discussion Paper Series 160, Economics, The University of Manchester.
Antonio Doria, Francisco, 2011. "J.B. Rosser Jr. , Handbook of Research on Complexity, Edward Elgar, Cheltenham, UK--Northampton, MA, USA (2009) 436 + viii pp., index, ISBN 978 1 84542 089 5 (cased)," Journal of Economic Behavior & Organization, Elsevier, vol. 78(1-2), pages 196-204, April.
Waters, George A., 2009. "Chaos in the cobweb model with a new learning dynamic," Journal of Economic Dynamics and Control, Elsevier, vol. 33(6), pages 1201-1216, June.
Michele Berardi, 2015. "Expectations formation under adaptive learning and evolutionary dynamics," Centre for Growth and Business Cycle Research Discussion Paper Series 206, Economics, The University of Manchester.
Berardi, Michele, 2015. "On the fragility of sunspot equilibria under learning and evolutionary dynamics," Journal of Economic Behavior & Organization, Elsevier, vol. 112(C), pages 251-265.
- Michele Berardi, 2013. "On the fragility of sunspot equilibria under learning and evolutionary dynamics," Centre for Growth and Business Cycle Research Discussion Paper Series 186, Economics, The University of Manchester.
Georges, Christophre, 2006. "Learning with misspecification in an artificial currency market," Journal of Economic Behavior & Organization, Elsevier, vol. 60(1), pages 70-84, May.

More about this item

Keywords

Leader-follower games Bounded rationality Reinforcement learning;

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:dyncon:v:33:y:2009:i:8:p:1593-1603. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/jedc .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Single-leader-multiple-follower games with boundedly rational agents

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data