Modelling Cournot Games as Multi-agent Multi-armed Bandits

My bibliography Save this paper

Modelling Cournot Games as Multi-agent Multi-armed Bandits

Author

Listed:

Kshitija Taywade
Brent Harrison
Adib Bagh

Registered:

Adib Bagh

Abstract

We investigate the use of a multi-agent multi-armed bandit (MA-MAB) setting for modeling repeated Cournot oligopoly games, where the firms acting as agents choose from the set of arms representing production quantity (a discrete value). Agents interact with separate and independent bandit problems. In this formulation, each agent makes sequential choices among arms to maximize its own reward. Agents do not have any information about the environment; they can only see their own rewards after taking an action. However, the market demand is a stationary function of total industry output, and random entry or exit from the market is not allowed. Given these assumptions, we found that an $\epsilon$-greedy approach offers a more viable learning mechanism than other traditional MAB approaches, as it does not require any additional knowledge of the system to operate. We also propose two novel approaches that take advantage of the ordered action space: $\epsilon$-greedy+HL and $\epsilon$-greedy+EL. These new approaches help firms to focus on more profitable actions by eliminating less profitable choices and hence are designed to optimize the exploration. We use computer simulations to study the emergence of various equilibria in the outcomes and do the empirical analysis of joint cumulative regrets.

Suggested Citation

Kshitija Taywade & Brent Harrison & Adib Bagh, 2022. "Modelling Cournot Games as Multi-agent Multi-armed Bandits," Papers 2201.01182, arXiv.org.

Handle: RePEc:arx:papers:2201.01182

Download full text from publisher

References listed on IDEAS

Nicholas C. Petruzzi & Maqbool Dada, 1999. "Pricing and the Newsvendor Problem: A Review with Extensions," Operations Research, INFORMS, vol. 47(2), pages 183-194, April.
Fudenberg, Drew & Levine, David, 1998. "Learning in games," European Economic Review, Elsevier, vol. 42(3-5), pages 631-639, May.
- Drew Fudenberg & David K. Levine, 1998. "Learning in Games," Levine's Working Paper Archive 2222, David K. Levine.
Kostas Bimpikis & Shayan Ehsani & Rahmi İlkılıç, 2019. "Cournot Competition in Networked Markets," Management Science, INFORMS, vol. 67(6), pages 2467-2481, June.
Vriend, Nicolaas J., 2000. "An illustration of the essential difference between individual and social learning, and its consequences for computational analyses," Journal of Economic Dynamics and Control, Elsevier, vol. 24(1), pages 1-19, January.
- Nicolaas J. Vriend, 1998. "An Illustration of the Essential Difference between Individual and Social Learning, and its Consequences for Computational Analyses," Working Papers 387, Queen Mary University of London, School of Economics and Finance.
Kanishka Misra & Eric M. Schwartz & Jacob Abernethy, 2019. "Dynamic Online Pricing with Incomplete Information Using Multiarmed Bandit Experiments," Marketing Science, INFORMS, vol. 38(2), pages 226-252, March.
Waltman, Ludo & Kaymak, Uzay, 2008. "Q-learning agents in a Cournot oligopoly model," Journal of Economic Dynamics and Control, Elsevier, vol. 32(10), pages 3275-3293, October.
Davide Radi, 2017. "Walrasian versus Cournot behavior in an oligopoly of boundedly rational firms," Journal of Evolutionary Economics, Springer, vol. 27(5), pages 933-961, November.
Fernando Vega-Redondo, 1997. "The Evolution of Walrasian Behavior," Econometrica, Econometric Society, vol. 65(2), pages 375-384, March.
- Fernando Vega Redondo, 1996. "The evolution of walrasian behavior," Working Papers. Serie AD 1996-05, Instituto Valenciano de Investigaciones Económicas, S.A. (Ivie).
Jasmina Arifovic & Michael Maschek, 2006. "Revisiting Individual Evolutionary Learning in the Cobweb Model – An Illustration of the Virtual Spite-Effect," Computational Economics, Springer;Society for Computational Economics, vol. 28(4), pages 333-354, November.
Drew Fudenberg & David K. Levine, 1998. "The Theory of Learning in Games," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262061945, December.
- Drew Fudenberg & David K. Levine, 1996. "The Theory of Learning in Games," Levine's Working Paper Archive 624, David K. Levine.
Bischi, Gian Italo & Lamantia, Fabio & Radi, Davide, 2015. "An evolutionary Cournot model with limited market knowledge," Journal of Economic Behavior & Organization, Elsevier, vol. 116(C), pages 219-238.
Thomas Riechmann, 2006. "Cournot or Walras? Long-Run Results in Oligopoly Games," Journal of Institutional and Theoretical Economics (JITE), Mohr Siebeck, Tübingen, vol. 162(4), pages 702-720, December.
Steffen Huck & Hans-Theo Normann & Joerg Oechssler, 2004. "Through Trial and Error to Collusion," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 45(1), pages 205-224, February.
Pai, Mallesh & Hansen, Karsten, 2020. "Algorithmic Collusion: Supra-competitive Prices via Independent Algorithms," CEPR Discussion Papers 14372, C.E.P.R. Discussion Papers.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Kshitija Taywade & Brent Harrison & Judy Goldsmith, 2022. "Using Non-Stationary Bandits for Learning in Repeated Cournot Games with Non-Stationary Demand," Papers 2201.00486, arXiv.org.
Alós-Ferrer, Carlos & Buckenmaier, Johannes, 2017. "Cournot vs. Walras: A reappraisal through simulations," Journal of Economic Dynamics and Control, Elsevier, vol. 82(C), pages 257-272.
Junyi Xu, 2021. "Reinforcement Learning in a Cournot Oligopoly Model," Computational Economics, Springer;Society for Computational Economics, vol. 58(4), pages 1001-1024, December.
Anufriev, Mikhail & Kopányi, Dávid, 2018. "Oligopoly game: Price makers meet price takers," Journal of Economic Dynamics and Control, Elsevier, vol. 91(C), pages 84-103.
repec:ebl:ecbull:v:4:y:2006:i:29:p:1-8 is not listed on IDEAS
Waltman, Ludo & Kaymak, Uzay, 2008. "Q-learning agents in a Cournot oligopoly model," Journal of Economic Dynamics and Control, Elsevier, vol. 32(10), pages 3275-3293, October.
Thomas Riechmann, 2006. "Mixed motives in a Cournot game," Economics Bulletin, AccessEcon, vol. 4(29), pages 1-8.
Arthur Charpentier & Romuald Elie & Carl Remlinger, 2020. "Reinforcement Learning in Economics and Finance," Papers 2003.10014, arXiv.org.
Andreas Nicklisch, 2011. "Learning strategic environments: an experimental study of strategy formation and transfer," Theory and Decision, Springer, vol. 71(4), pages 539-558, October.
Arthur Charpentier & Romuald Élie & Carl Remlinger, 2023. "Reinforcement Learning in Economics and Finance," Computational Economics, Springer;Society for Computational Economics, vol. 62(1), pages 425-462, June.
Arifovic, Jasmina & Karaivanov, Alexander, 2010. "Learning by doing vs. learning from others in a principal-agent model," Journal of Economic Dynamics and Control, Elsevier, vol. 34(10), pages 1967-1992, October.
- Jasmina Arifovic & Alexander Karaivanov, 2007. "Learning by Doing vs. Learning from Others in a Principal-Agent Model," Discussion Papers dp07-24, Department of Economics, Simon Fraser University.
Alós-Ferrer, Carlos & Ritschel, Alexander, 2021. "Multiple behavioral rules in Cournot oligopolies," Journal of Economic Behavior & Organization, Elsevier, vol. 183(C), pages 250-267.
- Carlos Alós-Ferrer & Alexander Ritschel, 2019. "Multiple behavioral rules in Cournot oligopolies," ECON - Working Papers 331, Department of Economics - University of Zurich, revised Jul 2020.
Waltman, L. & van Eck, N.J.P., 2009. "A Mathematical Analysis of the Long-run Behavior of Genetic Algorithms for Social Modeling," ERIM Report Series Research in Management ERS-2009-011-LIS, Erasmus Research Institute of Management (ERIM), ERIM is the joint research institute of the Rotterdam School of Management, Erasmus University and the Erasmus School of Economics (ESE) at Erasmus University Rotterdam.
Gian Italo Bischi & Fabio Lamantia & Davide Radi, 2018. "Evolutionary oligopoly games with heterogeneous adaptive players," Chapters, in: Luis C. Corchón & Marco A. Marini (ed.), Handbook of Game Theory and Industrial Organization, Volume I, chapter 12, pages 343-370, Edward Elgar Publishing.
Floortje Alkemade & Han Poutré & Hans Amman, 2006. "Robust Evolutionary Algorithm Design for Socio-economic Simulation," Computational Economics, Springer;Society for Computational Economics, vol. 28(4), pages 355-370, November.
Vallée, Thomas & YIldIzoglu, Murat, 2009. "Convergence in the finite Cournot oligopoly with social and individual learning," Journal of Economic Behavior & Organization, Elsevier, vol. 72(2), pages 670-690, November.
- Thomas Vallée & Murat Yildizoglu, 2007. "Convergence in Finite Cournot Oligopoly with Social and Individual Learning," Post-Print hal-00293948, HAL.
- Thomas Vallée & Murat Yildizoglu, 2009. "Convergence in the Finite Cournot Oligopoly with Social and Individual Learning," Working Papers halshs-00368274, HAL.
- Thomas Vallée & Murat Yildizoğlu, 2009. "Convergence in the finite Cournot oligopoly with social and individual learning," Post-Print hal-00722790, HAL.
- Thomas VALLEE & Murat YILDIZOGLU, 2007. "Convergence in Finite Cournot Oligopoly with Social and Individual Learning," Cahiers du GREThA (2007-2019) 2007-07, Groupe de Recherche en Economie Théorique et Appliquée (GREThA).
- Murat Yildizoglu & Thomas Vallée, 2007. "Convergence in Finite Cournot Oligopoly with Social and Individual Learning," Post-Print hal-00394413, HAL.
- Thomas Vallée & Murat Yildizoglu, 2007. "Convergence in Finite Cournot Oligopoly with Social and Individual Learning," Post-Print hal-00293929, HAL.
Gian Italo Bischi & Fabio Lamantia, 2022. "Evolutionary oligopoly games with cooperative and aggressive behaviors," Journal of Economic Interaction and Coordination, Springer;Society for Economic Science with Heterogeneous Interacting Agents, vol. 17(1), pages 3-27, January.
Peter Duersch & Albert Kolb & Jörg Oechssler & Burkhard Schipper, 2010. "Rage against the machines: how subjects play against learning algorithms," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 43(3), pages 407-430, June.
Davide Radi, 2017. "Walrasian versus Cournot behavior in an oligopoly of boundedly rational firms," Journal of Evolutionary Economics, Springer, vol. 27(5), pages 933-961, November.
Bergin, James & Bernhardt, Dan, 2009. "Cooperation through imitation," Games and Economic Behavior, Elsevier, vol. 67(2), pages 376-388, November.
- James Bergin & Dan Bernhardt, 2006. "Cooperation Through Imitation," Working Paper 1042, Economics Department, Queen's University.
Mikhail Anufriev & Davide Radi & Fabio Tramontana, 2018. "Some reflections on past and future of nonlinear dynamics in economics and finance," Decisions in Economics and Finance, Springer;Associazione per la Matematica, vol. 41(2), pages 91-118, November.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-COM-2022-01-31 (Industrial Competition)
NEP-GTH-2022-01-31 (Game Theory)
NEP-IND-2022-01-31 (Industrial Organization)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2201.01182. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Modelling Cournot Games as Multi-agent Multi-armed Bandits

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data