IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2210.01380.html
   My bibliography  Save this paper

Inverse Game Theory for Stackelberg Games: the Blessing of Bounded Rationality

Author

Listed:
  • Jibang Wu
  • Weiran Shen
  • Fei Fang
  • Haifeng Xu

Abstract

Optimizing strategic decisions (a.k.a. computing equilibrium) is key to the success of many non-cooperative multi-agent applications. However, in many real-world situations, we may face the exact opposite of this game-theoretic problem -- instead of prescribing equilibrium of a given game, we may directly observe the agents' equilibrium behaviors but want to infer the underlying parameters of an unknown game. This research question, also known as inverse game theory, has been studied in multiple recent works in the context of Stackelberg games. Unfortunately, existing works exhibit quite negative results, showing statistical hardness and computational hardness, assuming follower's perfectly rational behaviors. Our work relaxes the perfect rationality agent assumption to the classic quantal response model, a more realistic behavior model of bounded rationality. Interestingly, we show that the smooth property brought by such bounded rationality model actually leads to provably more efficient learning of the follower utility parameters in general Stackelberg games. Systematic empirical experiments on synthesized games confirm our theoretical results and further suggest its robustness beyond the strict quantal response model.

Suggested Citation

  • Jibang Wu & Weiran Shen & Fei Fang & Haifeng Xu, 2022. "Inverse Game Theory for Stackelberg Games: the Blessing of Bounded Rationality," Papers 2210.01380, arXiv.org.
  • Handle: RePEc:arx:papers:2210.01380
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2210.01380
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Aumann, Robert J., 1997. "Rationality and Bounded Rationality," Games and Economic Behavior, Elsevier, vol. 21(1-2), pages 2-14, October.
    2. Panayotis Mertikopoulos & William H. Sandholm, 2016. "Learning in Games via Reinforcement and Regularization," Mathematics of Operations Research, INFORMS, vol. 41(4), pages 1297-1324, November.
    3. McKelvey Richard D. & Palfrey Thomas R., 1995. "Quantal Response Equilibria for Normal Form Games," Games and Economic Behavior, Elsevier, vol. 10(1), pages 6-38, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Saeed Hadikhanloo & Rida Laraki & Panayotis Mertikopoulos & Sylvain Sorin, 2022. "Learning in nonatomic games, part Ⅰ: Finite action spaces and population games," Post-Print hal-03767995, HAL.
    2. Willemien Kets, 2007. "The minority game: An economics perspective," Papers 0706.4432, arXiv.org.
    3. Marsili, Matteo & Challet, Damien & Zecchina, Riccardo, 2000. "Exact solution of a modified El Farol's bar problem: Efficiency and the role of market impact," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 280(3), pages 522-553.
    4. Emerson Melo, 2022. "On the uniqueness of quantal response equilibria and its application to network games," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 74(3), pages 681-725, October.
    5. Stefanos Leonardos & Iosif Sakos & Costas Courcoubetis & Georgios Piliouras, 2020. "Catastrophe by Design in Population Games: Destabilizing Wasteful Locked-in Technologies," Papers 2007.12877, arXiv.org.
    6. Bosch-Domènech, Antoni & Vriend, Nicolaas J., 2013. "On the role of non-equilibrium focal points as coordination devices," Journal of Economic Behavior & Organization, Elsevier, vol. 94(C), pages 52-67.
    7. Kraemer, Carlo & Noth, Markus & Weber, Martin, 2006. "Information aggregation with costly information and random ordering: Experimental evidence," Journal of Economic Behavior & Organization, Elsevier, vol. 59(3), pages 423-432, March.
    8. Goeree, Jacob K. & Holt, Charles A. & Palfrey, Thomas R., 2002. "Quantal Response Equilibrium and Overbidding in Private-Value Auctions," Journal of Economic Theory, Elsevier, vol. 104(1), pages 247-272, May.
    9. Emmanuel Dechenaux & Dan Kovenock & Roman Sheremeta, 2015. "A survey of experimental research on contests, all-pay auctions and tournaments," Experimental Economics, Springer;Economic Science Association, vol. 18(4), pages 609-669, December.
    10. Steven N. Durlauf & Yannis M. Ioannides, 2010. "Social Interactions," Annual Review of Economics, Annual Reviews, vol. 2(1), pages 451-478, September.
    11. Marco Cipriani & Antonio Guarino, 2009. "Herd Behavior in Financial Markets: An Experiment with Financial Market Professionals," Journal of the European Economic Association, MIT Press, vol. 7(1), pages 206-233, March.
    12. Dutta, Rohan & Levine, David Knudsen & Modica, Salvatore, 2018. "Collusion constrained equilibrium," Theoretical Economics, Econometric Society, vol. 13(1), January.
    13. Ghidoni, Riccardo & Suetens, Sigrid, 2019. "Empirical Evidence on Repeated Sequential Games," Other publications TiSEM ff3a441f-e196-4e45-ba59-c, Tilburg University, School of Economics and Management.
    14. Tien Mai & Arunesh Sinha, 2022. "Safe Delivery of Critical Services in Areas with Volatile Security Situation via a Stackelberg Game Approach," Papers 2204.11451, arXiv.org.
    15. Bhattacharya, Sourav & Duffy, John & Kim, Sun-Tak, 2014. "Compulsory versus voluntary voting: An experimental study," Games and Economic Behavior, Elsevier, vol. 84(C), pages 111-131.
    16. Hoffmann, Magnus & Kolmar, Martin, 2017. "Distributional preferences in probabilistic and share contests," Journal of Economic Behavior & Organization, Elsevier, vol. 142(C), pages 120-139.
    17. Nieken, Petra & Schmitz, Patrick W., 2012. "Repeated moral hazard and contracts with memory: A laboratory experiment," Games and Economic Behavior, Elsevier, vol. 75(2), pages 1000-1008.
    18. David J. Cooper & Hanming Fang, 2008. "Understanding Overbidding in Second Price Auctions: An Experimental Study," Economic Journal, Royal Economic Society, vol. 118(532), pages 1572-1595, October.
    19. Chris Fields & James F. Glazebrook, 2024. "Nash Equilibria and Undecidability in Generic Physical Interactions—A Free Energy Perspective," Games, MDPI, vol. 15(5), pages 1-22, August.
    20. Philippe Jehiel, 2022. "Analogy-Based Expectation Equilibrium and Related Concepts:Theory, Applications, and Beyond," Working Papers halshs-03735680, HAL.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2210.01380. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.