IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v6y2018i8p131-d160726.html
   My bibliography  Save this article

A Markovian Mechanism of Proportional Resource Allocation in the Incentive Model as a Dynamic Stochastic Inverse Stackelberg Game

Author

Listed:
  • Grigory Belyavsky

    (Vorovich Institute of Mathematics, Mechanics and Computer Sciences, Southern Federal University, ul. Milchakova 8A, Rostov-on-Don 344090, Russia)

  • Natalya Danilova

    (Vorovich Institute of Mathematics, Mechanics and Computer Sciences, Southern Federal University, ul. Milchakova 8A, Rostov-on-Don 344090, Russia)

  • Guennady Ougolnitsky

    (Vorovich Institute of Mathematics, Mechanics and Computer Sciences, Southern Federal University, ul. Milchakova 8A, Rostov-on-Don 344090, Russia)

Abstract

This paper considers resource allocation among producers (agents) in the case where the Principal knows nothing about their cost functions while the agents have Markovian awareness about his/her strategies. We use a dynamic setup of the stochastic inverse Stackelberg game as the model. We suggest an algorithm for solving this game based on Q -learning. The associated Bellman equations contain functions of one variable for the Principal and also for the agents. The new results are illustrated by numerical examples.

Suggested Citation

  • Grigory Belyavsky & Natalya Danilova & Guennady Ougolnitsky, 2018. "A Markovian Mechanism of Proportional Resource Allocation in the Incentive Model as a Dynamic Stochastic Inverse Stackelberg Game," Mathematics, MDPI, vol. 6(8), pages 1-10, July.
  • Handle: RePEc:gam:jmathe:v:6:y:2018:i:8:p:131-:d:160726
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/6/8/131/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/6/8/131/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. G. J. Olsder, 2009. "Phenomena in Inverse Stackelberg Games, Part 2: Dynamic Problems," Journal of Optimization Theory and Applications, Springer, vol. 143(3), pages 601-618, December.
    2. Tharakunnel, Kurian & Bhattacharyya, Siddhartha, 2009. "Single-leader-multiple-follower games with boundedly rational agents," Journal of Economic Dynamics and Control, Elsevier, vol. 33(8), pages 1593-1603, August.
    3. Erev, Ido & Roth, Alvin E, 1998. "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria," American Economic Review, American Economic Association, vol. 88(4), pages 848-881, September.
    4. G. J. Olsder, 2009. "Phenomena in Inverse Stackelberg Games, Part 1: Static Problems," Journal of Optimization Theory and Applications, Springer, vol. 143(3), pages 589-600, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Noortje Groot & Bart Schutter & Hans Hellendoorn, 2016. "Optimal Affine Leader Functions in Reverse Stackelberg Games," Journal of Optimization Theory and Applications, Springer, vol. 168(1), pages 348-374, January.
    2. Engwerda, J.C., 2012. "Prospects of Tools from Differential Games in the Study Of Macroeconomics of Climate Change," Other publications TiSEM cac36d07-227b-4cf2-83cb-7, Tilburg University, School of Economics and Management.
    3. Yurii Averboukh, 2018. "Inverse Stackelberg Solutions for Games with Many Followers," Mathematics, MDPI, vol. 6(9), pages 1-9, August.
    4. Yifen Mu, 2014. "Inverse Stackelberg Public Goods Game with Multiple Hierarchies Under Global and Local Information Structures," Journal of Optimization Theory and Applications, Springer, vol. 163(1), pages 332-350, October.
    5. Olga Gorbaneva & Guennady Ougolnitsky, 2022. "Sustainability of Intertwined Supply Networks: A Game-Theoretic Approach," Games, MDPI, vol. 13(3), pages 1-21, April.
    6. Junyi Xu, 2021. "Reinforcement Learning in a Cournot Oligopoly Model," Computational Economics, Springer;Society for Computational Economics, vol. 58(4), pages 1001-1024, December.
    7. Babajanyan, S.G. & Melkikh, A.V. & Allahverdyan, A.E., 2020. "Leadership scenarios in prisoner’s dilemma game," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 545(C).
    8. Yurii Averboukh & Artem Baklanov, 2014. "Stackelberg Solutions of Differential Games in the Class of Nonanticipative Strategies," Dynamic Games and Applications, Springer, vol. 4(1), pages 1-9, March.
    9. Guennady Ougolnitsky & Anatoly Usov, 2023. "Differential Game-Theoretic Models of Cournot Oligopoly with Consideration of the Green Effect," Games, MDPI, vol. 14(1), pages 1-18, January.
    10. Noah Gans & George Knox & Rachel Croson, 2007. "Simple Models of Discrete Choice and Their Performance in Bandit Experiments," Manufacturing & Service Operations Management, INFORMS, vol. 9(4), pages 383-408, December.
    11. Terry E. Daniel & Eyran J. Gisches & Amnon Rapoport, 2009. "Departure Times in Y-Shaped Traffic Networks with Multiple Bottlenecks," American Economic Review, American Economic Association, vol. 99(5), pages 2149-2176, December.
    12. Iftekhar, M. S. & Tisdell, J. G., 2018. "Learning in repeated multiple unit combinatorial auctions: An experimental study," Working Papers 267301, University of Western Australia, School of Agricultural and Resource Economics.
    13. Ianni, A., 2002. "Reinforcement learning and the power law of practice: some analytical results," Discussion Paper Series In Economics And Econometrics 203, Economics Division, School of Social Sciences, University of Southampton.
    14. Benaïm, Michel & Hofbauer, Josef & Hopkins, Ed, 2009. "Learning in games with unstable equilibria," Journal of Economic Theory, Elsevier, vol. 144(4), pages 1694-1709, July.
    15. Oechssler, Jorg & Schipper, Burkhard, 2003. "Can you guess the game you are playing?," Games and Economic Behavior, Elsevier, vol. 43(1), pages 137-152, April.
    16. Erhao Xie, 2019. "Monetary Payoff and Utility Function in Adaptive Learning Models," Staff Working Papers 19-50, Bank of Canada.
    17. B Kelsey Jack, 2009. "Auctioning Conservation Contracts in Indonesia - Participant Learning in Multiple Trial Rounds," CID Working Papers 35, Center for International Development at Harvard University.
    18. Isabelle Brocas & Juan D. Carrillo, 2022. "The development of randomization and deceptive behavior in mixed strategy games," Quantitative Economics, Econometric Society, vol. 13(2), pages 825-862, May.
    19. James Choi & David Laibson & Brigitte Madrain & Andrew Metrick, 2007. "Reinforcement Learning in Investment Behavior," Levine's Bibliography 122247000000001737, UCLA Department of Economics.
    20. Enkhtaivan, Bolortuya & Davaadorj, Zagdbazar, 2021. "Do they recall their past? CEOs’ liquidity policies across firms as they switch jobs," Journal of Behavioral and Experimental Finance, Elsevier, vol. 29(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:6:y:2018:i:8:p:131-:d:160726. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.