IDEAS home Printed from https://ideas.repec.org/p/arx/papers/0710.0114.html
   My bibliography  Save this paper

Reinforcement learning in market games

Author

Listed:
  • Edward W. Piotrowski
  • Jan Sladkowski
  • Anna Szczypinska

Abstract

Financial markets investors are involved in many games -- they must interact with other agents to achieve their goals. Among them are those directly connected with their activity on markets but one cannot neglect other aspects that influence human decisions and their performance as investors. Distinguishing all subgames is usually beyond hope and resource consuming. In this paper we study how investors facing many different games, gather information and form their decision despite being unaware of the complete structure of the game. To this end we apply reinforcement learning methods to the Information Theory Model of Markets (ITMM). Following Mengel, we can try to distinguish a class $\Gamma$ of games and possible actions (strategies) $a^{i}_{m_{i}}$ for $i-$th agent. Any agent divides the whole class of games into analogy subclasses she/he thinks are analogous and therefore adopts the same strategy for a given subclass. The criteria for partitioning are based on profit and costs analysis. The analogy classes and strategies are updated at various stages through the process of learning. This line of research can be continued in various directions.

Suggested Citation

  • Edward W. Piotrowski & Jan Sladkowski & Anna Szczypinska, 2007. "Reinforcement learning in market games," Papers 0710.0114, arXiv.org.
  • Handle: RePEc:arx:papers:0710.0114
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/0710.0114
    File Function: Latest version
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Fudenberg, Drew & Levine, David, 1998. "Learning in games," European Economic Review, Elsevier, vol. 42(3-5), pages 631-639, May.
    2. Piotrowski, E.W & Sładkowski, J, 2002. "Quantum market games," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 312(1), pages 208-216.
    3. Piotrowski, Edward W & Sładkowski, Jan, 2004. "Arbitrage risk induced by transaction costs," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 331(1), pages 233-239.
    4. Katarzyna Miakisz & Edward W. Piotrowski & Jan Sladkowski, "undated". "Quantization of Games: Towards Quantum Artificial Intelligence," Departmental Working Papers 21, University of Bialtystok, Department of Theoretical Physics.
    5. Piotrowski, Edward W., 2003. "Fixed point theorem for simple quantum strategies in quantum market games," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 324(1), pages 196-200.
    6. Mengel, Friederike, 2012. "Learning across games," Games and Economic Behavior, Elsevier, vol. 74(2), pages 601-619.
    7. Shneyerov, Art & Wong, Adam Chi Leung, 2007. "The Rate of Convergence to Perfect Competition of a Simple Matching and Bargaining Mechanism," Microeconomics.ca working papers shneyerov-07-05-01-03-43-, Vancouver School of Economics, revised 01 May 2007.
    8. Drew Fudenberg & David K. Levine, 1998. "The Theory of Learning in Games," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262061945, April.
    9. E. W. Piotrowski & M. Schroeder, 2007. "Kelly criterion revisited: optimal bets," The European Physical Journal B: Condensed Matter and Complex Systems, Springer;EDP Sciences, vol. 57(2), pages 201-203, May.
    10. Piotrowski, E.W. & Sładkowski, J., 2003. "The merchandising mathematician model: profit intensities," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 318(3), pages 496-504.
    11. Edward W. Piotrowski & Jan Sladkowski, "undated". "Quantum Computer: An Appliance for Playing Market Games," Departmental Working Papers 16, University of Bialtystok, Department of Theoretical Physics.
    12. Edward W. Piotrowski & Jerzy Luczka, "undated". "The relativistic velocity addition law optimizes a forecast gambler's profit," Departmental Working Papers 31, University of Bialtystok, Department of Theoretical Physics.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Pakuła, Ireneusz & Piotrowski, Edward W. & Sładkowski, Jan, 2007. "Universality of measurements on quantum markets," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 385(1), pages 397-405.
    2. Friederike Mengel & Emanuela Sciubba, 2010. "Extrapolation in Games of Coordination and Dominance Solvable Games," Working Papers 2010.148, Fondazione Eni Enrico Mattei.
    3. Mohlin, Erik, 2012. "Evolution of theories of mind," Games and Economic Behavior, Elsevier, vol. 75(1), pages 299-318.
    4. Jehiel, Philippe & Singh, Juni, 2021. "Multi-state choices with aggregate feedback on unfamiliar alternatives," Games and Economic Behavior, Elsevier, vol. 130(C), pages 1-24.
    5. Daskalova, Vessela & Vriend, Nicolaas J., 2021. "Learning frames," Journal of Economic Behavior & Organization, Elsevier, vol. 191(C), pages 78-96.
    6. Edward W. Piotrowski, "undated". "Quantum Mind. A sketch (in Polish)," Departmental Working Papers 120pl, University of Bialtystok, Department of Theoretical Physics.
    7. Edward W. Piotrowski & Jan Sladkowski, "undated". "Quantum Transmemetic Intelligence," Departmental Working Papers 27, University of Bialtystok, Department of Theoretical Physics.
    8. Christoph March, 2011. "Adaptive social learning," Working Papers halshs-00572528, HAL.
    9. Galbiati, Marco & Soramäki, Kimmo, 2011. "An agent-based model of payment systems," Journal of Economic Dynamics and Control, Elsevier, vol. 35(6), pages 859-875, June.
    10. Schipper, Burkhard C., 2021. "Discovery and equilibrium in games with unawareness," Journal of Economic Theory, Elsevier, vol. 198(C).
    11. Mathieu Faure & Gregory Roth, 2010. "Stochastic Approximations of Set-Valued Dynamical Systems: Convergence with Positive Probability to an Attractor," Mathematics of Operations Research, INFORMS, vol. 35(3), pages 624-640, August.
    12. Ianni, A., 2002. "Reinforcement learning and the power law of practice: some analytical results," Discussion Paper Series In Economics And Econometrics 203, Economics Division, School of Social Sciences, University of Southampton.
    13. ,, 2011. "Manipulative auction design," Theoretical Economics, Econometric Society, vol. 6(2), May.
    14. Christian Ewerhart, 2020. "Ordinal potentials in smooth games," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 70(4), pages 1069-1100, November.
    15. Benaïm, Michel & Hofbauer, Josef & Hopkins, Ed, 2009. "Learning in games with unstable equilibria," Journal of Economic Theory, Elsevier, vol. 144(4), pages 1694-1709, July.
    16. Saori Iwanaga & Akira Namatame, 2015. "Hub Agents Determine Collective Behavior," New Mathematics and Natural Computation (NMNC), World Scientific Publishing Co. Pte. Ltd., vol. 11(02), pages 165-181.
    17. Erhao Xie, 2019. "Monetary Payoff and Utility Function in Adaptive Learning Models," Staff Working Papers 19-50, Bank of Canada.
    18. Jacob W. Crandall & Mayada Oudah & Tennom & Fatimah Ishowo-Oloko & Sherief Abdallah & Jean-François Bonnefon & Manuel Cebrian & Azim Shariff & Michael A. Goodrich & Iyad Rahwan, 2018. "Cooperating with machines," Nature Communications, Nature, vol. 9(1), pages 1-12, December.
      • Abdallah, Sherief & Bonnefon, Jean-François & Cebrian, Manuel & Crandall, Jacob W. & Ishowo-Oloko, Fatimah & Oudah, Mayada & Rahwan, Iyad & Shariff, Azim & Tennom,, 2017. "Cooperating with Machines," TSE Working Papers 17-806, Toulouse School of Economics (TSE).
      • Abdallah, Sherief & Bonnefon, Jean-François & Cebrian, Manuel & Crandall, Jacob W. & Ishowo-Oloko, Fatimah & Oudah, Mayada & Rahwan, Iyad & Shariff, Azim & Tennom,, 2017. "Cooperating with Machines," IAST Working Papers 17-68, Institute for Advanced Study in Toulouse (IAST).
      • Jacob Crandall & Mayada Oudah & Fatimah Ishowo-Oloko Tennom & Fatimah Ishowo-Oloko & Sherief Abdallah & Jean-François Bonnefon & Manuel Cebrian & Azim Shariff & Michael Goodrich & Iyad Rahwan, 2018. "Cooperating with machines," Post-Print hal-01897802, HAL.
    19. Dieter Balkenborg & Rosemarie Nagel, 2016. "An Experiment on Forward vs. Backward Induction: How Fairness and Level k Reasoning Matter," German Economic Review, Verein für Socialpolitik, vol. 17(3), pages 378-408, August.
    20. B Kelsey Jack, 2009. "Auctioning Conservation Contracts in Indonesia - Participant Learning in Multiple Trial Rounds," CID Working Papers 35, Center for International Development at Harvard University.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:0710.0114. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.