IDEAS home Printed from https://ideas.repec.org/a/eee/apmaco/v463y2024ics0096300323005337.html
   My bibliography  Save this article

Interaction state Q-learning promotes cooperation in the spatial prisoner's dilemma game

Author

Listed:
  • Yang, Zhengzhi
  • Zheng, Lei
  • Perc, Matjaž
  • Li, Yumeng

Abstract

Many recent studies have used reinforcement learning methods to investigate the behavior of agents in evolutionary games. Q-learning, in particular, has become a mainstream method during this development. Here we introduce Q-learning agents into the evolutionary prisoner's dilemma game on a square lattice. Specifically, we associate the state space of Q-learning agents with the strategies of their neighbors, and we introduce a neighboring reward information sharing mechanism. We thus provide Q-learning agents with the payoff information of their neighbors, in addition to their strategies, which has not been done in previous studies. Through simulations, we show that considering neighborhood payoff information can significantly promote cooperation in the population. Moreover, we show that for an appropriate strength of neighborhood payoff information sharing, a chessboard pattern emerges on the lattice. We analyze in detail the reasons for the emergence of the chessboard pattern and the increase in cooperation frequency, and we also provide a theoretical analysis based on the pair approximation method. We hope that our research will inspire effective approaches for resolving social dilemmas by means of sharing more information among reinforcement learning agents during evolutionary games.

Suggested Citation

  • Yang, Zhengzhi & Zheng, Lei & Perc, Matjaž & Li, Yumeng, 2024. "Interaction state Q-learning promotes cooperation in the spatial prisoner's dilemma game," Applied Mathematics and Computation, Elsevier, vol. 463(C).
  • Handle: RePEc:eee:apmaco:v:463:y:2024:i:c:s0096300323005337
    DOI: 10.1016/j.amc.2023.128364
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0096300323005337
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.amc.2023.128364?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Cao, Xian-Bin & Du, Wen-Bo & Rong, Zhi-Hai, 2010. "The evolutionary public goods game on scale-free networks with heterogeneous investment," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 389(6), pages 1273-1280.
    2. Takahiro Ezaki & Yutaka Horita & Masanori Takezawa & Naoki Masuda, 2016. "Reinforcement Learning Explains Conditional Cooperation and Its Moody Cousin," PLOS Computational Biology, Public Library of Science, vol. 12(7), pages 1-13, July.
    3. Christoph Hauert & Michael Doebeli, 2004. "Spatial structure often inhibits the evolution of cooperation in the snowdrift game," Nature, Nature, vol. 428(6983), pages 643-646, April.
    4. Li, Yumeng & Zhang, Jun & Perc, Matjaž, 2018. "Effects of compassion on the evolution of cooperation in spatial social dilemmas," Applied Mathematics and Computation, Elsevier, vol. 320(C), pages 437-443.
    5. Li, Yumeng & Wang, Hanchen & Du, Wenbo & Perc, Matjaž & Cao, Xianbin & Zhang, Jun, 2019. "Resonance-like cooperation due to transaction costs in the prisoner’s dilemma game," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 521(C), pages 248-257.
    6. Oriol Vinyals & Igor Babuschkin & Wojciech M. Czarnecki & Michaël Mathieu & Andrew Dudzik & Junyoung Chung & David H. Choi & Richard Powell & Timo Ewalds & Petko Georgiev & Junhyuk Oh & Dan Horgan & M, 2019. "Grandmaster level in StarCraft II using multi-agent reinforcement learning," Nature, Nature, vol. 575(7782), pages 350-354, November.
    7. Theodor Cimpeanu & Francisco C. Santos & The Anh Han, 2023. "Does Spending More Always Ensure Higher Cooperation? An Analysis of Institutional Incentives on Heterogeneous Networks," Dynamic Games and Applications, Springer, vol. 13(4), pages 1236-1255, December.
    8. Ding, Hong & Zhang, Geng-shun & Wang, Shi-hao & Li, Juan & Wang, Zhen, 2019. "Q-learning boosts the evolution of cooperation in structured population by involving extortion," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 536(C).
    9. Cimpeanu, Theodor & Di Stefano, Alessandro & Perret, Cedric & Han, The Anh, 2023. "Social diversity reduces the complexity and cost of fostering fairness," Chaos, Solitons & Fractals, Elsevier, vol. 167(C).
    10. Du, Wen-Bo & Zheng, Hao-Ran & Hu, Mao-Bin, 2008. "Evolutionary prisoner’s dilemma game on weighted scale-free networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 387(14), pages 3796-3800.
    11. Wang, Shengxian & Chen, Xiaojie & Xiao, Zhilong & Szolnoki, Attila, 2022. "Decentralized incentives for general well-being in networked public goods game," Applied Mathematics and Computation, Elsevier, vol. 431(C).
    12. Wang, Hanchen & Sun, Yichun & Zheng, Lei & Du, Wenbo & Li, Yumeng, 2018. "The public goods game on scale-free networks with heterogeneous investment," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 509(C), pages 396-404.
    13. Lee, Hsuan-Wei & Cleveland, Colin & Szolnoki, Attila, 2022. "Mercenary punishment in structured populations," Applied Mathematics and Computation, Elsevier, vol. 417(C).
    14. Marco Alberto Javarone & Daniele Marinazzo, 2017. "Evolutionary dynamics of group formation," PLOS ONE, Public Library of Science, vol. 12(11), pages 1-10, November.
    15. David Silver & Julian Schrittwieser & Karen Simonyan & Ioannis Antonoglou & Aja Huang & Arthur Guez & Thomas Hubert & Lucas Baker & Matthew Lai & Adrian Bolton & Yutian Chen & Timothy Lillicrap & Fan , 2017. "Mastering the game of Go without human knowledge," Nature, Nature, vol. 550(7676), pages 354-359, October.
    16. Wang, Chaoqian & Szolnoki, Attila, 2023. "Inertia in spatial public goods games under weak selection," Applied Mathematics and Computation, Elsevier, vol. 449(C).
    17. Geng, Yini & Liu, Yifan & Lu, Yikang & Shen, Chen & Shi, Lei, 2022. "Reinforcement learning explains various conditional cooperation," Applied Mathematics and Computation, Elsevier, vol. 427(C).
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Bai, Xi & Ye, Ye & Chen, Tong & Xie, Nenggang, 2024. "The evolutionary game of emotions considering the influence of reputation," Applied Mathematics and Computation, Elsevier, vol. 474(C).
    2. Zhang, Qianwei & Tang, Rui & Lu, Yilun & Wang, Xinyu, 2024. "The impact of anxiety on cooperative behavior: A network evolutionary game theory approach," Applied Mathematics and Computation, Elsevier, vol. 474(C).
    3. Dai, Hui & Wang, Xiaoyue & Lu, Yikang & Hou, Yunxiang & Shi, Lei, 2024. "The effect of intraspecific cooperation in a three-species cyclic predator-prey model," Applied Mathematics and Computation, Elsevier, vol. 470(C).
    4. Wang, Chengjie & Deng, Juan & Zhao, Hui & Li, Li, 2024. "Effect of Q-learning on the evolution of cooperation behavior in collective motion: An improved Vicsek model," Applied Mathematics and Computation, Elsevier, vol. 482(C).
    5. Yang, Guoli & Wu, Yu'e & Cavaliere, Matteo, 2024. "Information-driven cooperation on adaptive cyber-physical systems," Applied Mathematics and Computation, Elsevier, vol. 466(C).
    6. Yang, Qianxi & Yang, Yanlong, 2024. "A social monitoring mechanism for third-party judges promotes cooperation in evolutionary games," Applied Mathematics and Computation, Elsevier, vol. 483(C).
    7. Pi, Jinxiu & Wang, Chun & Zhou, Die & Tang, Wei & Yang, Guanghui, 2024. "Evolutionary dynamics of N-person snowdrift game with two thresholds in well-mixed and structured populations," Chaos, Solitons & Fractals, Elsevier, vol. 180(C).
    8. Xu, Wei & Li, Dandan & Han, Dun & Sun, Mei, 2024. "The impact of relationship stickiness and memory on the evolution of individual behavior," Chaos, Solitons & Fractals, Elsevier, vol. 183(C).
    9. Zhang, Huizhen & An, Tianbo & Yan, Pingping & Hu, Kaipeng & An, Jinjin & Shi, Lijuan & Zhao, Jian & Wang, Jingrui, 2024. "Exploring cooperative evolution with tunable payoff’s loners using reinforcement learning," Chaos, Solitons & Fractals, Elsevier, vol. 178(C).
    10. He, Jialu & Cui, Lei, 2024. "The persistence-based game transition resolves the social dilemma," Applied Mathematics and Computation, Elsevier, vol. 477(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wang, Chengjie & Deng, Juan & Zhao, Hui & Li, Li, 2024. "Effect of Q-learning on the evolution of cooperation behavior in collective motion: An improved Vicsek model," Applied Mathematics and Computation, Elsevier, vol. 482(C).
    2. Sun, Jiaqin & Fan, Ruguo & Luo, Ming & Zhang, Yingqing & Dong, Lili, 2018. "The evolution of cooperation in spatial prisoner’s dilemma game with dynamic relationship-based preferential learning," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 512(C), pages 598-611.
    3. Huang, Chaochao & Wang, Chaoqian, 2024. "Memory-based involution dilemma on square lattices," Chaos, Solitons & Fractals, Elsevier, vol. 178(C).
    4. Lee, Hsuan-Wei & Cleveland, Colin & Szolnoki, Attila, 2023. "Group-size dependent synergy in heterogeneous populations," Chaos, Solitons & Fractals, Elsevier, vol. 167(C).
    5. Yang, Han-Xin & Yang, Jing, 2019. "Reputation-based investment strategy promotes cooperation in public goods games," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 523(C), pages 886-893.
    6. Molnar, Grant & Hammond, Caroline & Fu, Feng, 2023. "Reactive means in the iterated Prisoner’s dilemma," Applied Mathematics and Computation, Elsevier, vol. 458(C).
    7. Wang, Jianwei & Dai, Wenhui & Zheng, Yanfeng & Yu, Fengyuan & Chen, Wei & Xu, Wenshu, 2024. "Partial intervention promotes cooperation and social welfare in regional public goods game," Chaos, Solitons & Fractals, Elsevier, vol. 184(C).
    8. Chen, Wei & Wang, Jianwei & Yu, Fengyuan & Xu, Wenshu & Dai, Wenhui, 2024. "Heterogeneous interaction radius based on emotional dynamics can promote cooperation in spatial public goods games," Applied Mathematics and Computation, Elsevier, vol. 473(C).
    9. Yunsheng Deng & Jihui Zhang, 2022. "The choice-decision based on memory and payoff favors cooperation in stag hunt game on interdependent networks," The European Physical Journal B: Condensed Matter and Complex Systems, Springer;EDP Sciences, vol. 95(2), pages 1-13, February.
    10. Ping Zhu & Guiyi Wei, 2014. "Stochastic Heterogeneous Interaction Promotes Cooperation in Spatial Prisoner's Dilemma Game," PLOS ONE, Public Library of Science, vol. 9(4), pages 1-10, April.
    11. Qinghu Liao & Wenwen Dong & Boxin Zhao, 2023. "A New Strategy to Solve “the Tragedy of the Commons” in Sustainable Grassland Ecological Compensation: Experience from Inner Mongolia, China," Sustainability, MDPI, vol. 15(12), pages 1-24, June.
    12. Chen, Wei & Wang, Jianwei & Yu, Fengyuan & He, Jialu & Xu, Wenshu & Dai, Wenhui, 2024. "Successful initial positioning of non-cooperative individuals in cooperative populations effectively hinders cooperation prosperity," Applied Mathematics and Computation, Elsevier, vol. 462(C).
    13. Zou, Kuan & Huang, Changwei, 2024. "Incorporating reputation into reinforcement learning can promote cooperation on hypergraphs," Chaos, Solitons & Fractals, Elsevier, vol. 186(C).
    14. Yu, Fengyuan & Wang, Jianwei & Chen, Wei & He, Jialu, 2023. "Increased cooperation potential and risk under suppressed strategy differentiation," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 621(C).
    15. Zou, Kuan & Han, Wenchen & Zhang, Lan & Huang, Changwei, 2024. "The spatial public goods game on hypergraphs with heterogeneous investment," Applied Mathematics and Computation, Elsevier, vol. 466(C).
    16. Bossert, Leonie & Hagendorff, Thilo, 2021. "Animals and AI. The role of animals in AI research and application – An overview and ethical evaluation," Technology in Society, Elsevier, vol. 67(C).
    17. Dong, Yukun & Xu, Hedong & Fan, Suohai, 2019. "Memory-based stag hunt game on regular lattices," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 519(C), pages 247-255.
    18. Zha, Jiajing & Li, Cong & Fan, Suohai, 2022. "The effect of stability-based strategy updating on cooperation in evolutionary social dilemmas," Applied Mathematics and Computation, Elsevier, vol. 413(C).
    19. Griffin, Christopher & Semonsen, Justin & Belmonte, Andrew, 2022. "Generalized Hamiltonian dynamics and chaos in evolutionary games on networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 597(C).
    20. Ding, Zhen-Wei & Zhang, Ji-Qiang & Zheng, Guo-Zhong & Cai, Wei-Ran & Cai, Chao-Ran & Chen, Li & Wang, Xu-Ming, 2024. "Emergence of anti-coordinated patterns in snowdrift game by reinforcement learning," Chaos, Solitons & Fractals, Elsevier, vol. 184(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:apmaco:v:463:y:2024:i:c:s0096300323005337. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: https://www.journals.elsevier.com/applied-mathematics-and-computation .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.