Interaction state Q-learning promotes cooperation in the spatial prisoner's dilemma game

My bibliography Save this article

Interaction state Q-learning promotes cooperation in the spatial prisoner's dilemma game

Author

Listed:

Yang, Zhengzhi
Zheng, Lei
Perc, Matjaž
Li, Yumeng

Registered:

Abstract

Many recent studies have used reinforcement learning methods to investigate the behavior of agents in evolutionary games. Q-learning, in particular, has become a mainstream method during this development. Here we introduce Q-learning agents into the evolutionary prisoner's dilemma game on a square lattice. Specifically, we associate the state space of Q-learning agents with the strategies of their neighbors, and we introduce a neighboring reward information sharing mechanism. We thus provide Q-learning agents with the payoff information of their neighbors, in addition to their strategies, which has not been done in previous studies. Through simulations, we show that considering neighborhood payoff information can significantly promote cooperation in the population. Moreover, we show that for an appropriate strength of neighborhood payoff information sharing, a chessboard pattern emerges on the lattice. We analyze in detail the reasons for the emergence of the chessboard pattern and the increase in cooperation frequency, and we also provide a theoretical analysis based on the pair approximation method. We hope that our research will inspire effective approaches for resolving social dilemmas by means of sharing more information among reinforcement learning agents during evolutionary games.

Suggested Citation

Yang, Zhengzhi & Zheng, Lei & Perc, Matjaž & Li, Yumeng, 2024. "Interaction state Q-learning promotes cooperation in the spatial prisoner's dilemma game," Applied Mathematics and Computation, Elsevier, vol. 463(C).

Handle: RePEc:eee:apmaco:v:463:y:2024:i:c:s0096300323005337
DOI: 10.1016/j.amc.2023.128364

Download full text from publisher

As the access to this document is restricted, you may want to search for a different version of it.

References listed on IDEAS

Cao, Xian-Bin & Du, Wen-Bo & Rong, Zhi-Hai, 2010. "The evolutionary public goods game on scale-free networks with heterogeneous investment," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 389(6), pages 1273-1280.
Takahiro Ezaki & Yutaka Horita & Masanori Takezawa & Naoki Masuda, 2016. "Reinforcement Learning Explains Conditional Cooperation and Its Moody Cousin," PLOS Computational Biology, Public Library of Science, vol. 12(7), pages 1-13, July.
Ding, Hong & Zhang, Geng-shun & Wang, Shi-hao & Li, Juan & Wang, Zhen, 2019. "Q-learning boosts the evolution of cooperation in structured population by involving extortion," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 536(C).
Du, Wen-Bo & Zheng, Hao-Ran & Hu, Mao-Bin, 2008. "Evolutionary prisoner’s dilemma game on weighted scale-free networks," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 387(14), pages 3796-3800.
Wang, Hanchen & Sun, Yichun & Zheng, Lei & Du, Wenbo & Li, Yumeng, 2018. "The public goods game on scale-free networks with heterogeneous investment," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 509(C), pages 396-404.
Marco Alberto Javarone & Daniele Marinazzo, 2017. "Evolutionary dynamics of group formation," PLOS ONE, Public Library of Science, vol. 12(11), pages 1-10, November.
David Silver & Julian Schrittwieser & Karen Simonyan & Ioannis Antonoglou & Aja Huang & Arthur Guez & Thomas Hubert & Lucas Baker & Matthew Lai & Adrian Bolton & Yutian Chen & Timothy Lillicrap & Fan , 2017. "Mastering the game of Go without human knowledge," Nature, Nature, vol. 550(7676), pages 354-359, October.
Wang, Chaoqian & Szolnoki, Attila, 2023. "Inertia in spatial public goods games under weak selection," Applied Mathematics and Computation, Elsevier, vol. 449(C).
Christoph Hauert & Michael Doebeli, 2004. "Spatial structure often inhibits the evolution of cooperation in the snowdrift game," Nature, Nature, vol. 428(6983), pages 643-646, April.
Li, Yumeng & Zhang, Jun & Perc, Matjaž, 2018. "Effects of compassion on the evolution of cooperation in spatial social dilemmas," Applied Mathematics and Computation, Elsevier, vol. 320(C), pages 437-443.
Li, Yumeng & Wang, Hanchen & Du, Wenbo & Perc, Matjaž & Cao, Xianbin & Zhang, Jun, 2019. "Resonance-like cooperation due to transaction costs in the prisoner’s dilemma game," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 521(C), pages 248-257.
Oriol Vinyals & Igor Babuschkin & Wojciech M. Czarnecki & Michaël Mathieu & Andrew Dudzik & Junyoung Chung & David H. Choi & Richard Powell & Timo Ewalds & Petko Georgiev & Junhyuk Oh & Dan Horgan & M, 2019. "Grandmaster level in StarCraft II using multi-agent reinforcement learning," Nature, Nature, vol. 575(7782), pages 350-354, November.
Theodor Cimpeanu & Francisco C. Santos & The Anh Han, 2023. "Does Spending More Always Ensure Higher Cooperation? An Analysis of Institutional Incentives on Heterogeneous Networks," Dynamic Games and Applications, Springer, vol. 13(4), pages 1236-1255, December.
Cimpeanu, Theodor & Di Stefano, Alessandro & Perret, Cedric & Han, The Anh, 2023. "Social diversity reduces the complexity and cost of fostering fairness," Chaos, Solitons & Fractals, Elsevier, vol. 167(C).
Wang, Shengxian & Chen, Xiaojie & Xiao, Zhilong & Szolnoki, Attila, 2022. "Decentralized incentives for general well-being in networked public goods game," Applied Mathematics and Computation, Elsevier, vol. 431(C).
Lee, Hsuan-Wei & Cleveland, Colin & Szolnoki, Attila, 2022. "Mercenary punishment in structured populations," Applied Mathematics and Computation, Elsevier, vol. 417(C).
Geng, Yini & Liu, Yifan & Lu, Yikang & Shen, Chen & Shi, Lei, 2022. "Reinforcement learning explains various conditional cooperation," Applied Mathematics and Computation, Elsevier, vol. 427(C).

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Zhang, Yali & Lu, Yikang & Jin, Haoyu & Dong, Yuting & Du, Chunpeng & Shi, Lei, 2024. "The impact of dynamic reward on cooperation in the spatial public goods game," Chaos, Solitons & Fractals, Elsevier, vol. 187(C).
Xu, Wei & Li, Dandan & Han, Dun & Sun, Mei, 2024. "The impact of relationship stickiness and memory on the evolution of individual behavior," Chaos, Solitons & Fractals, Elsevier, vol. 183(C).
Bai, Xi & Ye, Ye & Chen, Tong & Xie, Nenggang, 2024. "The evolutionary game of emotions considering the influence of reputation," Applied Mathematics and Computation, Elsevier, vol. 474(C).
Zhang, Qianwei & Tang, Rui & Lu, Yilun & Wang, Xinyu, 2024. "The impact of anxiety on cooperative behavior: A network evolutionary game theory approach," Applied Mathematics and Computation, Elsevier, vol. 474(C).
Dai, Hui & Wang, Xiaoyue & Lu, Yikang & Hou, Yunxiang & Shi, Lei, 2024. "The effect of intraspecific cooperation in a three-species cyclic predator-prey model," Applied Mathematics and Computation, Elsevier, vol. 470(C).
Wang, Chengjie & Deng, Juan & Zhao, Hui & Li, Li, 2024. "Effect of Q-learning on the evolution of cooperation behavior in collective motion: An improved Vicsek model," Applied Mathematics and Computation, Elsevier, vol. 482(C).
Zhang, Huizhen & An, Tianbo & Yan, Pingping & Hu, Kaipeng & An, Jinjin & Shi, Lijuan & Zhao, Jian & Wang, Jingrui, 2024. "Exploring cooperative evolution with tunable payoff’s loners using reinforcement learning," Chaos, Solitons & Fractals, Elsevier, vol. 178(C).
Yang, Guoli & Wu, Yu'e & Cavaliere, Matteo, 2024. "Information-driven cooperation on adaptive cyber-physical systems," Applied Mathematics and Computation, Elsevier, vol. 466(C).
He, Jialu & Cui, Lei, 2024. "The persistence-based game transition resolves the social dilemma," Applied Mathematics and Computation, Elsevier, vol. 477(C).
Yang, Qianxi & Yang, Yanlong, 2024. "A social monitoring mechanism for third-party judges promotes cooperation in evolutionary games," Applied Mathematics and Computation, Elsevier, vol. 483(C).
Pi, Jinxiu & Wang, Chun & Zhou, Die & Tang, Wei & Yang, Guanghui, 2024. "Evolutionary dynamics of N-person snowdrift game with two thresholds in well-mixed and structured populations," Chaos, Solitons & Fractals, Elsevier, vol. 180(C).

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Wang, Chengjie & Deng, Juan & Zhao, Hui & Li, Li, 2024. "Effect of Q-learning on the evolution of cooperation behavior in collective motion: An improved Vicsek model," Applied Mathematics and Computation, Elsevier, vol. 482(C).
Sun, Jiaqin & Fan, Ruguo & Luo, Ming & Zhang, Yingqing & Dong, Lili, 2018. "The evolution of cooperation in spatial prisoner’s dilemma game with dynamic relationship-based preferential learning," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 512(C), pages 598-611.
Huang, Chaochao & Wang, Chaoqian, 2024. "Memory-based involution dilemma on square lattices," Chaos, Solitons & Fractals, Elsevier, vol. 178(C).
Lee, Hsuan-Wei & Cleveland, Colin & Szolnoki, Attila, 2023. "Restoring spatial cooperation with myopic agents in a three-strategy social dilemma," Applied Mathematics and Computation, Elsevier, vol. 458(C).
Yang, Han-Xin & Yang, Jing, 2019. "Reputation-based investment strategy promotes cooperation in public goods games," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 523(C), pages 886-893.
Wang, Jianwei & Dai, Wenhui & Zheng, Yanfeng & Yu, Fengyuan & Chen, Wei & Xu, Wenshu, 2024. "Partial intervention promotes cooperation and social welfare in regional public goods game," Chaos, Solitons & Fractals, Elsevier, vol. 184(C).
Chen, Wei & Wang, Jianwei & Yu, Fengyuan & Xu, Wenshu & Dai, Wenhui, 2024. "Heterogeneous interaction radius based on emotional dynamics can promote cooperation in spatial public goods games," Applied Mathematics and Computation, Elsevier, vol. 473(C).
Lee, Hsuan-Wei & Cleveland, Colin & Szolnoki, Attila, 2023. "Group-size dependent synergy in heterogeneous populations," Chaos, Solitons & Fractals, Elsevier, vol. 167(C).
Molnar, Grant & Hammond, Caroline & Fu, Feng, 2023. "Reactive means in the iterated Prisoner’s dilemma," Applied Mathematics and Computation, Elsevier, vol. 458(C).
Ping Zhu & Guiyi Wei, 2014. "Stochastic Heterogeneous Interaction Promotes Cooperation in Spatial Prisoner's Dilemma Game," PLOS ONE, Public Library of Science, vol. 9(4), pages 1-10, April.
Qinghu Liao & Wenwen Dong & Boxin Zhao, 2023. "A New Strategy to Solve “the Tragedy of the Commons” in Sustainable Grassland Ecological Compensation: Experience from Inner Mongolia, China," Sustainability, MDPI, vol. 15(12), pages 1-24, June.
Chen, Wei & Wang, Jianwei & Yu, Fengyuan & He, Jialu & Xu, Wenshu & Dai, Wenhui, 2024. "Successful initial positioning of non-cooperative individuals in cooperative populations effectively hinders cooperation prosperity," Applied Mathematics and Computation, Elsevier, vol. 462(C).
Yu, Fengyuan & Wang, Jianwei & Chen, Wei & He, Jialu, 2023. "Increased cooperation potential and risk under suppressed strategy differentiation," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 621(C).
Zha, Jiajing & Li, Cong & Fan, Suohai, 2022. "The effect of stability-based strategy updating on cooperation in evolutionary social dilemmas," Applied Mathematics and Computation, Elsevier, vol. 413(C).
Lee, Hsuan-Wei & Cleveland, Colin & Szolnoki, Attila, 2021. "Small fraction of selective cooperators can elevate general wellbeing significantly," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 582(C).
Quan, Ji & Zhou, Yawen & Wang, Xianjia & Yang, Jian-Bo, 2020. "Information fusion based on reputation and payoff promotes cooperation in spatial public goods game," Applied Mathematics and Computation, Elsevier, vol. 368(C).
Chen, Qiao & Chen, Tong & Wang, Yongjie, 2019. "Cleverly handling the donation information can promote cooperation in public goods game," Applied Mathematics and Computation, Elsevier, vol. 346(C), pages 363-373.
Zhu, Wenqiang & Pan, Qiuhui & Song, Sha & He, Mingfeng, 2023. "Effects of exposure-based reward and punishment on the evolution of cooperation in prisoner’s dilemma game," Chaos, Solitons & Fractals, Elsevier, vol. 172(C).
Zhou, Zhizhuo & Rong, Zhihai & Yang, Wen & Wu, Zhi-Xi, 2024. "Coevolution of extortion strategies with mixed imitation and aspiration learning dynamics in spatial Prisoner’s Dilemma game," Chaos, Solitons & Fractals, Elsevier, vol. 188(C).
Li, Wenqing & Ni, Shaoquan, 2022. "Train timetabling with the general learning environment and multi-agent deep reinforcement learning," Transportation Research Part B: Methodological, Elsevier, vol. 157(C), pages 230-251.

More about this item

Keywords

Evolutionary games; Cooperation; Prisoner's dilemma game; Reinforcement learning;
All these keywords.

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:apmaco:v:463:y:2024:i:c:s0096300323005337. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: https://www.journals.elsevier.com/applied-mathematics-and-computation .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Interaction state Q-learning promotes cooperation in the spatial prisoner's dilemma game

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data