Exploring cooperative evolution with tunable payoff’s loners using reinforcement learning

My bibliography Save this article

Exploring cooperative evolution with tunable payoff’s loners using reinforcement learning

Author

Listed:

Zhang, Huizhen
An, Tianbo
Yan, Pingping
Hu, Kaipeng
An, Jinjin
Shi, Lijuan
Zhao, Jian
Wang, Jingrui

Registered:

Abstract

Imitation and replication have emerged as a paradigm in numerous studies that explore the evolution of cooperative behavior. Since they embrace the essence of natural selection, it is widely recognized in exploring the evolution of biological behaviors. However, it is not easy to express the way individuals select and optimize in these simple and elegant ways in the complex and variable interactive environments. Currently, reinforcement learning is widely used in the study of strategy updating dynamics and agent learning processes in game theory. Therefore, we introduce the Q-learning algorithms into the voluntary public goods game to explore the impact of cooperative evolution. Simulation results demonstrate that when the synergy factor is large and the adjust loner payoff’s multiply factor is smaller, the number of cooperators becomes gradually consistent. As the synergy factor increases, the evolution of the proportion of defectors become nonlinear. Furthermore, we further explore the Q-table and strategy updating processes of agents in the steady state under a smaller multiply factor that adjusts the loner’s payoff. The results find inconsistency between the average Q-values and the steady state population strategy distribution. Subsequently, we explain the reason for the inconsistency by analyzing strategy sequence, namely that there are a number of agents who constantly change strategies in the population, and the Q-values of these agents have an impact on the overall Q-values. In addition, evolutionary snapshots of agent strategy sequences are observed. The results find that the agent’s strategic selection shows greater instability when the proportions of cooperators, defectors, and loners in the population are relatively balanced. Finally, the effect of parameters in the Q-learning algorithm on cooperative behavior is analyzed. This study hopes to provide valuable insights into understanding the dynamics of cooperation in complex social interactions.

Suggested Citation

Zhang, Huizhen & An, Tianbo & Yan, Pingping & Hu, Kaipeng & An, Jinjin & Shi, Lijuan & Zhao, Jian & Wang, Jingrui, 2024. "Exploring cooperative evolution with tunable payoff’s loners using reinforcement learning," Chaos, Solitons & Fractals, Elsevier, vol. 178(C).

Handle: RePEc:eee:chsofr:v:178:y:2024:i:c:s0960077923012602
DOI: 10.1016/j.chaos.2023.114358

Download full text from publisher

As the access to this document is restricted, you may want to search for a different version of it.

References listed on IDEAS

Cui, Guang-Hai & Wang, Zhen & Li, Jun-Li & Jin, Xing & Zhang, Zhi-Wang, 2021. "Influence of precaution and dynamic post-indemnity based insurance policy on controlling the propagation of epidemic security risks in networks," Applied Mathematics and Computation, Elsevier, vol. 392(C).
Gabriela Koľveková & Manuela Raisová & Martin Zoričak & Vladimír Gazda, 2021. "Endogenous Shared Punishment Model in Threshold Public Goods Games," Computational Economics, Springer;Society for Computational Economics, vol. 58(1), pages 57-81, June.
Pan, Jianchen & Zhang, Lan & Han, Wenchen & Huang, Changwei, 2023. "Heterogeneous investment promotes cooperation in spatial public goods game on hypergraphs," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 609(C).
Jingrui Wang & Xing Jin & Yixuan Yang & Qingfang Chen & Zhen Wang & Hong Ding, 2021. "The spread of epidemic under voluntary vaccination with heterogeneous infection rates," International Journal of Modern Physics C (IJMPC), World Scientific Publishing Co. Pte. Ltd., vol. 32(03), pages 1-14, March.
Wang, Jingrui & Zhang, Huizhen & Jin, Xing & Ma, Leyu & Chen, Yueren & Wang, Chao & Zhao, Jian & An, Tianbo, 2023. "Subsidy policy with punishment mechanism can promote voluntary vaccination behaviors in structured populations," Chaos, Solitons & Fractals, Elsevier, vol. 174(C).
Yang, Zhengzhi & Zheng, Lei & Perc, Matjaž & Li, Yumeng, 2024. "Interaction state Q-learning promotes cooperation in the spatial prisoner's dilemma game," Applied Mathematics and Computation, Elsevier, vol. 463(C).
Huang, Changwei & Hou, Yongzhao & Han, Wenchen, 2023. "Coevolution of consensus and cooperation in evolutionary Hegselmann–Krause dilemma with the cooperation cost," Chaos, Solitons & Fractals, Elsevier, vol. 168(C).
Jia, Danyang & Li, Tong & Zhao, Yang & Zhang, Xiaoqin & Wang, Zhen, 2022. "Empty nodes affect conditional cooperation under reinforcement learning," Applied Mathematics and Computation, Elsevier, vol. 413(C).
Jin, Xing & Tao, Yuchen & Wang, Jingrui & Wang, Chao & Wang, Yongheng & Zhang, Zhouyang & Wang, Zhen, 2023. "Strategic use of payoff information in k-hop evolutionary Best-shot networked public goods game," Applied Mathematics and Computation, Elsevier, vol. 459(C).
Gao, Bo & Liu, Xuan & Lan, Zhong-Zhou & Hong, Jie & Zhang, Wenguang, 2021. "The evolution of cooperation with preferential selection in voluntary public goods game," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 584(C).
Quan, Ji & Yu, Junyu & Li, Xia & Wang, Xianjia, 2023. "Conditional switching between social excluders and loners promotes cooperation in spatial public goods game," Chaos, Solitons & Fractals, Elsevier, vol. 169(C).
Haozheng Xu & Yiwen Zhang & Xing Jin & Jingrui Wang & Zhen Wang, 2023. "The Evolution of Cooperation in Multigames with Uniform Random Hypergraphs," Mathematics, MDPI, vol. 11(11), pages 1-11, May.
Lee, Hsuan-Wei & Cleveland, Colin & Szolnoki, Attila, 2022. "Mercenary punishment in structured populations," Applied Mathematics and Computation, Elsevier, vol. 417(C).

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Zheng, Guozhong & Zhang, Jiqiang & Deng, Shengfeng & Cai, Weiran & Chen, Li, 2024. "Evolution of cooperation in the public goods game with Q-learning," Chaos, Solitons & Fractals, Elsevier, vol. 188(C).
Yuanguo Lin & Fan Lin & Guorong Cai & Hong Chen & Linxin Zou & Yunxuan Liu & Pengcheng Wu, 2025. "Evolutionary Reinforcement Learning: A Systematic Review and Future Directions," Mathematics, MDPI, vol. 13(5), pages 1-33, March.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Wang, Jingrui & Zhang, Huizhen & Jin, Xing & Ma, Leyu & Chen, Yueren & Wang, Chao & Zhao, Jian & An, Tianbo, 2023. "Subsidy policy with punishment mechanism can promote voluntary vaccination behaviors in structured populations," Chaos, Solitons & Fractals, Elsevier, vol. 174(C).
Wang, Chengjie & Deng, Juan & Zhao, Hui & Li, Li, 2024. "Effect of Q-learning on the evolution of cooperation behavior in collective motion: An improved Vicsek model," Applied Mathematics and Computation, Elsevier, vol. 482(C).
Wang, Xianjia & Yang, Zhipeng & Liu, Yanli & Chen, Guici, 2023. "A reinforcement learning-based strategy updating model for the cooperative evolution," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 618(C).
Cui, Guang-Hai & Li, Jun-Li & Dong, Kun-Xiang & Jin, Xing & Yang, Hong-Yong & Wang, Zhen, 2024. "Influence of subsidy policies against insurances on controlling the propagation of epidemic security risks in networks," Applied Mathematics and Computation, Elsevier, vol. 476(C).
Bai, Pengzhou & Qiang, Bingzhuang & Zou, Kuan & Huang, Changwei, 2024. "Preferential selection based on adaptive attractiveness induce by reinforcement learning promotes cooperation," Chaos, Solitons & Fractals, Elsevier, vol. 180(C).
Zhang, Yao & Hao, Qing-Yi & Qian, Jia-Li & Wu, Chao-Yun & Bi, Yan, 2024. "The influence of the heterogeneities of social institutions and individuals’ tendency to establish social institutions on cooperation," Chaos, Solitons & Fractals, Elsevier, vol. 186(C).
Liu, Zhuo & Wang, Juan & Li, Xiaopeng, 2024. "Evolutionary dynamics of networked N-player trust games with exclusion strategy," Chaos, Solitons & Fractals, Elsevier, vol. 186(C).
Lee, Hsuan-Wei & Cleveland, Colin & Szolnoki, Attila, 2024. "Supporting punishment via taxation in a structured population," Chaos, Solitons & Fractals, Elsevier, vol. 178(C).
Yang, Zhengzhi & Zheng, Lei & Perc, Matjaž & Li, Yumeng, 2024. "Interaction state Q-learning promotes cooperation in the spatial prisoner's dilemma game," Applied Mathematics and Computation, Elsevier, vol. 463(C).
Lee, Hsuan-Wei & Cleveland, Colin & Szolnoki, Attila, 2024. "Suppressing defection by increasing temptation: The impact of smart cooperators on a social dilemma situation," Applied Mathematics and Computation, Elsevier, vol. 479(C).
Dai, Hui & Wang, Xiaoyue & Lu, Yikang & Hou, Yunxiang & Shi, Lei, 2024. "The effect of intraspecific cooperation in a three-species cyclic predator-prey model," Applied Mathematics and Computation, Elsevier, vol. 470(C).
Yang, Guoli & Wu, Yu'e & Cavaliere, Matteo, 2024. "Information-driven cooperation on adaptive cyber-physical systems," Applied Mathematics and Computation, Elsevier, vol. 466(C).
Wang, Jianwei & Xu, Wenshu & Yu, Fengyuan & He, Jialu & Chen, Wei & Dai, Wenhui, 2024. "Evolution of cooperation under corrupt institutions," Chaos, Solitons & Fractals, Elsevier, vol. 184(C).
Yong Shen & Jin Guo & Hongwei Kang, 2024. "The Influence of Fine Distribution and Compensation on Cooperation in Public Goods Game," Mathematics, MDPI, vol. 12(24), pages 1-21, December.
Du, Chunpeng & Guo, Keyu & Lu, Yikang & Jin, Haoyu & Shi, Lei, 2023. "Aspiration driven exit-option resolves social dilemmas in the network," Applied Mathematics and Computation, Elsevier, vol. 438(C).
Yan, Zeyuan & Zhao, Hui & Liang, Shu & Li, Li & Song, Yanjie, 2024. "Inter-layer feedback mechanism with reinforcement learning boosts the evolution of cooperation in multilayer network," Chaos, Solitons & Fractals, Elsevier, vol. 185(C).
Zhu, Wenqiang & Pan, Qiuhui & Song, Sha & He, Mingfeng, 2023. "Effects of exposure-based reward and punishment on the evolution of cooperation in prisoner’s dilemma game," Chaos, Solitons & Fractals, Elsevier, vol. 172(C).
Ding, Zhen-Wei & Zheng, Guo-Zhong & Cai, Chao-Ran & Cai, Wei-Ran & Chen, Li & Zhang, Ji-Qiang & Wang, Xu-Ming, 2023. "Emergence of cooperation in two-agent repeated games with reinforcement learning," Chaos, Solitons & Fractals, Elsevier, vol. 175(P1).
Zhou, Zhizhuo & Rong, Zhihai & Yang, Wen & Wu, Zhi-Xi, 2024. "Coevolution of extortion strategies with mixed imitation and aspiration learning dynamics in spatial Prisoner’s Dilemma game," Chaos, Solitons & Fractals, Elsevier, vol. 188(C).
Lee, Hsuan-Wei & Cleveland, Colin & Szolnoki, Attila, 2023. "Restoring spatial cooperation with myopic agents in a three-strategy social dilemma," Applied Mathematics and Computation, Elsevier, vol. 458(C).

More about this item

Keywords

Public goods game; Self-regarding Q-learning; Human cooperation; Loner;
All these keywords.

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:chsofr:v:178:y:2024:i:c:s0960077923012602. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Thayer, Thomas R. (email available below). General contact details of provider: https://www.journals.elsevier.com/chaos-solitons-and-fractals .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Exploring cooperative evolution with tunable payoff’s loners using reinforcement learning

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data