IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2410.21359.html
   My bibliography  Save this paper

Can Machines Think Like Humans? A Behavioral Evaluation of LLM-Agents in Dictator Games

Author

Listed:
  • Ji Ma

Abstract

As Large Language Model (LLM)-based agents increasingly undertake real-world tasks and engage with human society, how well do we understand their behaviors? We (1) investigate how LLM agents' prosocial behaviors -- a fundamental social norm -- can be induced by different personas and benchmarked against human behaviors; and (2) introduce a behavioral and social science approach to evaluate LLM agents' decision-making. We explored how different personas and experimental framings affect these AI agents' altruistic behavior in dictator games and compared their behaviors within the same LLM family, across various families, and with human behaviors. The findings reveal substantial variations and inconsistencies among LLMs and notable differences compared to human behaviors. Merely assigning a human-like identity to LLMs does not produce human-like behaviors. Despite being trained on extensive human-generated data, these AI agents are unable to capture the internal processes of human decision-making. Their alignment with human is highly variable and dependent on specific model architectures and prompt formulations; even worse, such dependence does not follow a clear pattern. LLMs can be useful task-specific tools but are not yet intelligent human-like agents.

Suggested Citation

  • Ji Ma, 2024. "Can Machines Think Like Humans? A Behavioral Evaluation of LLM-Agents in Dictator Games," Papers 2410.21359, arXiv.org, revised Dec 2024.
  • Handle: RePEc:arx:papers:2410.21359
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2410.21359
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Christoph Engel, 2011. "Dictator games: a meta study," Experimental Economics, Springer;Economic Science Association, vol. 14(4), pages 583-610, November.
    2. Nicholas Bardsley, 2008. "Dictator game giving: altruism or artefact?," Experimental Economics, Springer;Economic Science Association, vol. 11(2), pages 122-133, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wendelin Schnedler & Nina Lucia Stephan, 2020. "Revisiting a Remedy Against Chains of Unkindness," Schmalenbach Business Review, Springer;Schmalenbach-Gesellschaft, vol. 72(3), pages 347-364, July.
    2. Ertac, Seda & Gumren, Mert & Gurdal, Mehmet Y., 2020. "Demand for decision autonomy and the desire to avoid responsibility in risky environments: Experimental evidence," Journal of Economic Psychology, Elsevier, vol. 77(C).
    3. Emin Karagözoğlu & Elif Tosun, 2022. "Endogenous Game Choice and Giving Behavior in Distribution Games," Games, MDPI, vol. 13(6), pages 1-32, November.
    4. James C. Cox & Vjollca Sadiraj, 2018. "Incentives," Experimental Economics Center Working Paper Series 2018-01, Experimental Economics Center, Andrew Young School of Policy Studies, Georgia State University.
    5. Ortiz-Riomalo, Juan Felipe & Koessler, Ann-Kathrin & Engel, Stefanie, 2021. "Inducing perspective-taking for prosocial behaviour in natural resource management," Journal of Environmental Economics and Management, Elsevier, vol. 110(C).
    6. Carpenter, Jeffrey & Robbett, Andrea, 2024. "Measuring socially appropriate social preferences," Games and Economic Behavior, Elsevier, vol. 147(C), pages 517-532.
    7. Breitmoser, Yves, 2016. "Stochastic choice, systematic mistakes and preference estimation," MPRA Paper 72779, University Library of Munich, Germany.
    8. Andreas Bergh & Philipp Christoph Wichardt, 2018. "Mine, Ours or Yours? Unintended Framing Effects in Dictator Games," CESifo Working Paper Series 7049, CESifo.
    9. Thorsten Chmura & Christoph Engel & Markus Englerth, 2013. "Selfishness As a Potential Cause of Crime. A Prison Experiment," Discussion Paper Series of the Max Planck Institute for Research on Collective Goods 2013_05, Max Planck Institute for Research on Collective Goods.
    10. Matteo M. Galizzi & Daniel Navarro-Martinez, 2019. "On the External Validity of Social Preference Games: A Systematic Lab-Field Study," Management Science, INFORMS, vol. 65(3), pages 976-1002, March.
    11. Kim Lehrer & Catherine Porter, 2018. "Charitable Dictators? Determinants of Giving to NGOs in Uganda," Journal of International Development, John Wiley & Sons, Ltd., vol. 30(1), pages 80-101, January.
    12. Dittrich, Dennis A.V. & Büchner, Susanne & Kulesz, Micaela M., 2015. "Dynamic repeated random dictatorship and gender discrimination," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 55(C), pages 81-90.
    13. Larney, Andrea & Rotella, Amanda & Barclay, Pat, 2019. "Stake size effects in ultimatum game and dictator game offers: A meta-analysis," Organizational Behavior and Human Decision Processes, Elsevier, vol. 151(C), pages 61-72.
    14. Argenton, Cédric & Potters, Jan & Yang, Yadi, 2023. "Receiving credit: On delegation and responsibility," European Economic Review, Elsevier, vol. 158(C).
    15. Dorian Jullien, 2018. "Under Risk, Over Time, Regarding Other People: Language and Rationality within Three Dimensions," Research in the History of Economic Thought and Methodology, in: Including a Symposium on Latin American Monetary Thought: Two Centuries in Search of Originality, volume 36, pages 119-155, Emerald Group Publishing Limited.
    16. Benistant, Julien & Suchon, Rémi, 2021. "It does (not) get better: Reference income violation and altruism," Journal of Economic Psychology, Elsevier, vol. 85(C).
    17. Subhasish M. Chowdhury & Joo Young Jeon & Bibhas Saha, 2017. "Gender Differences in the Giving and Taking Variants of the Dictator Game," Southern Economic Journal, John Wiley & Sons, vol. 84(2), pages 474-483, October.
    18. Marco Faillo & Matteo Rizzolli & Stephan Tontrup, 2016. "Thou shalt not steal (from hard-working people)An experiment on respect for property claims," Econometica Working Papers wp58, Econometica.
    19. Peter G. Moffatt & Graciela Zevallos, 2021. "A Kuhn–Tucker model for behaviour in dictator games," Journal of the Economic Science Association, Springer;Economic Science Association, vol. 7(2), pages 226-243, December.
    20. Kate Farrow & Gilles Grolleau & Lisette Ibanez, 2017. "Designing more effective norm interventions: the role of valence," Post-Print hal-01680539, HAL.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2410.21359. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.