IDEAS home Printed from https://ideas.repec.org/p/ces/ceswps/_11099.html
   My bibliography  Save this paper

Strategic Behavior and AI Training Data

Author

Listed:
  • Christian Peukert
  • Florian Abeillon
  • Jérémie Haese
  • Franziska Kaiser
  • Alexander Staub

Abstract

Human-created works represent critical data inputs to artificial intelligence (AI). Strategic behaviour can play a major role for AI training datasets, be it in limiting access to existing works or in deciding which types of new works to create or whether to create new works at all. We examine creators’ behavioral change when their works become training data for AI. Specifically, we focus on contributors on Unsplash, a popular stock image platform with about 6 million high-quality photos and illustrations. In the summer of 2020, Unsplash launched an AI research program by releasing a dataset of 25,000 images for commercial use. We study contributors’ reactions, comparing contributors whose works were included in this dataset to contributors whose works were not included. Our results suggest that treated contributors left the platform at a higher-than-usual rate and substantially slowed down the rate of new uploads. Professional and more successful photographers react stronger than amateurs and less successful photographers. We also show that affected users changed the variety and novelty of contributions to the platform, with long-run implications for the stock of works potentially available for AI training. Taken together, our findings highlight the trade-off between interests of rightsholders and promoting innovation at the technological frontier. We discuss implications for copyright and AI policy.

Suggested Citation

  • Christian Peukert & Florian Abeillon & Jérémie Haese & Franziska Kaiser & Alexander Staub, 2024. "Strategic Behavior and AI Training Data," CESifo Working Paper Series 11099, CESifo.
  • Handle: RePEc:ces:ceswps:_11099
    as

    Download full text from publisher

    File URL: https://www.cesifo.org/DocDL/cesifo1_wp11099.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Erik Brynjolfsson & Danielle Li & Lindsey R. Raymond, 2023. "Generative AI at Work," NBER Working Papers 31161, National Bureau of Economic Research, Inc.
    2. Martin Beraja & David Y Yang & Noam Yuchtman, 2023. "Data-intensive Innovation and the State: Evidence from AI Firms in China," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 90(4), pages 1701-1723.
    3. Jiachen T. Wang & Zhun Deng & Hiroaki Chiba-Okabe & Boaz Barak & Weijie J. Su, 2024. "An Economic Solution to Copyright Challenges of Generative AI," Papers 2404.13964, arXiv.org, revised Sep 2024.
    4. Daron Acemoglu & David Autor & Jonathon Hazell & Pascual Restrepo, 2022. "Artificial Intelligence and Jobs: Evidence from Online Vacancies," Journal of Labor Economics, University of Chicago Press, vol. 40(S1), pages 293-340.
    5. Ed Felten & Manav Raj & Robert Seamans, 2023. "How will Language Modelers like ChatGPT Affect Occupations and Industries?," Papers 2303.01157, arXiv.org, revised Mar 2023.
    6. Joshua S. Gans, 2024. "Copyright Policy Options for Generative Artificial Intelligence," NBER Working Papers 32106, National Bureau of Economic Research, Inc.
    7. Ai, Chunrong & Norton, Edward C., 2003. "Interaction terms in logit and probit models," Economics Letters, Elsevier, vol. 80(1), pages 123-129, July.
    8. Douglas Guilbeault & Solène Delecourt & Tasker Hull & Bhargav Srinivasa Desikan & Mark Chu & Ethan Nadler, 2024. "Online images amplify gender bias," Nature, Nature, vol. 626(8001), pages 1049-1055, February.
    9. Jonathan Roth & Pedro H. C. Sant'Anna, 2023. "When Is Parallel Trends Sensitive to Functional Form?," Econometrica, Econometric Society, vol. 91(2), pages 737-747, March.
    10. Julia Bauer & Nikolaus Franke & Philipp Tuertscher, 2016. "Intellectual Property Norms in Online Communities: How User-Organized Intellectual Property Regulation Supports Innovation," Information Systems Research, INFORMS, vol. 27(4), pages 724-750, December.
    11. Tyna Eloundou & Sam Manning & Pamela Mishkin & Daniel Rock, 2023. "GPTs are GPTs: An Early Look at the Labor Market Impact Potential of Large Language Models," Papers 2303.10130, arXiv.org, revised Aug 2023.
    12. Jeffrey M Wooldridge, 2023. "Simple approaches to nonlinear difference-in-differences with panel data," The Econometrics Journal, Royal Economic Society, vol. 26(3), pages 31-66.
    13. Miguel Godinho de Matos & Idris Adjerid, 2022. "Consumer Consent and Firm Targeting After GDPR: The Case of a Large Telecom Provider," Management Science, INFORMS, vol. 68(5), pages 3330-3378, May.
    14. Mckinley L. Blackburn, 2015. "The Relative Performance of Poisson and Negative Binomial Regression Estimators," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 77(4), pages 605-616, August.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Gaétan de Rassenfosse & Adam B. Jaffe & Joel Waldfogel, 2024. "Intellectual Property and Creative Machines," NBER Chapters, in: Entrepreneurship and Innovation Policy and the Economy, volume 4, National Bureau of Economic Research, Inc.
    2. Evangelos Katsamakas & J. Manuel Sanchez-Cartas, 2024. "Generative Artificial Intelligence, Content Creation, and Platforms," Journal of Industry, Competition and Trade, Springer, vol. 24(1), pages 1-20, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Christian Peukert & Florian Abeillon & J'er'emie Haese & Franziska Kaiser & Alexander Staub, 2024. "Strategic Behavior and AI Training Data," Papers 2404.18445, arXiv.org.
    2. Dario Guarascio & Jelena Reljic & Roman Stollinger, 2023. "Artificial Intelligence and Employment: A Look into the Crystal Ball," LEM Papers Series 2023/34, Laboratory of Economics and Management (LEM), Sant'Anna School of Advanced Studies, Pisa, Italy.
    3. Daniel Goller & Christian Gschwendt & Stefan C. Wolter, 2023. ""This time it's different" Generative Artificial Intelligence and Occupational Choice," Economics of Education Working Paper Series 0209, University of Zurich, Department of Business Administration (IBW).
    4. Fossen, Frank M. & McLemore, Trevor & Sorgner, Alina, 2024. "Artificial Intelligence and Entrepreneurship," IZA Discussion Papers 17055, Institute of Labor Economics (IZA).
    5. Jin Liu & Xingchen Xu & Xi Nan & Yongjun Li & Yong Tan, 2023. ""Generate" the Future of Work through AI: Empirical Evidence from Online Labor Markets," Papers 2308.05201, arXiv.org, revised Jun 2024.
    6. Pablo Casas & Concepción Román, 2024. "The impact of artificial intelligence in the early retirement decision," Empirica, Springer;Austrian Institute for Economic Research;Austrian Economic Association, vol. 51(3), pages 583-618, August.
    7. Stefania Albanesi & Wabitsch Alena & António Dias da Silva & Juan F. Jimeno & Ana Lamo, 2024. "New Technologies and Jobs in Europe," Opportunity and Inclusive Growth Institute Working Papers 105, Federal Reserve Bank of Minneapolis.
    8. Flavio Calvino & Chiara Criscuolo & Luca Fontanelli & Lionel Nesta & Elena Verdolini, 2024. "The role of human capital for AI adoption: Evidence from French firms," CEP Discussion Papers dp2055, Centre for Economic Performance, LSE.
    9. Carvajal, Daniel & Franco, Catalina & Isaksson, Siri, 2024. "Will Artificial Intelligence Get in the Way of Achieving Gender Equality?," Discussion Paper Series in Economics 3/2024, Norwegian School of Economics, Department of Economics, revised 31 Oct 2024.
    10. Enrico Maria Fenoaltea & Dario Mazzilli & Aurelio Patelli & Angelica Sbardella & Andrea Tacchella & Andrea Zaccaria & Marco Trombetti & Luciano Pietronero, 2024. "Follow the money: a startup-based measure of AI exposure across occupations, industries and regions," Papers 2412.04924, arXiv.org, revised Dec 2024.
    11. Evangelos Katsamakas & Oleg V. Pavlov & Ryan Saklad, 2024. "Artificial intelligence and the transformation of higher education institutions," Papers 2402.08143, arXiv.org.
    12. Gaétan de Rassenfosse & Adam B. Jaffe & Joel Waldfogel, 2024. "Intellectual Property and Creative Machines," NBER Chapters, in: Entrepreneurship and Innovation Policy and the Economy, volume 4, National Bureau of Economic Research, Inc.
    13. Caleb Peppiatt, 2024. "The Future of Work: Inequality, Artificial Intelligence, and What Can Be Done About It. A Literature Review," Papers 2408.13300, arXiv.org.
    14. Amali Matharaarachchi & Wishmitha Mendis & Kanishka Randunu & Daswin De Silva & Gihan Gamage & Harsha Moraliyage & Nishan Mills & Andrew Jennings, 2024. "Optimizing Generative AI Chatbots for Net-Zero Emissions Energy Internet-of-Things Infrastructure," Energies, MDPI, vol. 17(8), pages 1-19, April.
    15. Barkowski, Scott, 2021. "Interpretation of nonlinear difference-in-differences: the role of the parallel trends assumption," MPRA Paper 108975, University Library of Munich, Germany.
    16. Olivier Armantier & Sebastian Doerr & Jon Frost & Andreas Fuster & Kelly Shue, 2024. "Nothing to hide? Gender and age differences in willingness to share data," Swiss Finance Institute Research Paper Series 24-99, Swiss Finance Institute.
    17. Anil R. Doshi & Oliver P. Hauser, 2023. "Generative artificial intelligence enhances creativity but reduces the diversity of novel content," Papers 2312.00506, arXiv.org, revised Mar 2024.
    18. James Bono & Alec Xu, 2024. "Randomized Controlled Trials for Security Copilot for IT Administrators," Papers 2411.01067, arXiv.org, revised Nov 2024.
    19. Kristina McElheran & J. Frank Li & Erik Brynjolfsson & Zachary Kroff & Emin Dinlersoz & Lucia Foster & Nikolas Zolas, 2024. "AI adoption in America: Who, what, and where," Journal of Economics & Management Strategy, Wiley Blackwell, vol. 33(2), pages 375-415, March.
    20. Michelle Alexopoulos & Jon Cohen, 2024. "Tracking technical change: Past, present and future," Canadian Journal of Economics/Revue canadienne d'économique, John Wiley & Sons, vol. 57(4), pages 1047-1087, November.

    More about this item

    Keywords

    generative artificial intelligence; training data; licensing; copyright; natural experiment;
    All these keywords.

    JEL classification:

    • K11 - Law and Economics - - Basic Areas of Law - - - Property Law
    • L82 - Industrial Organization - - Industry Studies: Services - - - Entertainment; Media
    • L86 - Industrial Organization - - Industry Studies: Services - - - Information and Internet Services; Computer Software

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ces:ceswps:_11099. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Klaus Wohlrabe (email available below). General contact details of provider: https://edirc.repec.org/data/cesifde.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.