The role of data for AI startup growth

Authors:
  • Bessen, James
  • Impink, Stephen Michael
  • Reichensperger, Lydia
  • Seamans, Robert

Abstract

Artificial intelligence (AI)-enabled products are expected to drive economic growth. Training data are important for firms developing AI-enabled products; without training data, firms cannot develop or refine their algorithms. This is particularly the case for AI startups developing new algorithms and products. However, there is no consensus in the literature on which aspects of training data are most important. Using unique survey data of AI startups, we find a positive correlation between having proprietary training data and obtaining future venture capital funding. Moreover, this correlation is greater for startups in markets where data is a major advantage and for startups using more sophisticated algorithms, such as neural networks and ensemble learning.

Suggested Citation

  • Bessen, James & Impink, Stephen Michael & Reichensperger, Lydia & Seamans, Robert, 2022. "The role of data for AI startup growth," Research Policy, Elsevier, vol. 51(5).
  • Handle: RePEc:eee:respol:v:51:y:2022:i:5:s0048733322000415
    DOI: 10.1016/j.respol.2022.104513

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0048733322000415
    Download Restriction: Full text for ScienceDirect subscribers only

    Citations

    Cited by:

    1. Alessandra Colombelli & Elettra D’Amico & Emilio Paolucci, 2023. "When computer science is not enough: universities knowledge specializations behind artificial intelligence startups in Italy," The Journal of Technology Transfer, Springer, vol. 48(5), pages 1599-1627, October.
    2. Kristina McElheran & J. Frank Li & Erik Brynjolfsson & Zachary Kroff & Emin Dinlersoz & Lucia Foster & Nikolas Zolas, 2024. "AI adoption in America: Who, what, and where," Journal of Economics & Management Strategy, Wiley Blackwell, vol. 33(2), pages 375-415, March.
    3. Igna, Ioana & Venturini, Francesco, 2023. "The determinants of AI innovation across European firms," Research Policy, Elsevier, vol. 52(2).
    4. Fossen, Frank M. & McLemore, Trevor & Sorgner, Alina, 2024. "Artificial Intelligence and Entrepreneurship," IZA Discussion Papers 17055, Institute of Labor Economics (IZA).
    5. ZHU Chen & MOTOHASHI Kazuyuki, 2024. "The Fundraising of AI Startups: Evidence from web data," Discussion papers 24021, Research Institute of Economy, Trade and Industry (RIETI).
    6. Christian Peukert & Margaritha Windisch, 2023. "The Economics of Copyright in the Digital Age," CESifo Working Paper Series 10687, CESifo.
    7. Emin Dinlersoz & Can Dogan & Nikolas Zolas, 2024. "Starting Up AI," Working Papers 24-09, Center for Economic Studies, U.S. Census Bureau.
    8. Flavio Calvino & Luca Fontanelli, 2023. "Artificial intelligence, complementary assets and productivity: evidence from French firms," LEM Papers Series 2023/35, Laboratory of Economics and Management (LEM), Sant'Anna School of Advanced Studies, Pisa, Italy.
    9. Nam, Jinyoung & Kim, Junghwan & Jung, Yoonhyuk, 2023. "Understandings of the AI business ecosystem in South Korea: AI startups' perspective," 32nd European Regional ITS Conference, Madrid 2023: Realising the digital decade in the European Union – Easier said than done? 278005, International Telecommunications Society (ITS).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Huber, Martin & Meier, Jonas & Wallimann, Hannes, 2022. "Business analytics meets artificial intelligence: Assessing the demand effects of discounts on Swiss train tickets," Transportation Research Part B: Methodological, Elsevier, vol. 163(C), pages 22-39.
    2. MARTENS Bertin, 2020. "An economic perspective on data and platform market power," JRC Working Papers on Digital Economy 2020-09, Joint Research Centre.
    3. Oliver Falck & Johannes Koenen, 2020. "Resource “Data”: Economic Benefits of Data Provision," CESifo Forum, ifo Institute - Leibniz Institute for Economic Research at the University of Munich, vol. 21(03), pages 31-41, September.
    4. Steffen, Nico & Wiewiorra, Lukas & Kroon, Peter, 2021. "Wettbewerb und Regulierung in der Plattform- und Datenökonomie," WIK Discussion Papers 481, WIK Wissenschaftliches Institut für Infrastruktur und Kommunikationsdienste GmbH.
    5. Daron Acemoglu & Ali Makhdoumi & Azarakhsh Malekian & Asu Ozdaglar, 2022. "Too Much Data: Prices and Inefficiencies in Data Markets," American Economic Journal: Microeconomics, American Economic Association, vol. 14(4), pages 218-256, November.
    6. Michael Lechner, 2023. "Causal Machine Learning and its use for public policy," Swiss Journal of Economics and Statistics, Springer;Swiss Society of Economics and Statistics, vol. 159(1), pages 1-15, December.
    7. Chen, S. & Doerr, S. & Frost, J. & Gambacorta, L. & Shin, H.S., 2023. "The fintech gender gap," Journal of Financial Intermediation, Elsevier, vol. 54(C).
    8. Rahman, Mustafizur & Al-Hasan, Md., 2018. "Male-Female wage gap and informal employment in Bangladesh: A quantile regression approach," MPRA Paper 90131, University Library of Munich, Germany.
    9. Mark Kattenberg & Bas Scheer & Jurre Thiel, 2023. "Causal forests with fixed effects for treatment effect heterogeneity in difference-in-differences," CPB Discussion Paper 452, CPB Netherlands Bureau for Economic Policy Analysis.
    10. Sara Wong, 2017. "Minimum wage impacts on wages and hours worked of low-income workers in Ecuador," Working Papers PMMA 2017-14, PEP-PMMA.
    11. Daniel Boller & Michael Lechner & Gabriel Okasa, 2021. "The Effect of Sport in Online Dating: Evidence from Causal Machine Learning," Papers 2104.04601, arXiv.org.
    12. Peter Hull & Michal Kolesár & Christopher Walters, 2022. "Labor by design: contributions of David Card, Joshua Angrist, and Guido Imbens," Scandinavian Journal of Economics, Wiley Blackwell, vol. 124(3), pages 603-645, July.
    13. Hollenstein, Heinz & Woerter, Martin, 2008. "Inter- and intra-firm diffusion of technology: The example of E-commerce: An analysis based on Swiss firm-level data," Research Policy, Elsevier, vol. 37(3), pages 545-564, April.
    14. Andrew A. Toole & Dirk Czarnitzki & Christian Rammer, 2015. "University research alliances, absorptive capacity, and the contribution of startups to employment growth," Economics of Innovation and New Technology, Taylor & Francis Journals, vol. 24(5), pages 532-549, July.
    15. Leone, Maria Isabella & Messeni Petruzzelli, Antonio & Natalicchio, Angelo, 2022. "Boundary spanning through external technology acquisition: The moderating role of star scientists and upstream alliances," Technovation, Elsevier, vol. 116(C).
    16. Tarsia, Romano, 2024. "Heterogeneous effects of weather shocks on firm economic performance," LSE Research Online Documents on Economics 124251, London School of Economics and Political Science, LSE Library.
    17. Imbens, Guido W., 2014. "Instrumental Variables: An Econometrician's Perspective," IZA Discussion Papers 8048, Institute of Labor Economics (IZA).
    18. Hazar Altınbaş & Vincenzo Pacelli & Edgardo Sica, 2022. "An Empirical Assessment of the Contagion Determinants in the Euro Area in a Period of Sovereign Debt Risk," Italian Economic Journal: A Continuation of Rivista Italiana degli Economisti and Giornale degli Economisti, Springer;Società Italiana degli Economisti (Italian Economic Association), vol. 8(2), pages 339-371, July.
    19. James T. E. Chapman & Ajit Desai, 2023. "Macroeconomic Predictions Using Payments Data and Machine Learning," Forecasting, MDPI, vol. 5(4), pages 1-32, November.
    20. Byron Botha & Rulof Burger & Kevin Kotzé & Neil Rankin & Daan Steenkamp, 2023. "Big data forecasting of South African inflation," Empirical Economics, Springer, vol. 65(1), pages 149-188, July.

    More about this item

    Keywords

    Artificial intelligence; Competition; Data; Algorithms; Venture capital;

    JEL classification:

    • O33: Economic Development, Innovation, Technological Change, and Growth > Innovation; Research and Development; Technological Change; Intellectual Property Rights > Technological Change: Choices and Consequences; Diffusion Processes
    • J21: Labor and Demographic Economics > Demand and Supply of Labor > Labor Force and Employment, Size, and Structure
    • L10: Industrial Organization > Market Structure, Firm Strategy, and Market Performance > General
    • L26: Industrial Organization > Firm Objectives, Organization, and Behavior > Entrepreneurship
