Printed from https://ideas.repec.org/p/hal/journl/hal-04261926.html

Complementarities in learning from data: insights from general search

Authors

  • Maximilian Schäfer

    (IMT-BS - DEFI - Département Droit, Economie et Finances - TEM - Télécom Ecole de Management - IMT - Institut Mines-Télécom [Paris] - IMT-BS - Institut Mines-Télécom Business School - IMT - Institut Mines-Télécom [Paris], LITEM - Laboratoire en Innovation, Technologies, Economie et Management (EA 7363) - UEVE - Université d'Évry-Val-d'Essonne - Université Paris-Saclay - IMT-BS - Institut Mines-Télécom Business School - IMT - Institut Mines-Télécom [Paris])

  • Geza Sapi

    (European Commission [Brussels])

Abstract

The ability to accurately predict consumer preferences is a key factor in a digital firm's success. Examples include targeted advertising and, more broadly, business models that rely on capturing consumers' attention. The prediction technologies used to learn consumer preferences rely on consumer-generated data. Despite the importance of data-driven technologies, little is known about the precise role that data scale plays in prediction accuracy. From a policy perspective, a better understanding of the role of data is needed to assess the risks that "big data" might pose for competition. This article highlights potential complementarities between different data dimensions in algorithmic learning. We test our hypothesis using search engine data from Yahoo! and provide evidence that more data in the within-user dimension enhances the efficiency of algorithmic learning in the across-user dimension. Our findings suggest that ignoring these complementarities might lead to underestimating scale advantages from data.

Suggested Citation

  • Maximilian Schäfer & Geza Sapi, 2023. "Complementarities in learning from data: insights from general search," Post-Print hal-04261926, HAL.
  • Handle: RePEc:hal:journl:hal-04261926
    DOI: 10.1016/j.infoecopol.2023.101063


    References listed on IDEAS

    1. Cédric Argenton & Jens Prüfer, 2012. "Search Engine Competition With Network Externalities," Journal of Competition Law and Economics, Oxford University Press, vol. 8(1), pages 73-105.
    2. Dirk Bergemann & Alessandro Bonatti & Tan Gan, 2022. "The economics of social data," RAND Journal of Economics, RAND Corporation, vol. 53(2), pages 263-296, June.
    3. Maryam Farboodi & Roxana Mihet & Thomas Philippon & Laura Veldkamp, 2019. "Big Data and Firm Dynamics," AEA Papers and Proceedings, American Economic Association, vol. 109, pages 38-42, May.
    4. Jens Prüfer & Christoph Schottmüller, 2021. "Competing with Big Data," Journal of Industrial Economics, Wiley Blackwell, vol. 69(4), pages 967-1008, December.
    5. Schaefer, Maximilian & Sapi, Geza & Lorincz, Szabolcs, 2018. "The effect of big data on recommendation quality: The example of internet search," DICE Discussion Papers 284, Heinrich Heine University Düsseldorf, Düsseldorf Institute for Competition Economics (DICE).
    6. X Nie & S Wager, 2021. "Quasi-oracle estimation of heterogeneous treatment effects," Biometrika, Biometrika Trust, vol. 108(2), pages 299-319.
    7. Patrick Bajari & Victor Chernozhukov & Ali Hortaçsu & Junichi Suzuki, 2019. "The Impact of Big Data on Firm Performance: An Empirical Investigation," AEA Papers and Proceedings, American Economic Association, vol. 109, pages 33-37, May.
    8. Daron Acemoglu & Ali Makhdoumi & Azarakhsh Malekian & Asu Ozdaglar, 2022. "Too Much Data: Prices and Inefficiencies in Data Markets," American Economic Journal: Microeconomics, American Economic Association, vol. 14(4), pages 218-256, November.
    9. Hema Yoganarasimhan, 2020. "Search Personalization Using Machine Learning," Management Science, INFORMS, vol. 66(3), pages 1045-1070, March.
    10. Catherine Tucker, 2019. "Digital Data, Platforms and the Usual [Antitrust] Suspects: Network Effects, Switching Costs, Essential Facility," Review of Industrial Organization, Springer;The Industrial Organization Society, vol. 54(4), pages 683-694, June.
    11. Eduardo M. Azevedo & Alex Deng & José Luis Montiel Olea & Justin Rao & E. Glen Weyl, 2020. "A/B Testing with Fat Tails," Journal of Political Economy, University of Chicago Press, vol. 128(12), pages 4614-4672.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Bertin Martens, 2020. "An economic perspective on data and platform market power," JRC Working Papers on Digital Economy 2020-09, Joint Research Centre.
    2. Ehsan Valavi & Joel Hestness & Newsha Ardalani & Marco Iansiti, 2022. "Time and the Value of Data," Papers 2203.09118, arXiv.org.
    3. Bergemann, Dirk & Ottaviani, Marco, 2021. "Information Markets and Nonmarkets," CEPR Discussion Papers 16459, C.E.P.R. Discussion Papers.
    4. Graef, Inge & Prüfer, Jens, 2021. "Governance of data sharing: A law & economics proposal," Research Policy, Elsevier, vol. 50(9).
    5. Georgios Petropoulos & Bertin Martens & Geoffrey Parker & Marshall Van Alstyne, 2023. "Platform Competition and Information Sharing," CESifo Working Paper Series 10663, CESifo.
    6. Yiquan Gu & Leonardo Madio & Carlo Reggiani, 2022. "Data brokers co-opetition," Oxford Economic Papers, Oxford University Press, vol. 74(3), pages 820-839.
    7. Argentesi, Elena & Buccirossi, Paolo & Calvano, Emilio & Duso, Tomaso & Marrazzo, Alessia & Nava, Salvatore, 2021. "Merger Policy in Digital Markets: An Ex Post Assessment," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 17(1), pages 95-140.
    8. Hemant Bhargava & Antoine Dubus & David Ronayne & Shiva Shekhar, 2024. "The Strategic Value of Data Sharing in Interdependent Markets," CESifo Working Paper Series 10963, CESifo.
    9. Bertin Martens & Alexandre de Streel & Inge Graef & Thomas Tombal & Nestor Duch-Brown, 2020. "Business-to-Business data sharing: An economic and legal analysis," JRC Working Papers on Digital Economy 2020-05, Joint Research Centre.
    10. Calvano, Emilio & Polo, Michele, 2021. "Market power, competition and innovation in digital markets: A survey," Information Economics and Policy, Elsevier, vol. 54(C).
    11. Nathalie Jorzik & Paula Johanna Kirchhof & Frank Mueller-Langer, 2024. "Industrial data sharing and data readiness: a law and economics perspective," European Journal of Law and Economics, Springer, vol. 57(1), pages 181-205, April.
    12. Dirk Bergemann & Alessandro Bonatti, 2024. "Data, Competition, and Digital Platforms," American Economic Review, American Economic Association, vol. 114(8), pages 2553-2595, August.
    13. Mert Demirer & Diego Jimenez-Hernandez & Dean Li & Sida Peng, 2024. "Data, Privacy Laws and Firm Production: Evidence from the GDPR," Working Paper Series WP 2024-02, Federal Reserve Bank of Chicago.
    14. Shan Huang & Michael Allan Ribers & Hannes Ullrich, 2021. "The Value of Data for Prediction Policy Problems: Evidence from Antibiotic Prescribing," Discussion Papers of DIW Berlin 1939, DIW Berlin, German Institute for Economic Research.
    15. Dirk Bergemann & Alessandro Bonatti & Tan Gan, 2022. "The economics of social data," RAND Journal of Economics, RAND Corporation, vol. 53(2), pages 263-296, June.
    16. Catherine E. Tucker, 2023. "The Economics of Privacy: An Agenda," NBER Chapters, in: The Economics of Privacy, National Bureau of Economic Research, Inc.
    17. Lagerlöf, Johan N.M., 2023. "Surfing incognito: Welfare effects of anonymous shopping," International Journal of Industrial Organization, Elsevier, vol. 87(C).
    18. Jiadong Gu, 2024. "Data Trade and Consumer Privacy," Papers 2406.12457, arXiv.org, revised Jul 2024.
    19. Pehr-Johan Norbäck & Lars Persson, 2024. "Why generative AI can make creative destruction more creative but less destructive," Small Business Economics, Springer, vol. 63(1), pages 349-377, June.

    More about this item

    Keywords

    Sustainable Development Goals

