IDEAS home Printed from https://ideas.repec.org/p/egu/wpaper/2437.html
   My bibliography  Save this paper

GitPat: A Database Linking Open Source Contributions & Patenting Activity of Organizations

Author

Listed:
  • Sergio Petralia

Abstract

This article outlines a method to link organizations’ patenting activities at the United States Patent and Trademark Office (USPTO) with their Open Source Software (OSS) contributions in GitHub, the most popular code-hosting service platform. It also provides two ready-to-use databases that are easy to connect to related data sources. The first includes information about all contributions (6,091,653) made to 54 of the most popular OSS projects until June 2024, amounting to over 49 million file changes and more than 3.3 billion line modifications. The second includes information on patents granted until June 2024 (1,719,510) to 1,328 organizations with activity in GitHub. This novel data can be used to explore the dynamics and mechanisms driving innovation within modern technological ecosystems, where the lines between proprietary and open-source development are becoming blurry. It offers an opportunity to investigate several unresolved puzzles in the economics of OSS literature, such as disentangling the intrinsic and extrinsic motivations behind individual contributions to OSS, understanding the strategic reasons organizations engage in OSS, and exploring collaboration and geographical concentration mechanisms in the production of digital technologies.

Suggested Citation

  • Sergio Petralia, 2024. "GitPat: A Database Linking Open Source Contributions & Patenting Activity of Organizations," Papers in Evolutionary Economic Geography (PEEG) 2437, Utrecht University, Department of Human Geography and Spatial Planning, Group Economic Geography, revised Nov 2024.
  • Handle: RePEc:egu:wpaper:2437
    as

    Download full text from publisher

    File URL: http://econ.geo.uu.nl/peeg/peeg2437.pdf
    File Function: Version November 2024
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Sergio Petralia, 2020. "Mapping General Purpose Technologies with Patent Data," Papers in Evolutionary Economic Geography (PEEG) 2027, Utrecht University, Department of Human Geography and Spatial Planning, Group Economic Geography, revised Jul 2020.
    2. Suzanne Scotchmer, 2010. "Openness, Open Source, and the Veil of Ignorance," American Economic Review, American Economic Association, vol. 100(2), pages 165-171, May.
    3. Lerner, Josh & Tirole, Jean, 2001. "The open source movement: Key research questions," European Economic Review, Elsevier, vol. 45(4-6), pages 819-826, May.
    4. Henkel, Joachim, 2004. "The Jukebox Mode of Innovation - A Model of Commercial Open Source Development," CEPR Discussion Papers 4507, C.E.P.R. Discussion Papers.
    5. Bitzer, Jurgen & Schrettl, Wolfram & Schroder, Philipp J.H., 2007. "Intrinsic motivation in open source software development," Journal of Comparative Economics, Elsevier, vol. 35(1), pages 160-169, March.
    6. Justin Pappas Johnson, 2002. "Open Source Software: Private Provision of a Public Good," Journal of Economics & Management Strategy, Wiley Blackwell, vol. 11(4), pages 637-662, December.
    7. Florian Seliger & Gaéran de Rassenfosse & Jan Kozak, 2019. "Geocoding of worldwide patent data," KOF Working papers 19-458, KOF Swiss Economic Institute, ETH Zurich.
    8. Massimo D'Antoni & Maria Alessandra Rossi, 2014. "Appropriability and Incentives with Complementary Innovations," Journal of Economics & Management Strategy, Wiley Blackwell, vol. 23(1), pages 103-124, March.
    9. Henkel, Joachim, 2006. "Selective revealing in open innovation processes: The case of embedded Linux," Research Policy, Elsevier, vol. 35(7), pages 953-969, September.
    10. Richard P. Bagozzi & Utpal M. Dholakia, 2006. "Open Source Software User Communities: A Study of Participation in Linux User Groups," Management Science, INFORMS, vol. 52(7), pages 1099-1115, July.
    11. Tesoriere, Antonio & Balletta, Luigi, 2017. "A dynamic model of open source vs proprietary R&D," European Economic Review, Elsevier, vol. 94(C), pages 221-239.
    12. Josh Lerner & Jean Tirole, 2005. "The Economics of Technology Sharing: Open Source and Beyond," Journal of Economic Perspectives, American Economic Association, vol. 19(2), pages 99-120, Spring.
    13. James Bessen & Eric Maskin, 2009. "Sequential innovation, patents, and imitation," RAND Journal of Economics, RAND Corporation, vol. 40(4), pages 611-635, December.
    14. Arnold Polanski, 2007. "Is The General Public Licence A Rational Choice?," Journal of Industrial Economics, Wiley Blackwell, vol. 55(4), pages 691-714, December.
    15. Petralia, Sergio, 2020. "Mapping general purpose technologies with patent data," Research Policy, Elsevier, vol. 49(7).
    16. Wachs, Johannes & Nitecki, Mariusz & Schueller, William & Polleres, Axel, 2022. "The Geography of Open Source Software: Evidence from GitHub," Technological Forecasting and Social Change, Elsevier, vol. 176(C).
    17. Andrea Fosfuri & Marco S. Giarratana & Alessandra Luzzi, 2008. "The Penguin Has Entered the Building: The Commercialization of Open Source Software Products," Organization Science, INFORMS, vol. 19(2), pages 292-305, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Engelhardt, Sebastian v. & Freytag, Andreas, 2013. "Institutions, culture, and open source," Journal of Economic Behavior & Organization, Elsevier, vol. 95(C), pages 90-110.
    2. Ramon Casadesus-Masanell & Gastón Llanes, 2011. "Mixed Source," Management Science, INFORMS, vol. 57(7), pages 1212-1230, July.
    3. Dongryul Lee & Byung Kim, 2013. "Motivations for Open Source Project Participation and Decisions of Software Developers," Computational Economics, Springer;Society for Computational Economics, vol. 41(1), pages 31-57, January.
    4. Reisinger, Markus & Ressner, Ludwig & Schmidtke, Richard & Thomes, Tim Paul, 2014. "Crowding-in of complementary contributions to public goods: Firm investment into open source software," Journal of Economic Behavior & Organization, Elsevier, vol. 106(C), pages 78-94.
    5. Alexy, Oliver & Reitzig, Markus, 2013. "Private–collective innovation, competition, and firms’ counterintuitive appropriation strategies," Research Policy, Elsevier, vol. 42(4), pages 895-913.
    6. Llanes, Gastón & de Elejalde, Ramiro, 2013. "Industry equilibrium with open-source and proprietary firms," International Journal of Industrial Organization, Elsevier, vol. 31(1), pages 36-49.
    7. Stephen M. Maurer & Suzanne Scotchmer, 2006. "Open Source Software: The New Intellectual Property Paradigm," NBER Working Papers 12148, National Bureau of Economic Research, Inc.
    8. Gastón Llanes, 2019. "Competitive strategy for open and user innovation," Journal of Economics & Management Strategy, Wiley Blackwell, vol. 28(2), pages 280-297, April.
    9. Gauguier, Jean-Jacques, 2009. "L’industrialisation de l’Open Source," Economics Thesis from University Paris Dauphine, Paris Dauphine University, number 123456789/4388 edited by Toledano, Joëlle.
    10. Burcu Tan & Edward G. Anderson, Jr. & Geoffrey G. Parker, 2020. "Platform Pricing and Investment to Drive Third-Party Value Creation in Two-Sided Networks," Information Systems Research, INFORMS, vol. 31(1), pages 217-239, March.
    11. Robert M. Sauer, 2007. "Why develop open-source software? The role of non-pecuniary benefits, monetary rewards, and open-source licence type," Oxford Review of Economic Policy, Oxford University Press and Oxford Review of Economic Policy Limited, vol. 23(4), pages 605-619, Winter.
    12. Eric Darmon & Dominique Torre, 2010. "Open source, dual licensing and software compétition," Post-Print halshs-00497623, HAL.
    13. Stephane Verani, 2006. "Open Source Development in a Differentiated Duopoly," Economics Discussion / Working Papers 06-05, The University of Western Australia, Department of Economics.
    14. Nicholas Economides & Evangelos Katsamakas, 2005. "Linux vs. Windows: A Comparison of Innovation Incentives and a Case Study," Working Papers 05-11, New York University, Leonard N. Stern School of Business, Department of Economics.
    15. Massimo D'Antoni & Maria Alessandra Rossi, 2014. "Appropriability and Incentives with Complementary Innovations," Journal of Economics & Management Strategy, Wiley Blackwell, vol. 23(1), pages 103-124, March.
    16. Rockett, Katharine, 2010. "Property Rights and Invention," Handbook of the Economics of Innovation, in: Bronwyn H. Hall & Nathan Rosenberg (ed.), Handbook of the Economics of Innovation, edition 1, volume 1, chapter 0, pages 315-380, Elsevier.
    17. Nicholas Economides & Evangelos Katsamakas, 2006. "Two-Sided Competition of Proprietary vs. Open Source Technology Platforms and the Implications for the Software Industry," Management Science, INFORMS, vol. 52(7), pages 1057-1071, July.
    18. Ghafele, Roya & Gibert, Benjamin, 2012. "Efficiency through openness: the economic value proposition of open source software," MPRA Paper 38088, University Library of Munich, Germany.
    19. Belleflamme,Paul & Peitz,Martin, 2015. "Industrial Organization," Cambridge Books, Cambridge University Press, number 9781107687899, January.
    20. Rentocchini, Francesco, 2011. "Sources and characteristics of software patents in the European Union: Some empirical considerations," Information Economics and Policy, Elsevier, vol. 23(1), pages 141-157, March.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:egu:wpaper:2437. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: the person in charge (email available below). General contact details of provider: https://edirc.repec.org/data/deguunl.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.