IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0234880.html
   My bibliography  Save this article

The proximity of ideas: An analysis of patent text using machine learning

Author

Listed:
  • Sijie Feng

Abstract

This paper introduces a measure of the proximity in ideas using unsupervised machine learning. Knowledge transfers are considered a key driving force of innovation and regional economic growth. I explore knowledge relationships by deriving vector space representations of a patent’s abstract text using Document Vectors (Doc2Vec), and using cosine similarity to measure their proximity in ideas space. I illustrate the potential uses of this method with an application to geographic localization in knowledge spillovers. For patents in the same technology field, their normalized text similarity is 0.02-0.05 S.D.s higher if they are located within the same city, compared to patents from other cities. This effect is much smaller than when knowledge transfers are measured using normalized patent citations: local patents receive about 0.23-0.30 S.D.s more local citations than compared to non-local control patents. These findings suggest that the effect of geography on knowledge transfers may be much smaller than the previous literature using citations suggests.

Suggested Citation

  • Sijie Feng, 2020. "The proximity of ideas: An analysis of patent text using machine learning," PLOS ONE, Public Library of Science, vol. 15(7), pages 1-19, July.
  • Handle: RePEc:plo:pone00:0234880
    DOI: 10.1371/journal.pone.0234880
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0234880
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0234880&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0234880?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Bryan Kelly & Dimitris Papanikolaou & Amit Seru & Matt Taddy, 2021. "Measuring Technological Innovation over the Long Run," American Economic Review: Insights, American Economic Association, vol. 3(3), pages 303-320, September.
    2. Jaffe, Adam B, 1986. "Technological Opportunity and Spillovers of R&D: Evidence from Firms' Patents, Profits, and Market Value," American Economic Review, American Economic Association, vol. 76(5), pages 984-1001, December.
    3. Nicholas Bloom & Mark Schankerman & John Van Reenen, 2013. "Identifying Technology Spillovers and Product Market Rivalry," Econometrica, Econometric Society, vol. 81(4), pages 1347-1393, July.
    4. Yasusada Murata & Ryo Nakajima & Ryosuke Okamoto & Ryuichi Tamura, 2014. "Localized Knowledge Spillovers and Patent Citations: A Distance-Based Approach," The Review of Economics and Statistics, MIT Press, vol. 96(5), pages 967-985, December.
    5. Adam B. Jaffe & Manuel Trajtenberg & Rebecca Henderson, 1993. "Geographic Localization of Knowledge Spillovers as Evidenced by Patent Citations," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 108(3), pages 577-598.
    6. Kristy Buzard & Gerald A. Carlino & Jake Carr & Robert M. Hunt & Tony E. Smith, 2015. "Localized Knowledge Spillovers: Evidence from the Agglomeration of American R&D Labs and Patent Data," Working Papers 15-3, Federal Reserve Bank of Philadelphia.
    7. Sharon Belenzon & Mark Schankerman, 2013. "Spreading the Word: Geography, Policy, and Knowledge Spillovers," The Review of Economics and Statistics, MIT Press, vol. 95(3), pages 884-903, July.
    8. Bryan, Kevin A. & Ozcan, Yasin & Sampat, Bhaven, 2020. "In-text patent citations: A user's guide," Research Policy, Elsevier, vol. 49(4).
    9. Audretsch, David B & Feldman, Maryann P, 1996. "R&D Spillovers and the Geography of Innovation and Production," American Economic Review, American Economic Association, vol. 86(3), pages 630-640, June.
    10. Alcácer, Juan & Gittelman, Michelle & Sampat, Bhaven, 2009. "Applicant and examiner citations in U.S. patents: An overview and analysis," Research Policy, Elsevier, vol. 38(2), pages 415-427, March.
    11. Jaffe, Adam B, 1989. "Real Effects of Academic Research," American Economic Review, American Economic Association, vol. 79(5), pages 957-970, December.
    12. Ina Ganguli & Jeffrey Lin & Nicholas Reynolds, 2017. "The Paper Trail of Knowledge Spillovers: Evidence from Patent Interferences [REVISED]," Working Papers 17-44, Federal Reserve Bank of Philadelphia.
    13. Sarah Kaplan & Keyvan Vakili, 2015. "The double-edged sword of recombination in breakthrough innovation," Strategic Management Journal, Wiley Blackwell, vol. 36(10), pages 1435-1457, October.
    14. Alberto Galasso & Mark Schankerman, 2014. "Patents and Cumulative Innovation: Causal Evidence from the Courts," NBER Working Papers 20269, National Bureau of Economic Research, Inc.
    15. Antonin Bergeaud & Yoann Potiron & Juste Raimbault, 2017. "Classifying patents based on their semantic content," PLOS ONE, Public Library of Science, vol. 12(4), pages 1-22, April.
    16. Michael Roach & Wesley M. Cohen, 2013. "Lens or Prism? Patent Citations as a Measure of Knowledge Flows from Public Research," Management Science, INFORMS, vol. 59(2), pages 504-525, October.
    17. Murray, Fiona & Stern, Scott, 2007. "Do formal intellectual property rights hinder the free flow of scientific knowledge?: An empirical test of the anti-commons hypothesis," Journal of Economic Behavior & Organization, Elsevier, vol. 63(4), pages 648-687, August.
    18. Fiona E. Murray & Scott Stern, 2007. "Do Formal Intellectual Property Rights Hinder the Free Flow of Scientific Knowledge?: An Empirical Test of the Anti-Commons Hypothesis," NBER Chapters, in: Academic Science and Entrepreneurship: Dual Engines of Growth, National Bureau of Economic Research, Inc.
    19. Sam Arts & Bruno Cassiman & Juan Carlos Gomez, 2018. "Text matching to measure patent similarity," Strategic Management Journal, Wiley Blackwell, vol. 39(1), pages 62-84, January.
    20. Mikko Packalen & Jay Bhattacharya, 2015. "New Ideas in Invention," NBER Working Papers 20922, National Bureau of Economic Research, Inc.
    21. Paul Almeida & Bruce Kogut, 1999. "Localization of Knowledge and the Mobility of Engineers in Regional Networks," Management Science, INFORMS, vol. 45(7), pages 905-917, July.
    22. Adam B. Jaffe & Gaétan de Rassenfosse, 2017. "Patent citation data in social science research: Overview and best practices," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 68(6), pages 1360-1374, June.
    23. Ashish Arora & Sharon Belenzon & Honggi Lee, 2018. "Reversed citations and the localization of knowledge spillovers," Journal of Economic Geography, Oxford University Press, vol. 18(3), pages 495-521.
    24. Pierre Azoulay & Waverly Ding & Toby Stuart, 2009. "The Impact Of Academic Patenting On The Rate, Quality And Direction Of (Public) Research Output," Journal of Industrial Economics, Wiley Blackwell, vol. 57(4), pages 637-676, December.
    25. Peter Thompson & Melanie Fox-Kean, 2005. "Patent Citations and the Geography of Knowledge Spillovers: A Reassessment: Reply," American Economic Review, American Economic Association, vol. 95(1), pages 465-466, March.
    26. Maryann Feldman, 2014. "The character of innovative places: entrepreneurial strategy, economic development, and prosperity," Small Business Economics, Springer, vol. 43(1), pages 9-20, June.
    27. Peter Thompson & Melanie Fox-Kean, 2005. "Patent Citations and the Geography of Knowledge Spillovers: A Reassessment," American Economic Review, American Economic Association, vol. 95(1), pages 450-460, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. A. Fronzetti Colladon & B. Guardabascio & F. Venturini, 2023. "A new mapping of technological interdependence," Papers 2308.00014, arXiv.org, revised Sep 2024.
    2. Hongshu Chen & Xinna Song & Qianqian Jin & Ximeng Wang, 2022. "Network dynamics in university-industry collaboration: a collaboration-knowledge dual-layer network perspective," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(11), pages 6637-6660, November.
    3. Jessica Birkholz & Jarina Kühn & Mariia Shkolnykova, 2022. "Exploration or Exploitation: Innovation Behavior of SMEs and Large Firms during the COVID-19 Crisis," Bremen Papers on Economics & Innovation 2203, University of Bremen, Faculty of Business Studies and Economics.
    4. de Rassenfosse, Gaétan & Palangkaraya, Alfons, 2023. "Do patent pledges accelerate innovation?," Research Policy, Elsevier, vol. 52(5).
    5. Ascione, Grazia Sveva, 2023. "Technological diversity to address complex challenges: the contribution of American universities to sdgs," MPRA Paper 119452, University Library of Munich, Germany.
    6. Nils Grashof & Alexander Kopka, 2023. "Artificial intelligence and radical innovation: an opportunity for all companies?," Small Business Economics, Springer, vol. 61(2), pages 771-797, August.
    7. Guangtong Li & L. Siddharth & Jianxi Luo, 2023. "Embedding knowledge graph of patent metadata to measure knowledge proximity," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 74(4), pages 476-490, April.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wang, Fang, 2024. "Does the recombination of distant scientific knowledge generate valuable inventions? An analysis of pharmaceutical patents," Technovation, Elsevier, vol. 130(C).
    2. Po‐Hsuan Hsu & Hai‐Ping Hui & Hsiao‐Hui Lee & Kevin Tseng, 2022. "Supply chain technology spillover, customer concentration, and product invention," Journal of Economics & Management Strategy, Wiley Blackwell, vol. 31(2), pages 393-417, April.
    3. Ashish Arora & Sharon Belenzon & Honggi Lee, 2018. "Reversed citations and the localization of knowledge spillovers," Journal of Economic Geography, Oxford University Press, vol. 18(3), pages 495-521.
    4. Eunhee Sohn, 2021. "How Local Industry R&D Shapes Academic Research: Evidence from the Agricultural Biotechnology Revolution," Organization Science, INFORMS, vol. 32(3), pages 675-707, May.
    5. Arts, Sam & Hou, Jianan & Gomez, Juan Carlos, 2021. "Natural language processing to identify the creation and impact of new technologies in patent text: Code, data, and new measures," Research Policy, Elsevier, vol. 50(2).
    6. Castillo, Victoria & Figal-Garone, Lucas & Maffioli, Alessandro & Rojo, Sofia & Stucchi, Rodolfo, 2016. "The Effects of Knowledge Spillovers through Labor Mobility," MPRA Paper 69141, University Library of Munich, Germany.
    7. Bryan, Kevin A. & Ozcan, Yasin & Sampat, Bhaven, 2020. "In-text patent citations: A user's guide," Research Policy, Elsevier, vol. 49(4).
    8. Autant-Bernard, Corinne & Fadairo, Muriel & Massard, Nadine, 2013. "Knowledge diffusion and innovation policies within the European regions: Challenges based on recent empirical evidence," Research Policy, Elsevier, vol. 42(1), pages 196-210.
    9. de Rassenfosse, Gaétan & Pellegrino, Gabriele & Raiteri, Emilio, 2024. "Do patents enable disclosure? Evidence from the invention secrecy act," International Journal of Industrial Organization, Elsevier, vol. 92(C).
    10. Nelson, Andrew J., 2009. "Measuring knowledge spillovers: What patents, licenses and publications reveal about innovation diffusion," Research Policy, Elsevier, vol. 38(6), pages 994-1005, July.
    11. Adam B. Jaffe & Gaétan de Rassenfosse, 2017. "Patent citation data in social science research: Overview and best practices," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 68(6), pages 1360-1374, June.
    12. Jasjit Singh & Ajay Agrawal, 2011. "Recruiting for Ideas: How Firms Exploit the Prior Inventions of New Hires," Management Science, INFORMS, vol. 57(1), pages 129-150, January.
    13. Crespi, Gustavo & Figal Garone, Lucas & Maffioli, Alessandro & Stein, Ernesto, 2020. "Public support to R&D, productivity, and spillover effects: Firm-level evidence from Chile," World Development, Elsevier, vol. 130(C).
    14. Francesco Quatraro & Stefano Usai, 2017. "Are knowledge flows all alike? Evidence from European regions," Regional Studies, Taylor & Francis Journals, vol. 51(8), pages 1246-1258, August.
    15. Miguelez, Ernest & Noumedem Temgoua, Claudia, 2020. "Inventor migration and knowledge flows: A two-way communication channel?," Research Policy, Elsevier, vol. 49(9).
    16. Jasjit Singh & Matt Marx, 2013. "Geographic Constraints on Knowledge Spillovers: Political Borders vs. Spatial Proximity," Management Science, INFORMS, vol. 59(9), pages 2056-2078, September.
    17. Hyuk-Soo Kwon & Jihong Lee & Sokbae Lee & Ryungha Oh, 2022. "Knowledge spillovers and patent citations: trends in geographic localization, 1976–2015," Economics of Innovation and New Technology, Taylor & Francis Journals, vol. 31(3), pages 123-147, April.
    18. Carlino, Gerald & Kerr, William R., 2015. "Agglomeration and Innovation," Handbook of Regional and Urban Economics, in: Gilles Duranton & J. V. Henderson & William C. Strange (ed.), Handbook of Regional and Urban Economics, edition 1, volume 5, chapter 0, pages 349-404, Elsevier.
    19. Sergey Lychagin & Joris Pinkse & Margaret E. Slade & John Van Reenen, 2016. "Spillovers in Space: Does Geography Matter?," Journal of Industrial Economics, Wiley Blackwell, vol. 64(2), pages 295-335, June.
    20. Diemer, Andreas & Regan, Tanner, 2022. "No inventor is an island: Social connectedness and the geography of knowledge flows in the US," Research Policy, Elsevier, vol. 51(2).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0234880. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.