IDEAS home Printed from https://ideas.repec.org/p/osf/socarx/x78ys.html
   My bibliography  Save this paper

The Missing 15 Percent of Patent Citations

Author

Listed:
  • Verluise, Cyril
  • Cristelli, Gabriele
  • Higham, Kyle
  • de Rassenfosse, Gaetan

Abstract

Patent citations are one of the most commonly-used metrics in the innovation literature. Leading uses of patent-to-patent citations are associated with the quantification of inventions' quality and the measurement of knowledge flows. Due to their widespread availability, scholars have exploited citations listed on the front-page of patent documents. Citations appearing in the full-text of patent documents have been neglected. We apply modern machine learning methods to extract these citations from the text of USPTO patent documents. Overall, we are able to recover an additional 15 percent of patent citations that could not be found using only front-page data. We show that "in-text" citations bring a different type of information compared to front-page citations. They exhibit higher text-similarity to the citing patents and alter the ranking of patent importance. The dataset is available at patcit.io (CC-BY-4).

Suggested Citation

  • Verluise, Cyril & Cristelli, Gabriele & Higham, Kyle & de Rassenfosse, Gaetan, 2020. "The Missing 15 Percent of Patent Citations," SocArXiv x78ys, Center for Open Science.
  • Handle: RePEc:osf:socarx:x78ys
    DOI: 10.31219/osf.io/x78ys
    as

    Download full text from publisher

    File URL: https://osf.io/download/5fe988c51e6d9702072faba1/
    Download Restriction: no

    File URL: https://libkey.io/10.31219/osf.io/x78ys?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Sokoloff, Kenneth L., 1988. "Inventive Activity in Early Industrial America: Evidence From Patent Records, 1790–1846," The Journal of Economic History, Cambridge University Press, vol. 48(4), pages 813-850, December.
    2. Juan Alcácer & Michelle Gittelman, 2006. "Patent Citations as a Measure of Knowledge Flows: The Influence of Examiner Citations," The Review of Economics and Statistics, MIT Press, vol. 88(4), pages 774-779, November.
    3. Adam B. Jaffe & Michael S. Fogarty & Bruce A. Banks, 1998. "Evidence from Patents and Patent Citations on the Impact of NASA and Other Federal Labs on Commercial Innovation," Journal of Industrial Economics, Wiley Blackwell, vol. 46(2), pages 183-205, June.
    4. Christopher L Benson & Christopher L Magee, 2015. "Quantitative Determination of Technological Improvement from Patent Data," PLOS ONE, Public Library of Science, vol. 10(4), pages 1-23, April.
    5. Cristian Candia & C. Jara-Figueroa & Carlos Rodriguez-Sickert & Albert-László Barabási & César A. Hidalgo, 2019. "The universal decay of collective memory and attention," Nature Human Behaviour, Nature, vol. 3(1), pages 82-91, January.
    6. Michael J. Andrews, 2021. "Historical patent data: A practitioner's guide," Journal of Economics & Management Strategy, Wiley Blackwell, vol. 30(2), pages 368-397, May.
    7. Manuel Trajtenberg, 1990. "A Penny for Your Quotes: Patent Citations and the Value of Innovations," RAND Journal of Economics, The RAND Corporation, vol. 21(1), pages 172-187, Spring.
    8. Ufuk Akcigit & John Grigsby & Tom Nicholas, 2017. "Immigration and the Rise of American Ingenuity," American Economic Review, American Economic Association, vol. 107(5), pages 327-331, May.
    9. Petra Moser & Tom Nicholas, 2004. "Was Electricity a General Purpose Technology? Evidence from Historical Patent Citations," American Economic Review, American Economic Association, vol. 94(2), pages 388-394, May.
    10. Bronwyn H. Hall & Adam Jaffe & Manuel Trajtenberg, 2005. "Market Value and Patent Citations," RAND Journal of Economics, The RAND Corporation, vol. 36(1), pages 16-38, Spring.
    11. Adam B. Jaffe & Manuel Trajtenberg & Rebecca Henderson, 1993. "Geographic Localization of Knowledge Spillovers as Evidenced by Patent Citations," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 108(3), pages 577-598.
    12. Sharon Belenzon & Mark Schankerman, 2013. "Spreading the Word: Geography, Policy, and Knowledge Spillovers," The Review of Economics and Statistics, MIT Press, vol. 95(3), pages 884-903, July.
    13. Gaétan de Rassenfosse & Adam Jaffe & Emilio Raiteri, 2019. "The procurement of innovation by the U.S. government," PLOS ONE, Public Library of Science, vol. 14(8), pages 1-11, August.
    14. Florian Seliger & Gaéran de Rassenfosse & Jan Kozak, 2019. "Geocoding of worldwide patent data," KOF Working papers 19-458, KOF Swiss Economic Institute, ETH Zurich.
    15. Bryan, Kevin A. & Ozcan, Yasin & Sampat, Bhaven, 2020. "In-text patent citations: A user's guide," Research Policy, Elsevier, vol. 49(4).
    16. Martin Meyer, 2000. "What is Special about Patent Citations? Differences between Scientific and Patent Citations," Scientometrics, Springer;Akadémiai Kiadó, vol. 49(1), pages 93-123, August.
    17. Carpenter, Mark P. & Narin, Francis & Woolf, Patricia, 1981. "Citation rates to technologically important patents," World Patent Information, Elsevier, vol. 3(4), pages 160-163, October.
    18. Sarah Kaplan & Keyvan Vakili, 2015. "The double-edged sword of recombination in breakthrough innovation," Strategic Management Journal, Wiley Blackwell, vol. 36(10), pages 1435-1457, October.
    19. Stefan Wagner & Karin Hoisl & Grid Thoma, 2014. "Overcoming localization of knowledge — the role of professional service firms," Strategic Management Journal, Wiley Blackwell, vol. 35(11), pages 1671-1688, November.
    20. Sam Arts & Bruno Cassiman & Juan Carlos Gomez, 2018. "Text matching to measure patent similarity," Strategic Management Journal, Wiley Blackwell, vol. 39(1), pages 62-84, January.
    21. Giovanni Peri, 2005. "Determinants of Knowledge Flows and Their Effect on Innovation," The Review of Economics and Statistics, MIT Press, vol. 87(2), pages 308-322, May.
    22. Adams, Stephen, 2010. "The text, the full text and nothing but the text: Part 1 - Standards for creating textual information in patent documents and general search implications," World Patent Information, Elsevier, vol. 32(1), pages 22-29, March.
    23. Jeffrey Kuhn & Kenneth Younge & Alan Marco, 2020. "Patent citations reexamined," RAND Journal of Economics, RAND Corporation, vol. 51(1), pages 109-132, March.
    24. Albert, M. B. & Avery, D. & Narin, F. & McAllister, P., 1991. "Direct validation of citation counts as indicators of industrially important patents," Research Policy, Elsevier, vol. 20(3), pages 251-259, June.
    25. Marco Corsino & Myriam Mariani & Salvatore Torrisi, 2019. "Firm strategic behavior and the measurement of knowledge flows with patent citations," Strategic Management Journal, Wiley Blackwell, vol. 40(7), pages 1040-1069, July.
    26. William R. Kerr, 2008. "Ethnic Scientific Communities and International Technology Diffusion," The Review of Economics and Statistics, MIT Press, vol. 90(3), pages 518-537, August.
    27. Adam B. Jaffe & Gaétan de Rassenfosse, 2017. "Patent citation data in social science research: Overview and best practices," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 68(6), pages 1360-1374, June.
    28. Rajshree Agarwal & Martin Ganco & Rosemarie H. Ziedonis, 2009. "Reputations for toughness in patent enforcement: implications for knowledge spillovers via inventor mobility," Strategic Management Journal, Wiley Blackwell, vol. 30(13), pages 1349-1374, December.
    29. Prithwiraj Choudhury & Evan Starr & Rajshree Agarwal, 2020. "Machine learning and human capital complementarities: Experimental evidence on bias mitigation," Strategic Management Journal, Wiley Blackwell, vol. 41(8), pages 1381-1411, August.
    30. Manuel Trajtenberg & Adam B. Jaffe & Michael S. Fogarty, 2000. "Knowledge Spillovers and Patent Citations: Evidence from a Survey of Inventors," American Economic Review, American Economic Association, vol. 90(2), pages 215-218, May.
    31. Stefano Breschi & Francesco Lissoni, 2009. "Mobility of skilled workers and co-invention networks: an anatomy of localized knowledge flows," Journal of Economic Geography, Oxford University Press, vol. 9(4), pages 439-468, July.
    32. Righi, Cesare & Simcoe, Timothy, 2019. "Patent examiner specialization," Research Policy, Elsevier, vol. 48(1), pages 137-148.
    33. Audretsch, David B, 1998. "Agglomeration and the Location of Innovative Activity," Oxford Review of Economic Policy, Oxford University Press and Oxford Review of Economic Policy Limited, vol. 14(2), pages 18-29, Summer.
    34. Jonathan Eaton & Samuel Kortum, 1996. "Measuring Technology Diffusion and the International Sources of Growth," Eastern Economic Journal, Eastern Economic Association, vol. 22(4), pages 401-410, Fall.
    35. Matt Marx & Aaron Fuegi, 2020. "Reliance on Science by Inventors: Hybrid Extraction of In-text Patent-to-Article Citations," NBER Working Papers 27987, National Bureau of Economic Research, Inc.
    36. Jean O. Lanjouw & Mark Schankerman, 2004. "Patent Quality and Research Productivity: Measuring Innovation with Multiple Indicators," Economic Journal, Royal Economic Society, vol. 114(495), pages 441-465, April.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Antonin Bergeaud & Cyril Verluise, 2022. "The rise of China's technological power: the perspective from frontier technologies," CEP Discussion Papers dp1876, Centre for Economic Performance, LSE.
    2. Antonin Bergeaud & Arthur Guillouzouic & Emeric Henry & Clement Malgouyres, 2022. "From public labs to private firms: magnitude and channels of R&D spillovers," POID Working Papers 041, Centre for Economic Performance, LSE.
    3. Higham, Kyle & Contisciani, Martina & De Bacco, Caterina, 2022. "Multilayer patent citation networks: A comprehensive analytical framework for studying explicit technological relationships," Technological Forecasting and Social Change, Elsevier, vol. 179(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Higham, Kyle & de Rassenfosse, Gaétan & Jaffe, Adam B., 2021. "Patent Quality: Towards a Systematic Framework for Analysis and Measurement," Research Policy, Elsevier, vol. 50(4).
    2. Adam B. Jaffe & Gaétan de Rassenfosse, 2017. "Patent citation data in social science research: Overview and best practices," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 68(6), pages 1360-1374, June.
    3. Arts, Sam & Hou, Jianan & Gomez, Juan Carlos, 2021. "Natural language processing to identify the creation and impact of new technologies in patent text: Code, data, and new measures," Research Policy, Elsevier, vol. 50(2).
    4. Manuel Acosta & Daniel Coronado & Esther Ferrándiz & Manuel Jiménez, 2022. "Effects of knowledge spillovers between competitors on patent quality: what patent citations reveal about a global duopoly," The Journal of Technology Transfer, Springer, vol. 47(5), pages 1451-1487, October.
    5. Fernández, Ana María & Ferrándiz, Esther & Medina, Jennifer, 2022. "The diffusion of energy technologies. Evidence from renewable, fossil, and nuclear energy patents," Technological Forecasting and Social Change, Elsevier, vol. 178(C).
    6. Hur, Wonchang & Oh, Junbyoung, 2021. "A man is known by the company he keeps?: A structural relationship between backward citation and forward citation of patents," Research Policy, Elsevier, vol. 50(1).
    7. Ahmad Barirani & Bruno Agard & Catherine Beaudry, 2013. "Discovering and assessing fields of expertise in nanomedicine: a patent co-citation network perspective," Scientometrics, Springer;Akadémiai Kiadó, vol. 94(3), pages 1111-1136, March.
    8. Jurriën Bakker & Dennis Verhoeven & Lin Zhang & Bart Van Looy, 2016. "Patent citation indicators: One size fits all?," Scientometrics, Springer;Akadémiai Kiadó, vol. 106(1), pages 187-211, January.
    9. Mariani, Manuel Sebastian & Medo, Matúš & Lafond, François, 2019. "Early identification of important patents: Design and validation of citation network metrics," Technological Forecasting and Social Change, Elsevier, vol. 146(C), pages 644-654.
    10. Satoshi Yasukawa & Shingo Kano, 2014. "Validating the usefulness of examiners’ forward citations from the viewpoint of applicants’ self-selection during the patent application procedure," Scientometrics, Springer;Akadémiai Kiadó, vol. 99(3), pages 895-909, June.
    11. Elena M. Tur & Evangelos Bourelos & Maureen McKelvey, 2022. "The case of sleeping beauties in nanotechnology: a study of potential breakthrough inventions in emerging technologies," The Annals of Regional Science, Springer;Western Regional Science Association, vol. 69(3), pages 683-708, December.
    12. Petra Moser & Joerg Ohmstedt & Paul W. Rhode, 2018. "Patent Citations—An Analysis of Quality Differences and Citing Practices in Hybrid Corn," Management Science, INFORMS, vol. 64(4), pages 1926-1940, April.
    13. Jee, Su Jung & Kwon, Minji & Ha, Jung Moon & Sohn, So Young, 2019. "Exploring the forward citation patterns of patents based on the evolution of technology fields," Journal of Informetrics, Elsevier, vol. 13(4).
    14. Manajit Chakraborty & Maksym Byshkin & Fabio Crestani, 2020. "Patent citation network analysis: A perspective from descriptive statistics and ERGMs," PLOS ONE, Public Library of Science, vol. 15(12), pages 1-28, December.
    15. Emanuele Bacchiocchi & Fabio Montobbio, 2010. "International Knowledge Diffusion and Home‐bias Effect: Do USPTO and EPO Patent Citations Tell the Same Story?," Scandinavian Journal of Economics, Wiley Blackwell, vol. 112(3), pages 441-470, September.
    16. Diemer, Andreas & Regan, Tanner, 2022. "No inventor is an island: Social connectedness and the geography of knowledge flows in the US," Research Policy, Elsevier, vol. 51(2).
    17. Ren, Haiying & Zhao, Yuhui, 2021. "Technology opportunity discovery based on constructing, evaluating, and searching knowledge networks," Technovation, Elsevier, vol. 101(C).
    18. Montobbio, Fabio & Sterzi, Valerio, 2013. "The Globalization of Technology in Emerging Markets: A Gravity Model on the Determinants of International Patent Collaborations," World Development, Elsevier, vol. 44(C), pages 281-299.
    19. Drivas, Kyriakos & Economidou, Claire & Karamanis, Dimitrios & Sanders, Mark, 2020. "Mobility of highly skilled individuals and local innovation activity," Technological Forecasting and Social Change, Elsevier, vol. 158(C).
    20. Ron Boschma & Ernest Miguelez & Rosina Moreno & Diego B. Ocampo-Corrales, 2021. "Technological breakthroughs in European regions: the role of related and unrelated combinations," Papers in Evolutionary Economic Geography (PEEG) 2118, Utrecht University, Department of Human Geography and Spatial Planning, Group Economic Geography, revised Jun 2021.

    More about this item

    JEL classification:

    • C81 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Methodology for Collecting, Estimating, and Organizing Microeconomic Data; Data Access
    • O30 - Economic Development, Innovation, Technological Change, and Growth - - Innovation; Research and Development; Technological Change; Intellectual Property Rights - - - General

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:osf:socarx:x78ys. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: OSF (email available below). General contact details of provider: https://arabixiv.org .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.