IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0234618.html
   My bibliography  Save this article

Measuring the drafting alignment of patent documents using text mining

Author

Listed:
  • Davit Khachatryan
  • Brigitte Muehlmann

Abstract

How would an inventor, entrepreneur, investor, or patent examiner quantify the extent to which the inventive claims listed in a patent document align with patent specification? Since a specification that is poorly aligned with the inventive claims can render an invention unpatentable and can invalidate an already issued patent, an effective measure of alignment is necessary. We define a novel measure of drafting alignment using Latent Dirichlet Allocation (LDA). The measure is defined for each patent document by first identifying the latent topics underlying the claims and the specification, and then using the Hellinger distance to find the proximity between the topical coverages. We demonstrate the use of the novel measure for data processing patent documents related to cybersecurity. The properties of the proposed measure are further investigated using exploratory data analysis, and it is shown that generally alignment is positively associated with the prior patenting efforts as well as the tendency to include figures in a document.

Suggested Citation

  • Davit Khachatryan & Brigitte Muehlmann, 2020. "Measuring the drafting alignment of patent documents using text mining," PLOS ONE, Public Library of Science, vol. 15(7), pages 1-20, July.
  • Handle: RePEc:plo:pone00:0234618
    DOI: 10.1371/journal.pone.0234618
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0234618
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0234618&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0234618?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Adam B. Jaffe & Josh Lerner, 2006. "Innovation and its Discontents," NBER Chapters, in: Innovation Policy and the Economy, Volume 6, pages 27-66, National Bureau of Economic Research, Inc.
    2. Graham, Stuart J. H. & Hall, Bronwyn H. & Harhoff, Dietmar & Mowery, David C., 2002. "Post-Issue Patent "Quality Control": A Comparative Study of US Patent Re-Examinations and European Patent Oppositions," Department of Economics, Working Paper Series qt2qt097bd, Department of Economics, Institute for Business and Economic Research, UC Berkeley.
    3. Grün, Bettina & Hornik, Kurt, 2011. "topicmodels: An R Package for Fitting Topic Models," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 40(i13).
    4. Strumsky, Deborah & Lobo, José, 2015. "Identifying the sources of technological novelty in the process of invention," Research Policy, Elsevier, vol. 44(8), pages 1445-1461.
    5. Stuart J.H. Graham & Alan C. Marco & Richard Miller, 2018. "The USPTO Patent Examination Research Dataset: A window on patent processing," Journal of Economics & Management Strategy, Wiley Blackwell, vol. 27(3), pages 554-578, September.
    6. Joan Farre-Mensa & Deepak Hegde & Alexander Ljungqvist, 2016. "The Bright Side of Patents," NBER Working Papers 21959, National Bureau of Economic Research, Inc.
    7. Lemley, Mark, 2000. "Rational Ignorance at the Patent Office," Berkeley Olin Program in Law & Economics, Working Paper Series qt1tc166q2, Berkeley Olin Program in Law & Economics.
    8. Ronald J. Mann & Marian Underweiser, 2012. "A New Look at Patent Quality: Relating Patent Prosecution to Validity," Journal of Empirical Legal Studies, John Wiley & Sons, vol. 9(1), pages 1-32, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Sangsung Park & Seongyong Choi & Sunghae Jun, 2021. "Bayesian Structure Learning and Visualization for Technology Analysis," Sustainability, MDPI, vol. 13(14), pages 1-16, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Davit Khachatryan & Brigitte Muehlmann, 2017. "Determinants of successful patent applications to combat financial fraud," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(3), pages 1353-1383, June.
    2. deGrazia, Charles A.W. & Pairolero, Nicholas A. & Teodorescu, Mike H.M., 2021. "Examination incentives, learning, and patent office outcomes: The use of examiner’s amendments at the USPTO," Research Policy, Elsevier, vol. 50(10).
    3. Burke, Paul F. & Reitzig, Markus, 2007. "Measuring patent assessment quality--Analyzing the degree and kind of (in)consistency in patent offices' decision making," Research Policy, Elsevier, vol. 36(9), pages 1404-1430, November.
    4. Lei, Zhen & Wright, Brian D., 2017. "Why weak patents? Testing the examiner ignorance hypothesis," Journal of Public Economics, Elsevier, vol. 148(C), pages 43-56.
    5. Gaétan de Rassenfosse & Reza Hosseini, 2020. "Discrimination against foreigners in the U.S. patent system," Journal of International Business Policy, Palgrave Macmillan, vol. 3(4), pages 349-366, December.
    6. Yutaka Niidome, 2015. "The Factors Related to the Minimum and Maximum Survival of Patents against Challenges to Validity," GRIPS Discussion Papers 14-23, National Graduate Institute for Policy Studies.
    7. Sun, Zhen & Wright, Brian D., 2022. "Citations backward and forward: Insights into the patent examiner's role," Research Policy, Elsevier, vol. 51(7).
    8. Gaétan de Rassenfosse & Reza Hosseini, 0. "Discrimination against foreigners in the U.S. patent system," Journal of International Business Policy, Palgrave Macmillan, vol. 0, pages 1-18.
    9. Yutaka Niidome, 2017. "The relation of patent description and examination with validity: an empirical study," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(1), pages 159-183, April.
    10. Filitz, Rainer & Henkel, Joachim & Tether, Bruce S., 2015. "Protecting aesthetic innovations? An exploration of the use of registered community designs," Research Policy, Elsevier, vol. 44(6), pages 1192-1206.
    11. Joachim Henkel & Hans Zischka, 2019. "How many patents are truly valid? Extent, causes, and remedies for latent patent invalidity," European Journal of Law and Economics, Springer, vol. 48(2), pages 195-239, October.
    12. Ozgur Aydogmus & Erkan Gürpinar, 2022. "Science, Technology and Institutional Change in Knowledge Production: An Evolutionary Game Theoretic Framework," Dynamic Games and Applications, Springer, vol. 12(4), pages 1163-1188, December.
    13. Schankerman, Mark & Lanjouw, Jean, 2001. "Enforcing Intellectual Property Rights," CEPR Discussion Papers 3093, C.E.P.R. Discussion Papers.
    14. Sandra Wankmüller, 2023. "A comparison of approaches for imbalanced classification problems in the context of retrieving relevant documents for an analysis," Journal of Computational Social Science, Springer, vol. 6(1), pages 91-163, April.
    15. Dietmar Harhoff & Georg von Graevenitz & Stefan Wagner, 2016. "Conflict Resolution, Public Goods, and Patent Thickets," Management Science, INFORMS, vol. 62(3), pages 704-721, March.
    16. Verhoeven, Dennis & Bakker, Jurriën & Veugelers, Reinhilde, 2016. "Measuring technological novelty with patent-based indicators," Research Policy, Elsevier, vol. 45(3), pages 707-723.
    17. George A Shinkle & Jo-Ann Suchard, 2019. "Innovation in newly public firms: The influence of government grants, venture capital, and private equity," Australian Journal of Management, Australian School of Business, vol. 44(2), pages 248-281, May.
    18. Arsenyan, Jbid & Mirowska, Agata & Piepenbrink, Anke, 2023. "Close encounters with the virtual kind: Defining a human-virtual agent coexistence framework," Technological Forecasting and Social Change, Elsevier, vol. 193(C).
    19. Manuel Acosta & Daniel Coronado & Esther Ferrándiz & Manuel Jiménez, 2022. "Effects of knowledge spillovers between competitors on patent quality: what patent citations reveal about a global duopoly," The Journal of Technology Transfer, Springer, vol. 47(5), pages 1451-1487, October.
    20. Hong Joo Lee & Hoyeon Oh, 2020. "A Study on the Deduction and Diffusion of Promising Artificial Intelligence Technology for Sustainable Industrial Development," Sustainability, MDPI, vol. 12(14), pages 1-15, July.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0234618. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.