
A deep learning based method benefiting from characteristics of patents for semantic relation classification

Authors

  • Chen, Liang
  • Xu, Shuo
  • Zhu, Lijun
  • Zhang, Jing
  • Yang, Guancan
  • Xu, Haiyun

Abstract

Deep learning has become an important technique for semantic relation classification in patent texts. Previous studies simply borrowed models designed for generic texts and applied them to patent texts with the model structure unchanged. Because patent texts differ significantly from generic ones, the performance of these models drops dramatically on patent texts. To highlight the distinct characteristics of patent texts, seven annotated corpora from different fields are comprehensively compared in terms of several indicators of linguistic characteristics. Then, a deep learning based method is proposed to benefit from these characteristics. Our method exploits information from other similar entity pairs as well as information from the sentences mentioning a focal entity pair. The latter follows conventional practice, and the former stems from our observation: the stronger the connection between two entity pairs, the more likely they are to belong to the same relation type. To measure the connection between two entity pairs quantitatively, a similarity indicator based on association rules is proposed. Extensive experiments on the TFH-2020 and ChemProt corpora demonstrate that our method for semantic relation classification is capable of benefiting from the characteristics of patent texts.
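
The abstract does not give the formula of the association-rule-based similarity indicator, so the following Python is only an illustrative sketch of the general idea, not the authors' method. It assumes that each "transaction" is the set of entity pairs mentioned together in one document, that the connection between two pairs can be approximated by the mean of the two rule confidences, and that a focal pair can borrow a relation type from labeled pairs by similarity-weighted voting. The function names, entity pairs, and relation labels are all hypothetical.

```python
from collections import defaultdict


def rule_confidence(transactions, antecedent, consequent):
    """confidence(antecedent -> consequent) = count(both) / count(antecedent)."""
    n_ante = sum(1 for t in transactions if antecedent in t)
    if n_ante == 0:
        return 0.0
    n_both = sum(1 for t in transactions if antecedent in t and consequent in t)
    return n_both / n_ante


def pair_similarity(transactions, pair_a, pair_b):
    """Assumed symmetric indicator: mean of the two directed confidences."""
    return 0.5 * (rule_confidence(transactions, pair_a, pair_b)
                  + rule_confidence(transactions, pair_b, pair_a))


def vote_relation(transactions, focal_pair, labeled_pairs):
    """Pick the relation type with the largest similarity-weighted vote."""
    scores = defaultdict(float)
    for pair, relation in labeled_pairs.items():
        if pair != focal_pair:
            scores[relation] += pair_similarity(transactions, focal_pair, pair)
    return max(scores, key=scores.get) if scores else None


# Made-up toy data: each set holds the entity pairs found in one document.
HS = ("head", "slider")
SS = ("slider", "suspension")
DS = ("disk", "spindle")
transactions = [{HS, SS}, {HS, SS}, {HS, DS}, {DS}]

# Hypothetical relation labels for the already-classified pairs.
labeled_pairs = {SS: "part-of", DS: "attached-to"}

sim_ss = pair_similarity(transactions, HS, SS)
sim_ds = pair_similarity(transactions, HS, DS)
predicted = vote_relation(transactions, HS, labeled_pairs)
```

In this toy data the focal pair co-occurs more consistently with ("slider", "suspension") than with ("disk", "spindle"), so the vote favors the relation type of the former. In the method described above, such evidence complements the sentence-level information rather than replacing it.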

Suggested Citation

  • Chen, Liang & Xu, Shuo & Zhu, Lijun & Zhang, Jing & Yang, Guancan & Xu, Haiyun, 2022. "A deep learning based method benefiting from characteristics of patents for semantic relation classification," Journal of Informetrics, Elsevier, vol. 16(3).
  • Handle: RePEc:eee:infome:v:16:y:2022:i:3:s1751157722000645
    DOI: 10.1016/j.joi.2022.101312

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S1751157722000645
    Download Restriction: Full text for ScienceDirect subscribers only

    Citations

    Cited by:

    1. Hur, Wonchang, 2024. "Entropy, heterogeneity, and their impact on technology progress," Journal of Informetrics, Elsevier, vol. 18(2).
    2. Wang, Zhenhua & Ren, Ming & Gao, Dong & Li, Zhuang, 2023. "A Zipf's law-based text generation approach for addressing imbalance in entity extraction," Journal of Informetrics, Elsevier, vol. 17(4).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. An, Xin & Li, Jinghong & Xu, Shuo & Chen, Liang & Sun, Wei, 2021. "An improved patent similarity measurement based on entities and semantic relations," Journal of Informetrics, Elsevier, vol. 15(2).
    2. Liang Chen & Shuo Xu & Lijun Zhu & Jing Zhang & Xiaoping Lei & Guancan Yang, 2020. "A deep learning based method for extracting semantic information from patent documents," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(1), pages 289-312, October.
    3. Liu, Zhenfeng & Feng, Jian & Uden, Lorna, 2023. "Technology opportunity analysis using hierarchical semantic networks and dual link prediction," Technovation, Elsevier, vol. 128(C).
    4. Shuo Xu & Ling Li & Xin An & Liyuan Hao & Guancan Yang, 2021. "An approach for detecting the commonality and specialty between scientific publications and patents," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(9), pages 7445-7475, September.
    5. Mun, Changbae & Yoon, Sejun & Raghavan, Nagarajan & Hwang, Dongwook & Basnet, Subarna & Park, Hyunseok, 2021. "Function score-based technological trend analysis," Technovation, Elsevier, vol. 101(C).
    6. Jiang, Cuiqing & Zhou, Yiru & Chen, Bo, 2023. "Mining semantic features in patent text for financial distress prediction," Technological Forecasting and Social Change, Elsevier, vol. 190(C).
    7. Ren, Haiying & Zhao, Yuhui, 2021. "Technology opportunity discovery based on constructing, evaluating, and searching knowledge networks," Technovation, Elsevier, vol. 101(C).
    8. Xu, Shuo & Hao, Liyuan & Yang, Guancan & Lu, Kun & An, Xin, 2021. "A topic models based framework for detecting and forecasting emerging technologies," Technological Forecasting and Social Change, Elsevier, vol. 162(C).
    9. An, Jaehyeong & Kim, Kyuwoong & Mortara, Letizia & Lee, Sungjoo, 2018. "Deriving technology intelligence from patents: Preposition-based semantic analysis," Journal of Informetrics, Elsevier, vol. 12(1), pages 217-236.
    10. Vicente-Gomila, J.M. & Artacho-Ramírez, M.A. & Ting, Ma & Porter, A.L., 2021. "Combining tech mining and semantic TRIZ for technology assessment: Dye-sensitized solar cell as a case," Technological Forecasting and Social Change, Elsevier, vol. 169(C).
    11. Chao Yang & Donghua Zhu & Xuefeng Wang & Yi Zhang & Guangquan Zhang & Jie Lu, 2017. "Requirement-oriented core technological components’ identification based on SAO analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 112(3), pages 1229-1248, September.
    12. Jaewoong Choi & Jiho Lee & Janghyeok Yoon & Sion Jang & Jaeyoung Kim & Sungchul Choi, 2022. "A two-stage deep learning-based system for patent citation recommendation," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(11), pages 6615-6636, November.
    13. Yang, Chao & Huang, Cui & Su, Jun, 2018. "An improved SAO network-based method for technology trend analysis: A case study of graphene," Journal of Informetrics, Elsevier, vol. 12(1), pages 271-286.
    14. Teng, Hao & Wang, Nan & Zhao, Hongyu & Hu, Yingtong & Jin, Haitao, 2024. "Enhancing semantic text similarity with functional semantic knowledge (FOP) in patents," Journal of Informetrics, Elsevier, vol. 18(1).
    15. Kyuwoong Kim & Kyeongmin Park & Sungjoo Lee, 2019. "Investigating technology opportunities: the use of SAOx analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 118(1), pages 45-70, January.
    16. Myeongji Oh & Hyejin Jang & Sunhye Kim & Byungun Yoon, 2023. "Main path analysis for technological development using SAO structure and DEMATEL based on keyword causality," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(4), pages 2079-2104, April.
    17. Li, Xin & Xie, Qianqian & Jiang, Jiaojiao & Zhou, Yuan & Huang, Lucheng, 2019. "Identifying and monitoring the development trends of emerging technologies using patent analysis and Twitter data mining: The case of perovskite solar cell technology," Technological Forecasting and Social Change, Elsevier, vol. 146(C), pages 687-705.
    18. Anqi Ma & Yu Liu & Xiujuan Xu & Tao Dong, 2021. "A deep-learning based citation count prediction model with paper metadata semantic features," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(8), pages 6803-6823, August.
    19. Lee, Changyong, 2021. "A review of data analytics in technological forecasting," Technological Forecasting and Social Change, Elsevier, vol. 166(C).
    20. Zhou, Xiao & Huang, Lu & Porter, Alan & Vicente-Gomila, Jose M., 2019. "Tracing the system transformations and innovation pathways of an emerging technology: Solid lipid nanoparticles," Technological Forecasting and Social Change, Elsevier, vol. 146(C), pages 785-794.
