IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v12y2024i9p1349-d1385504.html
   My bibliography  Save this article

A Graph-Based Keyword Extraction Method for Academic Literature Knowledge Graph Construction

Author

Listed:
  • Lin Zhang

    (School of Maritime Economics and Management, Dalian Maritime University, Dalian 116026, China)

  • Yanan Li

    (School of Maritime Economics and Management, Dalian Maritime University, Dalian 116026, China)

  • Qinru Li

    (School of Maritime Economics and Management, Dalian Maritime University, Dalian 116026, China)

Abstract

In this paper, we construct an academic literature knowledge graph based on the relationship between documents to facilitate the storage and research of academic literature data. Keywords are an important type of node in the knowledge graph. To solve the problem that there are no keywords in some documents for several reasons in the process of knowledge graph construction, an improved keyword extraction algorithm called TP-CoGlo-TextRank is proposed by using word frequency, position, word co-occurrence frequency, and a word embedding model. By combining the word frequency and position in the document, the importance of words is distinguished. By introducing the GloVe word-embedding model, which brings the external knowledge of documents into the TextRank algorithm, and combining the internal word co-occurrence frequency in the documents, the word-adjacency relationship is transferred non-uniformly. Finally, the words with the highest scores are combined into phrases if they are adjacent in the original text. The validity of the TP-CoGlo-TextRank algorithm is verified by experiments. On this basis, the Neo4j graph database is used to store and display the academic literature knowledge graph, to provide data support for research tasks such as text clustering, automatic summarization, and question-answering systems.

Suggested Citation

  • Lin Zhang & Yanan Li & Qinru Li, 2024. "A Graph-Based Keyword Extraction Method for Academic Literature Knowledge Graph Construction," Mathematics, MDPI, vol. 12(9), pages 1-25, April.
  • Handle: RePEc:gam:jmathe:v:12:y:2024:i:9:p:1349-:d:1385504
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/12/9/1349/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/12/9/1349/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:12:y:2024:i:9:p:1349-:d:1385504. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.