IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v128y2023i8d10.1007_s11192-023-04774-7.html
   My bibliography  Save this article

A hybrid strategy to extract metadata from scholarly articles by utilizing support vector machine and heuristics

Author

Listed:
  • Muhammad Waqas

    (Capital University of Science and Technology)

  • Nadeem Anjum

    (Capital University of Science and Technology)

  • Muhammad Tanvir Afzal

    (Shifa Tameer-e-Millat University)

Abstract

The immense growth in online research publications has attracted the research community to extract valuable information from scientific resources by exploring online digital libraries and publishers’ websites. The metadata stored in a machine comprehendible form can facilitate a precise search to enlist most related articles by applying semantic queries to the document’s metadata and the structural elements. The online search engines and digital libraries offer only keyword-based search on full-body text, which creates excessive results. The research community in recent years has adopted different approaches to extract structural information from research documents. We have distributed the content of an article into two logical layouts and metadata levels. This strategy has given our technique an advantage over the state-of-the-art (SOTA) extracting metadata with diversified publication styles. The experimental results have revealed that the proposed approach has shown a significant gain in performance of 20.26% to 27.14%.

Suggested Citation

  • Muhammad Waqas & Nadeem Anjum & Muhammad Tanvir Afzal, 2023. "A hybrid strategy to extract metadata from scholarly articles by utilizing support vector machine and heuristics," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(8), pages 4349-4382, August.
  • Handle: RePEc:spr:scient:v:128:y:2023:i:8:d:10.1007_s11192-023-04774-7
    DOI: 10.1007/s11192-023-04774-7
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-023-04774-7
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-023-04774-7?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:128:y:2023:i:8:d:10.1007_s11192-023-04774-7. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.