IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v129y2024i7d10.1007_s11192-024-05073-5.html
   My bibliography  Save this article

Comparison of datasets citation coverage in Google Scholar, Web of Science, Scopus, Crossref, and DataCite

Author

Listed:
  • Irina Gerasimov

    (ADNET Systems, Inc
    NASA Goddard Space Flight Center
    Towson University)

  • Binita KC

    (ADNET Systems, Inc
    NASA Goddard Space Flight Center)

  • Armin Mehrabian

    (ADNET Systems, Inc
    NASA Goddard Space Flight Center)

  • James Acker

    (ADNET Systems, Inc
    NASA Goddard Space Flight Center)

  • Michael P. McGuire

    (Towson University)

Abstract

The rapid increase of Earth science data from remote sensing, models, and ground-based observations highlights an urgent need for effective data management practices. Data repositories track provenance and usage metrics which are crucial for ensuring data integrity and scientific reproducibility. Although the introduction of Digital Object Identifiers (DOIs) for datasets in the late 1990s has significantly aided in crediting creators and enhancing dataset discoverability (akin to traditional research citations), considerable challenges persist in establishing linkage of datasets used with scholarly documents. This study evaluates the citation coverage of datasets from NASA’s Earth Observing System Data and Information System (EOSDIS) across several major bibliographic sources ‒ namely Google Scholar (GS), Web of Science (WoS), Scopus, Crossref, and DataCite—which helps data managers in making informed decisions when selecting bibliographic sources. We provide a robust and comprehensive understanding of the citation landscape, crucial for advancing data management practices and advancing open science. Our study searched and analyzed temporal trends across the bibliographic sources for publications that cite approximately 11,000 DOIs associated with EOSDIS datasets, yielding 17,000 unique journal and conference articles, reports, and book records linked to 3,000 dataset DOIs. GS emerged as the most comprehensive source while Crossref lagged significantly behind the other major sources. Crossref’s record references revealed that the absence of dataset DOIs and shortcomings in the Crossref Event data interface likely contributed to its underperformance. Scopus initially outperformed WoS until 2020, after which WoS began to show superior performance. Overall, our study underscores the necessity of utilizing multiple bibliographic sources for citation analysis, particularly for exploring dataset-to-document connections.

Suggested Citation

  • Irina Gerasimov & Binita KC & Armin Mehrabian & James Acker & Michael P. McGuire, 2024. "Comparison of datasets citation coverage in Google Scholar, Web of Science, Scopus, Crossref, and DataCite," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(7), pages 3681-3704, July.
  • Handle: RePEc:spr:scient:v:129:y:2024:i:7:d:10.1007_s11192-024-05073-5
    DOI: 10.1007/s11192-024-05073-5
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-024-05073-5
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-024-05073-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:129:y:2024:i:7:d:10.1007_s11192-024-05073-5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.