IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v107y2016i2d10.1007_s11192-016-1867-8.html
   My bibliography  Save this article

Do Scopus and WoS correct “old” omitted citations?

Author

Listed:
  • Fiorenzo Franceschini

    (Politecnico di Torino)

  • Domenico Maisano

    (Politecnico di Torino)

  • Luca Mastrogiacomo

    (Politecnico di Torino)

Abstract

Omitted citations—i.e., missing links between a cited paper and the corresponding citing papers—are a consequence of several bibliometric-database errors. To reduce these errors, databases may undertake two actions: (1) improving the control of the (new) papers to be indexed, i.e., limiting the introduction of “new” dirty data, and (2) detecting and correcting errors in the papers already indexed by the database, i.e., cleaning “old” dirty data. The latter action is probably more complicated, as it requires the application of suitable error-detection procedures to a huge amount of data. Based on an extensive sample of scientific papers in the Engineering-Manufacturing field, this study focuses on old dirty data in the Scopus and WoS databases. To this purpose, a recent automated algorithm for estimating the omitted-citation rate of databases is applied to the same sample of papers, but in three different-time sessions. A database’s ability to clean the old dirty data is evaluated considering the variations in the omitted-citation rate from session to session. The major outcomes of this study are that: (1) both databases slowly correct old omitted citations, and (2) a small portion of initially corrected citations can surprisingly come off from databases over time.

Suggested Citation

  • Fiorenzo Franceschini & Domenico Maisano & Luca Mastrogiacomo, 2016. "Do Scopus and WoS correct “old” omitted citations?," Scientometrics, Springer;Akadémiai Kiadó, vol. 107(2), pages 321-335, May.
  • Handle: RePEc:spr:scient:v:107:y:2016:i:2:d:10.1007_s11192-016-1867-8
    DOI: 10.1007/s11192-016-1867-8
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-016-1867-8
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-016-1867-8?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Schenker N. & Gentleman J. F., 2001. "On Judging the Significance of Differences by Examining the Overlap Between Confidence Intervals," The American Statistician, American Statistical Association, vol. 55, pages 182-186, August.
    2. Franceschini, Fiorenzo & Maisano, Domenico & Mastrogiacomo, Luca, 2016. "The museum of errors/horrors in Scopus," Journal of Informetrics, Elsevier, vol. 10(1), pages 174-182.
    3. Franceschini, Fiorenzo & Maisano, Domenico & Mastrogiacomo, Luca, 2014. "Scientific journal publishers and omitted citations in bibliometric databases: Any relationship?," Journal of Informetrics, Elsevier, vol. 8(3), pages 751-765.
    4. Marlies Olensky & Marion Schmidt & Nees Jan Eck, 2016. "Evaluation of the citation matching algorithms of CWTS and iFQ in comparison to the Web of science," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 67(10), pages 2550-2564, October.
    5. Valderrama-Zurián, Juan-Carlos & Aguilar-Moya, Remedios & Melero-Fuentes, David & Aleixandre-Benavent, Rafael, 2015. "A systematic analysis of duplicate records in Scopus," Journal of Informetrics, Elsevier, vol. 9(3), pages 570-576.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Waltman, Ludo, 2016. "A review of the literature on citation impact indicators," Journal of Informetrics, Elsevier, vol. 10(2), pages 365-391.
    2. Thelwall, Mike, 2018. "Microsoft Academic automatic document searches: Accuracy for journal articles and suitability for citation analysis," Journal of Informetrics, Elsevier, vol. 12(1), pages 1-9.
    3. Mariana-Daniela González-Zamar & Emilio Abad-Segura & Eloy López-Meneses & José Gómez-Galán, 2020. "Managing ICT for Sustainable Education: Research Analysis in the Context of Higher Education," Sustainability, MDPI, vol. 12(19), pages 1-25, October.
    4. Shirley Ainsworth & Jane M. Russell, 2018. "Has hosting on science direct improved the visibility of Latin American scholarly journals? A preliminary analysis of data quality," Scientometrics, Springer;Akadémiai Kiadó, vol. 115(3), pages 1463-1484, June.
    5. Franceschini, Fiorenzo & Maisano, Domenico & Mastrogiacomo, Luca, 2016. "Empirical analysis and classification of database errors in Scopus and Web of Science," Journal of Informetrics, Elsevier, vol. 10(4), pages 933-953.
    6. Houqiang Yu & Xueting Cao & Tingting Xiao & Zhenyi Yang, 2020. "How accurate are policy document mentions? A first look at the role of altmetrics database," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(2), pages 1517-1540, November.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Franceschini, Fiorenzo & Maisano, Domenico & Mastrogiacomo, Luca, 2016. "Empirical analysis and classification of database errors in Scopus and Web of Science," Journal of Informetrics, Elsevier, vol. 10(4), pages 933-953.
    2. Shirley Ainsworth & Jane M. Russell, 2018. "Has hosting on science direct improved the visibility of Latin American scholarly journals? A preliminary analysis of data quality," Scientometrics, Springer;Akadémiai Kiadó, vol. 115(3), pages 1463-1484, June.
    3. Waltman, Ludo, 2016. "A review of the literature on citation impact indicators," Journal of Informetrics, Elsevier, vol. 10(2), pages 365-391.
    4. Shuo Xu & Liyuan Hao & Xin An & Dongsheng Zhai & Hongshen Pang, 2019. "Types of DOI errors of cited references in Web of Science with a cleaning method," Scientometrics, Springer;Akadémiai Kiadó, vol. 120(3), pages 1427-1437, September.
    5. Thelwall, Mike, 2018. "Microsoft Academic automatic document searches: Accuracy for journal articles and suitability for citation analysis," Journal of Informetrics, Elsevier, vol. 12(1), pages 1-9.
    6. Sergio Copiello, 2019. "The open access citation premium may depend on the openness and inclusiveness of the indexing database, but the relationship is controversial because it is ambiguous where the open access boundary lie," Scientometrics, Springer;Akadémiai Kiadó, vol. 121(2), pages 995-1018, November.
    7. Houqiang Yu & Xueting Cao & Tingting Xiao & Zhenyi Yang, 2020. "How accurate are policy document mentions? A first look at the role of altmetrics database," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(2), pages 1517-1540, November.
    8. Alessia Cioffi & Sara Coppini & Arcangelo Massari & Arianna Moretti & Silvio Peroni & Cristian Santini & Nooshin Shahidzadeh Asadi, 2022. "Identifying and correcting invalid citations due to DOI errors in Crossref data," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(6), pages 3593-3612, June.
    9. Franceschini, Fiorenzo & Maisano, Domenico & Mastrogiacomo, Luca, 2016. "The museum of errors/horrors in Scopus," Journal of Informetrics, Elsevier, vol. 10(1), pages 174-182.
    10. Fiorenzo Franceschini & Domenico Maisano & Luca Mastrogiacomo, 2015. "Influence of omitted citations on the bibliometric statistics of the major Manufacturing journals," Scientometrics, Springer;Akadémiai Kiadó, vol. 103(3), pages 1083-1122, June.
    11. Núria Bautista-Puig & Jorge Mañana-Rodríguez & Antonio Eleazar Serrano-López, 2021. "Role taxonomy of green and sustainable science and technology journals: exportation, importation, specialization and interdisciplinarity," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(5), pages 3871-3892, May.
    12. Mikhail Rogov & Céline Rozenblat, 2018. "Urban Resilience Discourse Analysis: Towards a Multi-Level Approach to Cities," Sustainability, MDPI, vol. 10(12), pages 1-21, November.
    13. Domínguez-Torreiro, Marcos & Soliño, Mario, 2011. "Provided and perceived status quo in choice experiments: Implications for valuing the outputs of multifunctional rural areas," Ecological Economics, Elsevier, vol. 70(12), pages 2523-2531.
    14. Weishu Liu & Meiting Huang & Haifeng Wang, 2021. "Same journal but different numbers of published records indexed in Scopus and Web of Science Core Collection: causes, consequences, and solutions," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(5), pages 4541-4550, May.
    15. Ignacio Rodríguez-Rodríguez & José-Víctor Rodríguez & Niloofar Shirvanizadeh & Andrés Ortiz & Domingo-Javier Pardo-Quiles, 2021. "Applications of Artificial Intelligence, Machine Learning, Big Data and the Internet of Things to the COVID-19 Pandemic: A Scientometric Review Using Text Mining," IJERPH, MDPI, vol. 18(16), pages 1-29, August.
    16. Olugbenga Oladinrin & Kasun Gomis & Wadu Mesthrige Jayantha & Lovelin Obi & Muhammad Qasim Rana, 2021. "Scientometric Analysis of Global Scientific Literature on Aging in Place," IJERPH, MDPI, vol. 18(23), pages 1-16, November.
    17. Christophe Boudry & Ghislaine Chartron, 2017. "Availability of digital object identifiers in publications archived by PubMed," Scientometrics, Springer;Akadémiai Kiadó, vol. 110(3), pages 1453-1469, March.
    18. Sidra Salam & Aslan Amat Senin, 2022. "A Bibliometric Study on Innovative Behavior Literature (1961–2019)," SAGE Open, , vol. 12(3), pages 21582440221, July.
    19. Tim Goedemé & Karel Van den Bosch & Lina Salanauskaite & Gerlinde Verbist, 2013. "Testing the Statistical Significance of Microsimulation Results: Often Easier than You Think. A Technical Note," ImPRovE Working Papers 13/10, Herman Deleeck Centre for Social Policy, University of Antwerp.
    20. Yao Xiao & Chengzhen Meng & Suli Huang & Yanran Duan & Gang Liu & Shuyuan Yu & Ji Peng & Jinquan Cheng & Ping Yin, 2021. "Short-Term Effect of Temperature Change on Non-Accidental Mortality in Shenzhen, China," IJERPH, MDPI, vol. 18(16), pages 1-14, August.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:107:y:2016:i:2:d:10.1007_s11192-016-1867-8. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.