IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v122y2020i3d10.1007_s11192-020-03371-2.html
   My bibliography  Save this article

The evolution of data science and big data research: A bibliometric analysis

Author

Listed:
  • Daphne R. Raban

    (University of Haifa)

  • Avishag Gordon

    (University of Haifa)

Abstract

In this study the evolution of Big Data (BD) and Data Science (DS) literatures and the relationship between the two are analyzed by bibliometric indicators that help establish the course taken by publications on these research areas before and after forming concepts. We observe a surge in BD publications along a gradual increase in DS publications. Interestingly, a new publications course emerges combining the BD and DS concepts. We evaluate the three literature streams using various bibliometric indicators including research areas and their origin, central journals, the countries producing and funding research and startup organizations, citation dynamics, dispersion and author commitment. We find that BD and DS have differing academic origin and different leading publications. Of the two terms, BD is more salient, possibly catalyzed by the strong acceptance of the pre-coordinated term by the research community, intensive citation activity, and also, we observe, by generous funding from Chinese sources. Overall, DS literature serves as a theory-base for BD publications.

Suggested Citation

  • Daphne R. Raban & Avishag Gordon, 2020. "The evolution of data science and big data research: A bibliometric analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 122(3), pages 1563-1581, March.
  • Handle: RePEc:spr:scient:v:122:y:2020:i:3:d:10.1007_s11192-020-03371-2
    DOI: 10.1007/s11192-020-03371-2
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-020-03371-2
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-020-03371-2?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. William S. Cleveland, 2001. "Data Science: an Action Plan for Expanding the Technical Areas of the Field of Statistics," International Statistical Review, International Statistical Institute, vol. 69(1), pages 21-26, April.
    2. Jiming Hu & Yin Zhang, 2017. "Discovering the interdisciplinary nature of Big Data research through social network analysis and visualization," Scientometrics, Springer;Akadémiai Kiadó, vol. 112(1), pages 91-109, July.
    3. Chris A. Mattmann, 2013. "A vision for data science," Nature, Nature, vol. 493(7433), pages 473-475, January.
    4. Wolfgang Glänzel & Bart Thijs, 2012. "Using ‘core documents’ for detecting and labelling new emerging topics," Scientometrics, Springer;Akadémiai Kiadó, vol. 91(2), pages 399-416, May.
    5. Diana Tal & Avishag Gordon, 2017. "Publication attributes of leadership: what do they mean?," Scientometrics, Springer;Akadémiai Kiadó, vol. 112(3), pages 1391-1402, September.
    6. Vivek Kumar Singh & Sumit Kumar Banshal & Khushboo Singhal & Ashraf Uddin, 2015. "Scientometric mapping of research on ‘Big Data’," Scientometrics, Springer;Akadémiai Kiadó, vol. 105(2), pages 727-741, November.
    7. Wolfgang Glänzel & Bart Thijs & Pei-Shan Chi, 2016. "The challenges to expand bibliometric studies from periodical literature to monographic literature with a new data source: the book citation index," Scientometrics, Springer;Akadémiai Kiadó, vol. 109(3), pages 2165-2179, December.
    8. Rons, Nadine, 2018. "Bibliometric approximation of a scientific specialty by combining key sources, title words, authors and references," Journal of Informetrics, Elsevier, vol. 12(1), pages 113-132.
    9. Avishag Gordon, 2007. "Transient and continuant authors in a research field: The case of terrorism," Scientometrics, Springer;Akadémiai Kiadó, vol. 72(2), pages 213-224, August.
    10. Gregorio González-Alcaide & Pedro Llorente & José M. Ramos, 2016. "Bibliometric indicators to identify emerging research fields: publications on mass gatherings," Scientometrics, Springer;Akadémiai Kiadó, vol. 109(2), pages 1283-1298, November.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Deming Lin & Tianhui Gong & Wenbin Liu & Martin Meyer, 2020. "An entropy-based measure for the evolution of h index research," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(3), pages 2283-2298, December.
    2. Yin Zhang & Dan Wu & Loni Hagen & Il‐Yeol Song & Javed Mostafa & Sam Oh & Theresa Anderson & Chirag Shah & Bradley Wade Bishop & Frank Hopfgartner & Kai Eckert & Lisa Federer & Jeffrey S. Saltz, 2023. "Data science curriculum in the iField," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 74(6), pages 641-662, June.
    3. Shalini R. Urs & Mohamed Minhaj, 2023. "Evolution of data science and its education in iSchools: An impressionistic study using curriculum analysis," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 74(6), pages 606-622, June.
    4. Tuba Bircan & Almila Alkim Akdag Salah, 2022. "A Bibliometric Analysis of the Use of Artificial Intelligence Technologies for Social Sciences," Mathematics, MDPI, vol. 10(23), pages 1-17, November.
    5. Wieslawa Gryncewicz & Monika Sitarska-Buba, 2021. "Leading Research by Institutions and Authors: A Modern Research Analysis," European Research Studies Journal, European Research Studies Journal, vol. 0(3B), pages 1012-1026.
    6. Fernando Garrigós-Simón & Silvia Sanz-Blas & Yeamduan Narangajavana & Daniela Buzova, 2021. "The Nexus between Big Data and Sustainability: An Analysis of Current Trends and Developments," Sustainability, MDPI, vol. 13(12), pages 1-24, June.
    7. Hassani, Hossein & Beneki, Christina & Silva, Emmanuel Sirimal & Vandeput, Nicolas & Madsen, Dag Øivind, 2021. "The science of statistics versus data science: What is the future?," Technological Forecasting and Social Change, Elsevier, vol. 173(C).
    8. Haoran Zhu & Lei Lei, 2022. "The Research Trends of Text Classification Studies (2000–2020): A Bibliometric Analysis," SAGE Open, , vol. 12(2), pages 21582440221, April.
    9. Yin Zhang & Il‐Yeol Song & Theresa Anderson & Dan Wu, 2023. "About JASIST special issue on “Data Science in the iField”," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 74(6), pages 601-605, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zhang, Yi & Huang, Ying & Porter, Alan L. & Zhang, Guangquan & Lu, Jie, 2019. "Discovering and forecasting interactions in big data research: A learning-enhanced bibliometric study," Technological Forecasting and Social Change, Elsevier, vol. 146(C), pages 795-807.
    2. Christian Mühlroth & Michael Grottke, 2018. "A systematic literature review of mining weak signals and trends for corporate foresight," Journal of Business Economics, Springer, vol. 88(5), pages 643-687, July.
    3. Xu, Shuo & Hao, Liyuan & An, Xin & Yang, Guancan & Wang, Feifei, 2019. "Emerging research topics detection with multiple machine learning models," Journal of Informetrics, Elsevier, vol. 13(4).
    4. Xiaozan Lyu & Rodrigo Costas, 2021. "Studying the characteristics of scientific communities using individual-level bibliometrics: the case of Big Data research," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(8), pages 6965-6987, August.
    5. Tuba Bircan & Almila Alkim Akdag Salah, 2022. "A Bibliometric Analysis of the Use of Artificial Intelligence Technologies for Social Sciences," Mathematics, MDPI, vol. 10(23), pages 1-17, November.
    6. Lin Zhang & Wenjing Zhao & Beibei Sun & Ying Huang & Wolfgang Glänzel, 2020. "How scientific research reacts to international public health emergencies: a global analysis of response patterns," Scientometrics, Springer;Akadémiai Kiadó, vol. 124(1), pages 747-773, July.
    7. Shuo Xu & Liyuan Hao & Xin An & Hongshen Pang & Ting Li, 2020. "Review on emerging research topics with key-route main path analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 122(1), pages 607-624, January.
    8. Situngkir, Hokky, 2015. "Indonesia embraces the Data Science," MPRA Paper 66048, University Library of Munich, Germany.
    9. Johannes Sorz & Wolfgang Glänzel & Ursula Ulrych & Christian Gumpenberger & Juan Gorraiz, 2020. "Research strengths identified by esteem and bibliometric indicators: a case study at the University of Vienna," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(2), pages 1095-1116, November.
    10. Ying Huang & Wolfgang Glänzel & Lin Zhang, 2021. "Tracing the development of mapping knowledge domains," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(7), pages 6201-6224, July.
    11. Wang, Zhiqi & Chen, Yue & Glänzel, Wolfgang, 2020. "Preprints as accelerator of scholarly communication: An empirical analysis in Mathematics," Journal of Informetrics, Elsevier, vol. 14(4).
    12. Wolfgang Glänzel & Koenraad Debackere, 2022. "Various aspects of interdisciplinarity in research and how to quantify and measure those," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(9), pages 5551-5569, September.
    13. Sabrina L. Woltmann & Lars Alkærsig, 2018. "Tracing university–industry knowledge transfer through a text mining approach," Scientometrics, Springer;Akadémiai Kiadó, vol. 117(1), pages 449-472, October.
    14. İsmail Dönmez, 2024. "Sustainability in Educational Research: Mapping the Field with a Bibliometric Analysis," Sustainability, MDPI, vol. 16(13), pages 1-16, June.
    15. Zhichao Ba & Yujie Cao & Jin Mao & Gang Li, 2019. "A hierarchical approach to analyzing knowledge integration between two fields—a case study on medical informatics and computer science," Scientometrics, Springer;Akadémiai Kiadó, vol. 119(3), pages 1455-1486, June.
    16. Wolfgang Glänzel & Lin Zhang, 2018. "Scientometric research assessment in the developing world: A tribute to Michael J. Moravcsik from the perspective of the twenty-first century," Scientometrics, Springer;Akadémiai Kiadó, vol. 115(3), pages 1517-1532, June.
    17. Siluo Yang & Xin Xing & Fan Qi & Maria Cláudia Cabrini Grácio, 2021. "Comparison of academic book impact from a disciplinary perspective: an analysis of citations and altmetric indicators," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(2), pages 1101-1123, February.
    18. Mingkun Wei & Abdolreza Noroozi Chakoli, 2020. "Evaluating the relationship between the academic and social impact of open access books based on citation behaviors and social media attention," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(3), pages 2401-2420, December.
    19. Kwon, Seokbeom & Liu, Xiaoyu & Porter, Alan L. & Youtie, Jan, 2019. "Research addressing emerging technological ideas has greater scientific impact," Research Policy, Elsevier, vol. 48(9), pages 1-1.
    20. Costa, Carlos & Santos, Maribel Yasmina, 2017. "The data scientist profile and its representativeness in the European e-Competence framework and the skills framework for the information age," International Journal of Information Management, Elsevier, vol. 37(6), pages 726-734.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:122:y:2020:i:3:d:10.1007_s11192-020-03371-2. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.