IDEAS home Printed from https://ideas.repec.org/a/spr/scient/v129y2024i11d10.1007_s11192-024-04937-0.html
   My bibliography  Save this article

Measuring the evolving stage of temporal distribution of research topic keyword in scientific literature by research heat curve

Author

Listed:
  • Junsheng Zhang

    (Institute of Scientific and Technical Information of China)

  • Xiaoping Sun

    (Chinese Academy of Sciences)

  • Zhihui Liu

    (Institute of Scientific and Technical Information of China)

Abstract

Scientific literature records research progresses of science and technology. Research topics of technologies are evolving in scientific literature. The temporal distribution of the research topic keywords in literature can reflect the evolving stages of a research topic over time. A research topic can be in different evolving stages with different evolving distributions. Previous work mainly focused on visualizing the temporal distribution of keyword weights to illustrate the developing history and trend of a research topic in a literature collection. Quantitatively measuring the evolving stage of a research topic keyword by a baseline distribution can help to detect topic evolving stages in a large scientific literature corpus in an automatic way. How to build a quantitative baseline and how to quantitative compare the topic temporal distribution with the baseline distribution are two challenges. In this paper, an explicit function of the research heat curve is obtained by constructing a differential equation system of evolving research population groups within a research community on a research topic represented by a topic keyword. Six segments of the heat curve are obtained by zero points of derivatives of the heat curve, which together with the full heat curve are used as the quantitative baselines for measuring the temporal distribution of a research topic in different evolving stages. The temporal distribution of a research topic keyword in a scientific literature collection is obtained from the TF-IDF features of the literature collection. A curve shape matching algorithm is designed to match the temporal distribution curve with each baseline segment of the heat curve function to obtain a distance by measuring the shape similarity between the baseline segment curve and the temporal distribution curve. The segment with the smallest distance is used as a quantitative indicator of the evolving stage of the research topic. Experiments on the produced distributions and the real distributions confirm the effectiveness of the heat curve matching method for measuring the evolving stages from the temporal distribution of topics.

Suggested Citation

  • Junsheng Zhang & Xiaoping Sun & Zhihui Liu, 2024. "Measuring the evolving stage of temporal distribution of research topic keyword in scientific literature by research heat curve," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(11), pages 7287-7328, November.
  • Handle: RePEc:spr:scient:v:129:y:2024:i:11:d:10.1007_s11192-024-04937-0
    DOI: 10.1007/s11192-024-04937-0
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11192-024-04937-0
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11192-024-04937-0?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Staša Milojević & Cassidy R. Sugimoto & Erjia Yan & Ying Ding, 2011. "The cognitive structure of Library and Information Science: Analysis of article title words," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 62(10), pages 1933-1953, October.
    2. Lin, Yiling & Evans, James A. & Wu, Lingfei, 2022. "New directions in science emerge from disconnection and discord," Journal of Informetrics, Elsevier, vol. 16(1).
    3. Wang, Jian & Veugelers, Reinhilde & Stephan, Paula, 2017. "Bias against novelty in science: A cautionary tale for users of bibliometric indicators," Research Policy, Elsevier, vol. 46(8), pages 1416-1436.
    4. Lutz Bornmann & Robin Haunschild & Rüdiger Mutz, 2021. "Growth rates of modern science: a latent piecewise growth curve approach to model publication numbers from established and new literature databases," Palgrave Communications, Palgrave Macmillan, vol. 8(1), pages 1-15, December.
    5. Hong Wu & Huifang Yi & Chang Li, 2021. "An integrated approach for detecting and quantifying the topic evolutions of patent technology: a case study on graphene field," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(8), pages 6301-6321, August.
    6. Seyyed Reza Taher Harikandeh & Sadegh Aliakbary & Soroush Taheri, 2023. "An embedding approach for analyzing the evolution of research topics with a case study on computer science subdomains," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(3), pages 1567-1582, March.
    7. Dotsika, Fefie & Watkins, Andrew, 2017. "Identifying potentially disruptive trends by means of keyword network analysis," Technological Forecasting and Social Change, Elsevier, vol. 119(C), pages 114-127.
    8. Rotolo, Daniele & Hicks, Diana & Martin, Ben R., 2015. "What is an emerging technology?," Research Policy, Elsevier, vol. 44(10), pages 1827-1843.
    9. Samira Ranaei & Arho Suominen & Alan Porter & Stephen Carley, 2020. "Evaluating technological emergence using text analytics: two case technologies and three approaches," Scientometrics, Springer;Akadémiai Kiadó, vol. 122(1), pages 215-247, January.
    10. Kun Dong & Haiyun Xu & Rui Luo & Ling Wei & Shu Fang, 2018. "An integrated method for interdisciplinary topic identification and prediction: a case study on information science and library science," Scientometrics, Springer;Akadémiai Kiadó, vol. 115(2), pages 849-868, May.
    11. Ma, Tingting & Zhou, Xiao & Liu, Jia & Lou, Zhenkai & Hua, Zhaoting & Wang, Ruitao, 2021. "Combining topic modeling and SAO semantic analysis to identify technological opportunities of emerging technologies," Technological Forecasting and Social Change, Elsevier, vol. 173(C).
    12. Chen, Baitong & Tsutsui, Satoshi & Ding, Ying & Ma, Feicheng, 2017. "Understanding the topic evolution in a scientific domain: An exploratory study for the field of information retrieval," Journal of Informetrics, Elsevier, vol. 11(4), pages 1175-1189.
    13. Lingfei Wu & Dashun Wang & James A. Evans, 2019. "Large teams develop and small teams disrupt science and technology," Nature, Nature, vol. 566(7744), pages 378-382, February.
    14. Yan, Erjia, 2014. "Research dynamics: Measuring the continuity and popularity of research topics," Journal of Informetrics, Elsevier, vol. 8(1), pages 98-110.
    15. Lu Huang & Xiang Chen & Yi Zhang & Changtian Wang & Xiaoli Cao & Jiarun Liu, 2022. "Identification of topic evolution: network analytics with piecewise linear representation and word embedding," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(9), pages 5353-5383, September.
    16. Staša Milojević & Cassidy R. Sugimoto & Erjia Yan & Ying Ding, 2011. "The cognitive structure of Library and Information Science: Analysis of article title words," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 62(10), pages 1933-1953, October.
    17. Haiyun Xu & Ting Guo & Zenghui Yue & Lijie Ru & Shu Fang, 2016. "Interdisciplinary topics of information science: a study based on the terms interdisciplinarity index series," Scientometrics, Springer;Akadémiai Kiadó, vol. 106(2), pages 583-601, February.
    18. Sotaro Shibayama & Jian Wang, 2020. "Measuring originality in science," Scientometrics, Springer;Akadémiai Kiadó, vol. 122(1), pages 409-427, January.
    19. Kumar, Vivek & Srivastava, Arpita, 2022. "Trends in the thematic landscape of corporate social responsibility research: A structural topic modeling approach," Journal of Business Research, Elsevier, vol. 150(C), pages 26-37.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Chengzhi Zhang & Philipp Mayr & Wei Lu & Yi Zhang, 2024. "An editorial note on extraction and evaluation of knowledge entities from scientific documents," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(11), pages 7169-7174, November.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Hou, Jianhua & Wang, Dongyi & Li, Jing, 2022. "A new method for measuring the originality of academic articles based on knowledge units in semantic networks," Journal of Informetrics, Elsevier, vol. 16(3).
    2. Yosuke Miyata & Emi Ishita & Fang Yang & Michimasa Yamamoto & Azusa Iwase & Keiko Kurata, 2020. "Knowledge structure transition in library and information science: topic modeling and visualization," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(1), pages 665-687, October.
    3. Wang, Cheng-Jun & Yan, Lihan & Cui, Haochuan, 2023. "Unpacking the essential tension of knowledge recombination: Analyzing the impact of knowledge spanning on citation impact and disruptive innovation," Journal of Informetrics, Elsevier, vol. 17(4).
    4. Bornmann, Lutz & Haunschild, Robin, 2022. "Empirical analysis of recent temporal dynamics of research fields: Annual publications in chemistry and related areas as an example," Journal of Informetrics, Elsevier, vol. 16(2).
    5. S. Lozano & L. Calzada-Infante & B. Adenso-Díaz & S. García, 2019. "Complex network analysis of keywords co-occurrence in the recent efficiency analysis literature," Scientometrics, Springer;Akadémiai Kiadó, vol. 120(2), pages 609-629, August.
    6. Keye Wu & Ziyue Xie & Jia Tina Du, 2024. "Does science disrupt technology? Examining science intensity, novelty, and recency through patent-paper citations in the pharmaceutical field," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(9), pages 5469-5491, September.
    7. Abhijit Thakuria & Dipen Deka, 2024. "A decadal study on identifying latent topics and research trends in open access LIS journals using topic modeling approach," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(7), pages 3841-3869, July.
    8. Yuefen Wang & Lipeng Fan & Lei Wu, 2024. "A validation test of the Uzzi et al. novelty measure of innovation and applications to collaboration patterns between institutions," Scientometrics, Springer;Akadémiai Kiadó, vol. 129(7), pages 4379-4394, July.
    9. Wenjie Wei & Hongxu Liu & Zhuanlan Sun, 2022. "Cover papers of top journals are reliable source for emerging topics detection: a machine learning based prediction framework," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(8), pages 4315-4333, August.
    10. Yu-Wei Chang, 2018. "Examining interdisciplinarity of library and information science (LIS) based on LIS articles contributed by non-LIS authors," Scientometrics, Springer;Akadémiai Kiadó, vol. 116(3), pages 1589-1613, September.
    11. Libo Sheng & Dongqing Lyu & Xuanmin Ruan & Hongquan Shen & Ying Cheng, 2023. "The association between prior knowledge and the disruption of an article," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(8), pages 4731-4751, August.
    12. Wang, Xiaoguang & He, Jing & Huang, Han & Wang, Hongyu, 2022. "MatrixSim: A new method for detecting the evolution paths of research topics," Journal of Informetrics, Elsevier, vol. 16(4).
    13. Wu, Lingfei & Kittur, Aniket & Youn, Hyejin & Milojević, Staša & Leahey, Erin & Fiore, Stephen M. & Ahn, Yong-Yeol, 2022. "Metrics and mechanisms: Measuring the unmeasurable in the science of science," Journal of Informetrics, Elsevier, vol. 16(2).
    14. Zhentao Liang & Jin Mao & Gang Li, 2023. "Bias against scientific novelty: A prepublication perspective," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 74(1), pages 99-114, January.
    15. Yue Wang & Ning Li & Bin Zhang & Qian Huang & Jian Wu & Yang Wang, 2023. "The effect of structural holes on producing novel and disruptive research in physics," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(3), pages 1801-1823, March.
    16. Li, Meiling & Wang, Yang & Du, Haifeng & Bai, Aruhan, 2024. "Motivating innovation: The impact of prestigious talent funding on junior scientists," Research Policy, Elsevier, vol. 53(9).
    17. Lin, Yiling & Evans, James A. & Wu, Lingfei, 2022. "New directions in science emerge from disconnection and discord," Journal of Informetrics, Elsevier, vol. 16(1).
    18. Deichmann, Dirk & Moser, Christine & Birkholz, Julie M. & Nerghes, Adina & Groenewegen, Peter & Wang, Shenghui, 2020. "Ideas with impact: How connectivity shapes idea diffusion," Research Policy, Elsevier, vol. 49(1).
    19. Zhongyi Wang & Keying Wang & Jiyue Liu & Jing Huang & Haihua Chen, 2022. "Measuring the innovation of method knowledge elements in scientific literature," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(5), pages 2803-2827, May.
    20. Xu, Shuo & Hao, Liyuan & An, Xin & Yang, Guancan & Wang, Feifei, 2019. "Emerging research topics detection with multiple machine learning models," Journal of Informetrics, Elsevier, vol. 13(4).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:scient:v:129:y:2024:i:11:d:10.1007_s11192-024-04937-0. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.