IDEAS home Printed from https://ideas.repec.org/a/eee/infome/v11y2017i3p629-644.html
   My bibliography  Save this article

Search for evergreens in science: A functional data analysis

Author

Listed:
  • Zhang, Ruizhi
  • Wang, Jian
  • Mei, Yajun

Abstract

Evergreens in science are papers that display a continual rise in annual citations without decline, at least within a sufficiently long time period. Aiming to better understand evergreens in particular and patterns of citation trajectory in general, this paper develops a functional data analysis method to cluster citation trajectories of a sample of 1699 research papers published in 1980 in the American Physical Society (APS) journals. We propose a functional Poisson regression model for individual papers’ citation trajectories, and fit the model to the observed 30-year citations of individual papers by functional principal component analysis and maximum likelihood estimation. Based on the estimated paper-specific coefficients, we apply the K-means clustering algorithm to cluster papers into different groups, for uncovering general types of citation trajectories. The result demonstrates the existence of an evergreen cluster of papers that do not exhibit any decline in annual citations over 30 years.

Suggested Citation

  • Zhang, Ruizhi & Wang, Jian & Mei, Yajun, 2017. "Search for evergreens in science: A functional data analysis," Journal of Informetrics, Elsevier, vol. 11(3), pages 629-644.
  • Handle: RePEc:eee:infome:v:11:y:2017:i:3:p:629-644
    DOI: 10.1016/j.joi.2017.05.007
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S1751157716303583
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.joi.2017.05.007?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. J. O. Ramsay, 1998. "Estimating smooth monotone functions," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 60(2), pages 365-375.
    2. Rodrigo Costas & Thed N. van Leeuwen & Anthony F.J. van Raan, 2010. "Is scientific literature subject to a ‘Sell-By-Date’? A general methodology to analyze the ‘durability’ of scientific documents," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 61(2), pages 329-339, February.
    3. Aurel Avramescu, 1979. "Actuality and Obsolescence of Scientific Literature," Journal of the American Society for Information Science, Association for Information Science & Technology, vol. 30(5), pages 296-303, September.
    4. Yao, Fang & Muller, Hans-Georg & Wang, Jane-Ling, 2005. "Functional Data Analysis for Sparse Longitudinal Data," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 577-590, June.
    5. Nicoleta Serban & Ana-Maria Staicu & Raymond J. Carroll, 2013. "Multilevel Cross-Dependent Binary Longitudinal Data," Biometrics, The International Biometric Society, vol. 69(4), pages 903-913, December.
    6. Colavizza, Giovanni & Franceschet, Massimo, 2016. "Clustering citation histories in the Physical Review," Journal of Informetrics, Elsevier, vol. 10(4), pages 1037-1051.
    7. Angelika Linde, 2009. "A Bayesian latent variable approach to functional principal components analysis with binary and count data," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 93(3), pages 307-333, September.
    8. Jian Wang & Bart Thijs & Wolfgang Glänzel, 2015. "Interdisciplinarity and Impact: Distinct Effects of Variety, Balance, and Disparity," PLOS ONE, Public Library of Science, vol. 10(5), pages 1-18, May.
    9. Philippe Besse & J. Ramsay, 1986. "Principal components analysis of sampled functions," Psychometrika, Springer;The Psychometric Society, vol. 51(2), pages 285-311, June.
    10. Anthony F. J. van Raan, 2004. "Sleeping Beauties in science," Scientometrics, Springer;Akadémiai Kiadó, vol. 59(3), pages 467-472, March.
    11. Susanne E. Baumgartner & Loet Leydesdorff, 2014. "Group-based trajectory modeling (GBTM) of citations in scholarly literature: Dynamic qualities of “transient” and “sticky knowledge claims”," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 65(4), pages 797-811, April.
    12. Juan D Rogers, 2010. "Citation analysis of nanotechnology at the field level: implications of R&D evaluation," Research Evaluation, Oxford University Press, vol. 19(4), pages 281-290, October.
    13. Jian Wang, 2013. "Citation time window choice for research impact evaluation," Scientometrics, Springer;Akadémiai Kiadó, vol. 94(3), pages 851-872, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Wang, Jian & Veugelers, Reinhilde & Stephan, Paula, 2017. "Bias against novelty in science: A cautionary tale for users of bibliometric indicators," Research Policy, Elsevier, vol. 46(8), pages 1416-1436.
    2. Saarela, Mirka & Kärkkäinen, Tommi, 2020. "Can we automate expert-based journal rankings? Analysis of the Finnish publication indicator," Journal of Informetrics, Elsevier, vol. 14(2).
    3. Chakraborty, Joyita & Pradhan, Dinesh K. & Nandi, Subrata, 2024. "A multiple k-means cluster ensemble framework for clustering citation trajectories," Journal of Informetrics, Elsevier, vol. 18(2).
    4. Jianhua Hou & Xiucai Yang, 2019. "Patent sleeping beauties: evolutionary trajectories and identification methods," Scientometrics, Springer;Akadémiai Kiadó, vol. 120(1), pages 187-215, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Onodera, Natsuo, 2016. "Properties of an index of citation durability of an article," Journal of Informetrics, Elsevier, vol. 10(4), pages 981-1004.
    2. Jianhua Hou & Xiucai Yang, 2019. "Patent sleeping beauties: evolutionary trajectories and identification methods," Scientometrics, Springer;Akadémiai Kiadó, vol. 120(1), pages 187-215, July.
    3. Min, Chao & Sun, Jianjun & Pei, Lei & Ding, Ying, 2016. "Measuring delayed recognition for papers: Uneven weighted summation and total citations," Journal of Informetrics, Elsevier, vol. 10(4), pages 1153-1165.
    4. Wang, Jian & Veugelers, Reinhilde & Stephan, Paula, 2017. "Bias against novelty in science: A cautionary tale for users of bibliometric indicators," Research Policy, Elsevier, vol. 46(8), pages 1416-1436.
    5. Jian Du & Yishan Wu, 2018. "A parameter-free index for identifying under-cited sleeping beauties in science," Scientometrics, Springer;Akadémiai Kiadó, vol. 116(2), pages 959-971, August.
    6. Colavizza, Giovanni & Franceschet, Massimo, 2016. "Clustering citation histories in the Physical Review," Journal of Informetrics, Elsevier, vol. 10(4), pages 1037-1051.
    7. Lutz Bornmann & Adam Y. Ye & Fred Y. Ye, 2018. "Identifying “hot papers” and papers with “delayed recognition” in large-scale datasets by using dynamically normalized citation impact scores," Scientometrics, Springer;Akadémiai Kiadó, vol. 116(2), pages 655-674, August.
    8. Jeff Goldsmith & Vadim Zipunnikov & Jennifer Schrack, 2015. "Generalized multilevel function-on-scalar regression and principal component analysis," Biometrics, The International Biometric Society, vol. 71(2), pages 344-353, June.
    9. Wang, Jian & Hicks, Diana, 2015. "Scientific teams: Self-assembly, fluidness, and interdependence," Journal of Informetrics, Elsevier, vol. 9(1), pages 197-207.
    10. Jianjun Sun & Chao Min & Jiang Li, 2016. "A vector for measuring obsolescence of scientific articles," Scientometrics, Springer;Akadémiai Kiadó, vol. 107(2), pages 745-757, May.
    11. Jian Wang & Bart Thijs & Wolfgang Glänzel, 2015. "Interdisciplinarity and Impact: Distinct Effects of Variety, Balance, and Disparity," PLOS ONE, Public Library of Science, vol. 10(5), pages 1-18, May.
    12. Hou, Jianhua & Yang, Xiucai, 2020. "Social media-based sleeping beauties: Defining, identifying and features," Journal of Informetrics, Elsevier, vol. 14(2).
    13. Wang, Jian, 2016. "Knowledge creation in collaboration networks: Effects of tie configuration," Research Policy, Elsevier, vol. 45(1), pages 68-80.
    14. Shin, Yei Eun & Zhou, Lan & Ding, Yu, 2022. "Joint estimation of monotone curves via functional principal component analysis," Computational Statistics & Data Analysis, Elsevier, vol. 166(C).
    15. Sotaro Shibayama & Jian Wang, 2020. "Measuring originality in science," Scientometrics, Springer;Akadémiai Kiadó, vol. 122(1), pages 409-427, January.
    16. Zhichao Fang & Rodrigo Costas, 2020. "Studying the accumulation velocity of altmetric data tracked by Altmetric.com," Scientometrics, Springer;Akadémiai Kiadó, vol. 123(2), pages 1077-1101, May.
    17. Jian Wang, 2013. "Citation time window choice for research impact evaluation," Scientometrics, Springer;Akadémiai Kiadó, vol. 94(3), pages 851-872, March.
    18. Winnink, J.J. & Tijssen, Robert J.W. & van Raan, A.F.J., 2019. "Searching for new breakthroughs in science: How effective are computerised detection algorithms?," Technological Forecasting and Social Change, Elsevier, vol. 146(C), pages 673-686.
    19. You Song & Fangling Situ & Hongjun Zhu & Jinzhi Lei, 2018. "To be the Prince to wake up Sleeping Beauty: the rediscovery of the delayed recognition studies," Scientometrics, Springer;Akadémiai Kiadó, vol. 117(1), pages 9-24, October.
    20. Jianhua Hou & Xiucai Yang & Yang Zhang, 2023. "The effect of social media knowledge cascade: an analysis of scientific papers diffusion," Scientometrics, Springer;Akadémiai Kiadó, vol. 128(9), pages 5169-5195, September.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:infome:v:11:y:2017:i:3:p:629-644. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/joi .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.