Printed from https://ideas.repec.org/a/eee/infome/v9y2015i4p762-776.html

On a formula for the h-index

Author

Listed:
  • Bertoli-Barsotti, Lucio
  • Lando, Tommaso

Abstract

The h-index is a celebrated indicator widely used to assess the quality of researchers and organizations. Empirical studies show that the h-index is well correlated with other simple bibliometric indicators, such as the total number of publications N and the total number of citations C. In this paper we introduce a new formula h̃_w = h̃_w(N, C, c_MAX), a representative predictive formula that functionally relates h to the aggregate indicators N and C and to the highest citation count c_MAX. The formula is based on the ‘specific’ assumption of geometrically distributed citations, but provides a good estimate of the h-index in the general case. To evaluate the fit of the proposed formula h̃_w, an empirical study with 131 datasets (13,347 papers; 288,972 citations) was carried out. The overall fit (defined as the capacity of h̃_w to reproduce the true value of h for each individual scientist) was remarkably accurate. The predicted value was within one of the actual value of h for more than 60% of the datasets. In approximately three cases out of four the absolute error was less than or equal to 2, and the average absolute error was only 1.9 over the whole sample of datasets.
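The quantities named in the abstract can be illustrated with a minimal Python sketch. It computes the h-index by its standard definition, extracts the three inputs N, C and c_MAX from a citation list, and summarizes prediction errors in the way the abstract reports them (mean absolute error, share within 1, share within 2). The formula h̃_w itself is not given on this page, so it is not reproduced here; the function names, the sample citation counts and the sample predictions are all hypothetical and serve only as illustration.

```python
from typing import Dict, Iterable, Sequence, Tuple


def h_index(citations: Iterable[int]) -> int:
    """Largest h such that at least h papers have at least h citations each."""
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank
        else:
            break
    return h


def aggregates(citations: Sequence[int]) -> Tuple[int, int, int]:
    """Return (N, C, c_MAX): paper count, total citations, highest citation count."""
    return len(citations), sum(citations), max(citations, default=0)


def error_summary(actual: Sequence[int], predicted: Sequence[float]) -> Dict[str, float]:
    """Summarize absolute prediction errors across datasets."""
    if not actual or len(actual) != len(predicted):
        raise ValueError("actual and predicted must be non-empty and of equal length")
    errors = [abs(a - p) for a, p in zip(actual, predicted)]
    n = len(errors)
    return {
        "mean_abs_error": sum(errors) / n,
        "share_within_1": sum(e <= 1 for e in errors) / n,
        "share_within_2": sum(e <= 2 for e in errors) / n,
    }


# Hypothetical citation record of one scientist (illustrative values only).
record = [10, 8, 5, 4, 3, 0]
print(h_index(record))      # 4
print(aggregates(record))   # (6, 30, 10)

# Hypothetical actual vs. predicted h values for four datasets.
print(error_summary([4, 10, 7, 20], [4, 11, 9, 25]))
```

Any predictor of h, including the paper's formula once taken from the full text, could be passed through `error_summary` to produce figures comparable to those quoted in the abstract.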

Suggested Citation

  • Bertoli-Barsotti, Lucio & Lando, Tommaso, 2015. "On a formula for the h-index," Journal of Informetrics, Elsevier, vol. 9(4), pages 762-776.
  • Handle: RePEc:eee:infome:v:9:y:2015:i:4:p:762-776
    DOI: 10.1016/j.joi.2015.07.004

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S1751157715300572
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.joi.2015.07.004?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item



    Citations



    Cited by:

    1. Biró, Tamás S. & Telcs, András & Józsa, Máté & Néda, Zoltán, 2023. "Gintropic scaling of scientometric indexes," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 618(C).
    2. Hou, Jianhua & Wang, Dongyi & Li, Jing, 2022. "A new method for measuring the originality of academic articles based on knowledge units in semantic networks," Journal of Informetrics, Elsevier, vol. 16(3).
    3. Fassin, Yves, 2024. "The internal dynamics of journals’ h-cores over time," Journal of Informetrics, Elsevier, vol. 18(2).
    4. Brandão, Luana Carneiro & Soares de Mello, João Carlos Correia Baptista, 2019. "A multi-criteria approach to the h-index," European Journal of Operational Research, Elsevier, vol. 276(1), pages 357-363.
    5. Tokmachev, Andrey M., 2023. "Hidden scales in statistics of citation indicators," Journal of Informetrics, Elsevier, vol. 17(1).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lucio Bertoli-Barsotti & Tommaso Lando, 2017. "A theoretical model of the relationship between the h-index and other simple citation indicators," Scientometrics, Springer;Akadémiai Kiadó, vol. 111(3), pages 1415-1448, June.
    2. Bertoli-Barsotti, Lucio & Lando, Tommaso, 2019. "How mean rank and mean size may determine the generalised Lorenz curve: With application to citation analysis," Journal of Informetrics, Elsevier, vol. 13(1), pages 387-396.
    3. Wei, Shelia X. & Tong, Tong & Rousseau, Ronald & Wang, Wanru & Ye, Fred Y., 2022. "Relations among the h-, g-, ψ-, and p-index and offset-ability," Journal of Informetrics, Elsevier, vol. 16(4).
    4. Sangwal, Keshra, 2013. "Comparison of different mathematical functions for the analysis of citation distribution of papers of individual authors," Journal of Informetrics, Elsevier, vol. 7(1), pages 36-49.
    5. Tokmachev, Andrey M., 2023. "Hidden scales in statistics of citation indicators," Journal of Informetrics, Elsevier, vol. 17(1).
    6. Burrell, Quentin L., 2013. "The h-index: A case of the tail wagging the dog?," Journal of Informetrics, Elsevier, vol. 7(4), pages 774-783.
    7. Sangwal, Keshra, 2014. "Distributions of citations of papers of individual authors publishing in different scientific disciplines: Application of Langmuir-type function," Journal of Informetrics, Elsevier, vol. 8(4), pages 972-984.
    8. Maziar Montazerian & Edgar Dutra Zanotto & Hellmut Eckert, 2019. "A new parameter for (normalized) evaluation of H-index: countries as a case study," Scientometrics, Springer;Akadémiai Kiadó, vol. 118(3), pages 1065-1078, March.
    9. Anna Tietze & Philip Hofmann, 2019. "The h-index and multi-author hm-index for individual researchers in condensed matter physics," Scientometrics, Springer;Akadémiai Kiadó, vol. 119(1), pages 171-185, April.
    10. Filippo Radicchi & Claudio Castellano, 2013. "Analysis of bibliometric indicators for individual scholars in a large data set," Scientometrics, Springer;Akadémiai Kiadó, vol. 97(3), pages 627-637, December.
    11. Zhang, Lin & Thijs, Bart & Glänzel, Wolfgang, 2011. "The diffusion of H-related literature," Journal of Informetrics, Elsevier, vol. 5(4), pages 583-593.
    12. Quentin L. Burrell, 2014. "The individual author’s publication–citation process: theory and practice," Scientometrics, Springer;Akadémiai Kiadó, vol. 98(1), pages 725-742, January.
    13. Deming Lin & Tianhui Gong & Wenbin Liu & Martin Meyer, 2020. "An entropy-based measure for the evolution of h index research," Scientometrics, Springer;Akadémiai Kiadó, vol. 125(3), pages 2283-2298, December.
    14. Tol, Richard S.J., 2013. "The Matthew effect for cohorts of economists," Journal of Informetrics, Elsevier, vol. 7(2), pages 522-527.
    15. Muzammil Tahira & Rose Alinda Alias & Aryati Bakri, 2013. "Scientometric assessment of engineering in Malaysians universities," Scientometrics, Springer;Akadémiai Kiadó, vol. 96(3), pages 865-879, September.
    16. Gangan Prathap, 2014. "Big data and false discovery: analyses of bibliometric indicators from large data sets," Scientometrics, Springer;Akadémiai Kiadó, vol. 98(2), pages 1421-1422, February.
    17. Sangwal, Keshra, 2013. "Citation and impact factor distributions of scientific journals published in individual countries," Journal of Informetrics, Elsevier, vol. 7(2), pages 487-504.
    18. John Panaretos & Chrisovaladis Malesios, 2009. "Assessing scientific research performance and impact with single indices," Scientometrics, Springer;Akadémiai Kiadó, vol. 81(3), pages 635-670, December.
    19. Lucio Bertoli-Barsotti & Tommaso Lando, 2017. "The h-index as an almost-exact function of some basic statistics," Scientometrics, Springer;Akadémiai Kiadó, vol. 113(2), pages 1209-1228, November.
    20. Alonso, S. & Cabrerizo, F.J. & Herrera-Viedma, E. & Herrera, F., 2009. "h-Index: A review focused in its variants, computation and standardization for different scientific fields," Journal of Informetrics, Elsevier, vol. 3(4), pages 273-289.
