IDEAS home Printed from https://ideas.repec.org/a/eee/infome/v6y2012i1p66-79.html
   My bibliography  Save this article

Modeling the probabilistic distribution of the impact factor

Author

Listed:
  • Sarabia, José María
  • Prieto, Faustino
  • Trueba, Carmen

Abstract

The study of the informetric distributions, such as distributions of citations and impact factors is one of the most relevant topics in the current informetric research. Several laws for modeling impact factor based on ranks have been proposed, including Zipf, Lavalette and the two-exponent law proposed by Mansilla et al. (2007). In this paper, the underlying probabilistic quantile function corresponding to the Mansilla's two-exponent law is obtained. This result is particularly relevant, since it allows us to know the underlying population, to learn about all its features and to use statistical inference procedures. Several probabilistic descriptive measures are obtained, including moments, Lorenz and Leimkuhler curves and Gini index. The distribution of the order statistics is derived. Least squares estimates are obtained. The different results are illustrated using the data of the impact factors in eight relevant scientific fields.

Suggested Citation

  • Sarabia, José María & Prieto, Faustino & Trueba, Carmen, 2012. "Modeling the probabilistic distribution of the impact factor," Journal of Informetrics, Elsevier, vol. 6(1), pages 66-79.
  • Handle: RePEc:eee:infome:v:6:y:2012:i:1:p:66-79
    DOI: 10.1016/j.joi.2011.09.005
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S1751157711000824
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.joi.2011.09.005?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. L. Egghe, 2005. "Zipfian and Lotkaian continuous concentration theory," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 56(9), pages 935-945, July.
    2. Sarabia, J. -M. & Castillo, Enrique & Slottje, Daniel J., 1999. "An ordered family of Lorenz curves," Journal of Econometrics, Elsevier, vol. 91(1), pages 43-60, July.
    3. Mansilla, R. & Köppen, E. & Cocho, G. & Miramontes, P., 2007. "On the behavior of journal impact factor rank-order distribution," Journal of Informetrics, Elsevier, vol. 1(2), pages 155-160.
    4. Waltman, L. & van Eck, N.J.P., 2009. "Some Comments on Egghe’s Derivation of the Impact Factor Distribution," ERIM Report Series Research in Management ERS-2009-016-LIS, Erasmus Research Institute of Management (ERIM), ERIM is the joint research institute of the Rotterdam School of Management, Erasmus University and the Erasmus School of Economics (ESE) at Erasmus University Rotterdam.
    5. Mishra, SK, 2010. "A note on empirical sample distribution of journal impact factors in major discipline groups," MPRA Paper 20747, University Library of Munich, Germany.
    6. Juan Miguel Campanario, 2010. "Distribution of ranks of articles and citations in journals," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 61(2), pages 419-423, February.
    7. Balakrishnan, N. & Sarabia, José María & Kolev, Nikolai, 2010. "A simple relation between the Leimkuhler curve and the mean residual life," Journal of Informetrics, Elsevier, vol. 4(4), pages 602-607.
    8. Juan Miguel Campanario, 2010. "Distribution of changes in impact factors over time," Scientometrics, Springer;Akadémiai Kiadó, vol. 84(1), pages 35-42, July.
    9. Sarabia, José María, 2008. "A general definition of the Leimkuhler curve," Journal of Informetrics, Elsevier, vol. 2(2), pages 156-163.
    10. Sarabia, José María & Gómez-Déniz, Emilio & Sarabia, María & Prieto, Faustino, 2010. "A general method for generating parametric Lorenz and Leimkuhler curves," Journal of Informetrics, Elsevier, vol. 4(4), pages 524-539.
    11. Egghe, L., 2009. "Mathematical derivation of the impact factor distribution," Journal of Informetrics, Elsevier, vol. 3(4), pages 290-295.
    12. Gastwirth, Joseph L, 1971. "A General Definition of the Lorenz Curve," Econometrica, Econometric Society, vol. 39(6), pages 1037-1039, November.
    13. Juan Miguel Campanario, 2010. "Distribution of ranks of articles and citations in journals," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 61(2), pages 419-423, February.
    14. Waltman, Ludo & van Eck, Nees Jan, 2009. "Some comments on Egghe's derivation of the impact factor distribution," Journal of Informetrics, Elsevier, vol. 3(4), pages 363-366.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Unnikrishnan Nair, N. & Vineshkumar, B., 2022. "Modelling informetric data using quantile functions," Journal of Informetrics, Elsevier, vol. 16(2).
    2. Mrowinski, Maciej J. & Gagolewski, Marek & Siudem, Grzegorz, 2022. "Accidentality in journal citation patterns," Journal of Informetrics, Elsevier, vol. 16(4).
    3. Brzezinski, Michal, 2014. "Empirical modeling of the impact factor distribution," Journal of Informetrics, Elsevier, vol. 8(2), pages 362-368.
    4. Cerovšek, Tomo & Mikoš, Matjaž, 2014. "A comparative study of cross-domain research output and citations: Research impact cubes and binary citation frequencies," Journal of Informetrics, Elsevier, vol. 8(1), pages 147-161.
    5. Richard S.J. Tol, 2013. "Measuring catch-up growth in malnourished populations," Working Paper Series 6013, Department of Economics, University of Sussex Business School.
    6. Bertoli-Barsotti, Lucio & Lando, Tommaso, 2019. "How mean rank and mean size may determine the generalised Lorenz curve: With application to citation analysis," Journal of Informetrics, Elsevier, vol. 13(1), pages 387-396.
    7. Alina MOROSANU, 2013. "Empirical Study Of Different Factors Effects On Articles Publication Regarding Survey Interviewer Characteristics Using Multilevel Regression Model," Management and Marketing Journal, University of Craiova, Faculty of Economics and Business Administration, vol. 0(1), pages 141-156, May.
    8. Tol, Richard S.J., 2013. "Identifying excellent researchers: A new approach," Journal of Informetrics, Elsevier, vol. 7(4), pages 803-810.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Balakrishnan, N. & Sarabia, José María & Kolev, Nikolai, 2010. "A simple relation between the Leimkuhler curve and the mean residual life," Journal of Informetrics, Elsevier, vol. 4(4), pages 602-607.
    2. Brzezinski, Michal, 2014. "Empirical modeling of the impact factor distribution," Journal of Informetrics, Elsevier, vol. 8(2), pages 362-368.
    3. L. Egghe, 2011. "The impact factor rank-order distribution revisited," Scientometrics, Springer;Akadémiai Kiadó, vol. 87(3), pages 683-685, June.
    4. Huang, Ding-wei, 2017. "Impact factor distribution revisited," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 482(C), pages 173-180.
    5. Sarabia, José María & Gómez-Déniz, Emilio & Sarabia, María & Prieto, Faustino, 2010. "A general method for generating parametric Lorenz and Leimkuhler curves," Journal of Informetrics, Elsevier, vol. 4(4), pages 524-539.
    6. Lucio Bertoli-Barsotti & Marek Gagolewski & Grzegorz Siudem & Barbara .Zoga{l}a-Siudem, 2023. "Gini-stable Lorenz curves and their relation to the generalised Pareto distribution," Papers 2304.07480, arXiv.org, revised Jan 2024.
    7. Bertoli-Barsotti, Lucio & Gagolewski, Marek & Siudem, Grzegorz & Żogała-Siudem, Barbara, 2024. "Gini-stable Lorenz curves and their relation to the generalised Pareto distribution," Journal of Informetrics, Elsevier, vol. 18(2).
    8. Sarabia, José María, 2008. "A general definition of the Leimkuhler curve," Journal of Informetrics, Elsevier, vol. 2(2), pages 156-163.
    9. Bárbara S. Lancho-Barrantes & Vicente P. Guerrero-Bote & Félix Moya-Anegón, 2010. "The iceberg hypothesis revisited," Scientometrics, Springer;Akadémiai Kiadó, vol. 85(2), pages 443-461, November.
    10. Bertoli-Barsotti, Lucio & Lando, Tommaso, 2019. "How mean rank and mean size may determine the generalised Lorenz curve: With application to citation analysis," Journal of Informetrics, Elsevier, vol. 13(1), pages 387-396.
    11. Unnikrishnan Nair, N. & Vineshkumar, B., 2022. "Modelling informetric data using quantile functions," Journal of Informetrics, Elsevier, vol. 16(2).
    12. Barry C. Arnold & José María Sarabia, 2018. "Analytic Expressions for Multivariate Lorenz Surfaces," Sankhya A: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 80(1), pages 84-111, December.
    13. Copiello, Sergio, 2019. "Peer and neighborhood effects: Citation analysis using a spatial autoregressive model and pseudo-spatial data," Journal of Informetrics, Elsevier, vol. 13(1), pages 238-254.
    14. E. Gómez-Déniz, 2016. "A family of arctan Lorenz curves," Empirical Economics, Springer, vol. 51(3), pages 1215-1233, November.
    15. Bertoli-Barsotti, Lucio & Lando, Tommaso, 2015. "On a formula for the h-index," Journal of Informetrics, Elsevier, vol. 9(4), pages 762-776.
    16. Mrowinski, Maciej J. & Gagolewski, Marek & Siudem, Grzegorz, 2022. "Accidentality in journal citation patterns," Journal of Informetrics, Elsevier, vol. 16(4).
    17. Hsiang-chi Tseng & Wei-neng Huang & Ding-wei Huang, 2017. "Modified Benford’s law for two-exponent distributions," Scientometrics, Springer;Akadémiai Kiadó, vol. 110(3), pages 1403-1413, March.
    18. Juan Miguel Campanario, 2018. "Are leaders really leading? Journals that are first in Web of Science subject categories in the context of their groups," Scientometrics, Springer;Akadémiai Kiadó, vol. 115(1), pages 111-130, April.
    19. Masato Okamoto, 2014. "Interpolating the Lorenz Curve: Methods to Preserve Shape and Remain Consistent with the Concentration Curves for Components," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 60(2), pages 349-384, June.
    20. Fontanari Andrea & Cirillo Pasquale & Oosterlee Cornelis W., 2020. "Lorenz-generated bivariate Archimedean copulas," Dependence Modeling, De Gruyter, vol. 8(1), pages 186-209, January.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:infome:v:6:y:2012:i:1:p:66-79. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/joi .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.