Printed from https://ideas.repec.org/p/col/000122/014437.html

The productivity of top researchers: A semi-nonparametric approach

Authors

  • Lina M. Cortés
  • Javier Perote
  • Andrés Mora-Valencia

Abstract

Research productivity distributions exhibit heavy tails because it is common for a few researchers to accumulate the majority of the top publications and their corresponding citations. Measurements of this productivity are very sensitive to the field being analyzed and the distribution used. In particular, distributions such as the lognormal distribution seem to systematically underestimate the productivity of the top researchers. In this article, we propose the use of a (log)semi-nonparametric distribution (log-SNP) that nests the lognormal and captures the heavy tail of the productivity distribution through the introduction of new parameters linked to high-order moments. To compare the results, we use research performance data on 140,971 researchers who have produced 253,634 publications in 18 fields of knowledge (O’Boyle and Aguinis, 2012) and show how the log-SNP distribution provides more accurate measures of the performance of the top researchers in their respective fields of knowledge.
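
The abstract does not state the log-SNP density itself. As a minimal sketch, assume it takes the Gram-Charlier form common in the semi-nonparametric literature: the lognormal density multiplied by one plus a weighted sum of Hermite polynomials in z = (ln x - mu) / sigma. The function names and the weight vector `d` below are illustrative assumptions, not the authors' notation or code.

```python
import math

import numpy as np
from numpy.polynomial.hermite_e import hermeval  # probabilists' Hermite polynomials He_s


def lognormal_pdf(x, mu, sigma):
    """Lognormal density, the case nested by the log-SNP."""
    x = np.asarray(x, dtype=float)
    z = (np.log(x) - mu) / sigma
    return np.exp(-0.5 * z**2) / (x * sigma * np.sqrt(2.0 * np.pi))


def log_snp_pdf(x, mu, sigma, d):
    """Assumed log-SNP density: lognormal * (1 + sum_s d[s-1] * He_s(z)).

    d is a non-empty sequence of weights on He_1, He_2, ..., He_n.
    Positivity is not guaranteed for arbitrary d; the caller must check it.
    """
    x = np.asarray(x, dtype=float)
    z = (np.log(x) - mu) / sigma
    coeffs = np.concatenate(([1.0], np.asarray(d, dtype=float)))
    return lognormal_pdf(x, mu, sigma) * hermeval(z, coeffs)


def log_snp_tail_prob(q, mu, sigma, d):
    """P(X > q) under the assumed density, for a scalar q > 0.

    Uses the identity: integral from z0 to infinity of He_s(z) * phi(z) dz
    equals He_{s-1}(z0) * phi(z0), for s >= 1.
    """
    z0 = (math.log(q) - mu) / sigma
    phi = math.exp(-0.5 * z0**2) / math.sqrt(2.0 * math.pi)
    normal_tail = 0.5 * math.erfc(z0 / math.sqrt(2.0))
    # hermeval(z0, d) = sum_s d[s-1] * He_{s-1}(z0)
    return normal_tail + phi * float(hermeval(z0, np.asarray(d, dtype=float)))


if __name__ == "__main__":
    q = math.exp(3.0)  # three log-standard deviations above the log-mean
    print("lognormal tail:", log_snp_tail_prob(q, 0.0, 1.0, [0.0, 0.0, 0.0, 0.0]))
    print("log-SNP tail:  ", log_snp_tail_prob(q, 0.0, 1.0, [0.0, 0.0, 0.0, 0.05]))
```

With every entry of `d` set to zero, both functions reduce to the lognormal case. Under this assumed form, a positive weight on the fourth Hermite polynomial moves probability mass into the upper tail, which is one way the extra parameters tied to high-order moments could raise the estimated share of top researchers relative to the lognormal.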

Suggested Citation

  • Lina M. Cortés & Javier Perote & Andrés Mora-Valencia, 2016. "The productivity of top researchers: A semi-nonparametric approach," Documentos de Trabajo de Valor Público 14437, Universidad EAFIT.
  • Handle: RePEc:col:000122:014437

    Download full text from publisher

    File URL: http://hdl.handle.net/10784/8181
    Download Restriction: no

    References listed on IDEAS

    1. da Silva, Roberto & Kalil, Fahad & de Oliveira, José Palazzo Moreira & Martinez, Alexandre Souto, 2012. "Universality in bibliometrics," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 391(5), pages 2119-2128.
    2. Glenn Ellison, 2013. "How Does the Market Use Citation Data? The Hirsch Index in Economics," American Economic Journal: Applied Economics, American Economic Association, vol. 5(3), pages 63-90, July.
    3. Sargan, J D, 1975. "Gram-Charlier Approximations Applied to t Ratios of k-Class Estimators," Econometrica, Econometric Society, vol. 43(2), pages 327-346, March.
    4. Kocher, Martin G. & Luptacik, Mikulas & Sutter, Matthias, 2006. "Measuring productivity of research in economics: A cross-country study using DEA," Socio-Economic Planning Sciences, Elsevier, vol. 40(4), pages 314-332, December.
    5. Bertocchi, Graziella & Gambardella, Alfonso & Jappelli, Tullio & Nappi, Carmela A. & Peracchi, Franco, 2015. "Bibliometric evaluation vs. informed peer review: Evidence from Italy," Research Policy, Elsevier, vol. 44(2), pages 451-466.
    6. Kretschmer, Hildrun & Kretschmer, Theo, 2007. "Lotka's distribution and distribution of co-author pairs’ frequencies," Journal of Informetrics, Elsevier, vol. 1(4), pages 308-337.
    7. Pedro Albarrán & Juan A. Crespo & Ignacio Ortuño & Javier Ruiz-Castillo, 2011. "The skewness of science in 219 sub-fields and a number of aggregates," Scientometrics, Springer;Akadémiai Kiadó, vol. 88(2), pages 385-397, August.
    8. Juan A Crespo & Ignacio Ortuño-Ortín & Javier Ruiz-Castillo, 2012. "The Citation Merit of Scientific Publications," PLOS ONE, Public Library of Science, vol. 7(11), pages 1-9, November.
    9. Abramo, Giovanni & D’Angelo, Ciriaco Andrea, 2014. "Assessing national strengths and weaknesses in research fields," Journal of Informetrics, Elsevier, vol. 8(3), pages 766-775.
    10. Birkmaier, Daniel & Wohlrabe, Klaus, 2014. "The Matthew effect in economics reconsidered," Journal of Informetrics, Elsevier, vol. 8(4), pages 880-889.
    11. Gallant, A Ronald & Nychka, Douglas W, 1987. "Semi-nonparametric Maximum Likelihood Estimation," Econometrica, Econometric Society, vol. 55(2), pages 363-390, March.
    12. Young-Ho Eom & Santo Fortunato, 2011. "Characterizing and Modeling Citation Dynamics," PLOS ONE, Public Library of Science, vol. 6(9), pages 1-7, September.
    13. Mingers, John & Leydesdorff, Loet, 2015. "A review of theory and practice in scientometrics," European Journal of Operational Research, Elsevier, vol. 246(1), pages 1-19.
    14. Phillips, Peter C B, 1977. "A General Theorem in the Theory of Asymptotic Expansions as Approximations to the Finite Sample Distributions of Econometric Estimators," Econometrica, Econometric Society, vol. 45(6), pages 1517-1534, September.
    15. Chung, Kee H & Cox, Raymond A K, 1990. "Patterns of Productivity in the Finance Literature: A Study of the Bibliometric Distributions," Journal of Finance, American Finance Association, vol. 45(1), pages 301-309, March.
    16. Bárbara S. Lancho-Barrantes & Vicente P. Guerrero-Bote & Félix Moya-Anegón, 2010. "The iceberg hypothesis revisited," Scientometrics, Springer;Akadémiai Kiadó, vol. 85(2), pages 443-461, November.
    17. S. Redner, 1998. "How popular is your paper? An empirical study of the citation distribution," The European Physical Journal B: Condensed Matter and Complex Systems, Springer;EDP Sciences, vol. 4(2), pages 131-134, July.
    18. Ruiz-Castillo, Javier & Costas, Rodrigo, 2014. "The skewness of scientific productivity," Journal of Informetrics, Elsevier, vol. 8(4), pages 917-934.
    19. Perc, Matjaž, 2010. "Zipf’s law and log-normal distributions in measures of scientific output across fields and institutions: 40 years of Slovenia’s research as an example," Journal of Informetrics, Elsevier, vol. 4(3), pages 358-364.
    20. Ñíguez, Trino-Manuel & Paya, Ivan & Peel, David & Perote, Javier, 2012. "On the stability of the constant relative risk aversion (CRRA) utility under high degrees of uncertainty," Economics Letters, Elsevier, vol. 115(2), pages 244-248.
    21. Chen, Xiaohong, 2007. "Large Sample Sieve Estimation of Semi-Nonparametric Models," Handbook of Econometrics, in: J.J. Heckman & E.E. Leamer (ed.), Handbook of Econometrics, edition 1, volume 6, chapter 76, Elsevier.
    22. Campanario, Juan Miguel, 2015. "Providing impact: The distribution of JCR journals according to references they contribute to the 2-year and 5-year journal impact factors," Journal of Informetrics, Elsevier, vol. 9(2), pages 398-407.
    23. Ignacio Mauleon & Javier Perote, 2000. "Testing densities with financial data: an empirical comparison of the Edgeworth-Sargan density to the Student's t," The European Journal of Finance, Taylor & Francis Journals, vol. 6(2), pages 225-239.
    24. Anne-Wil Harzing & Satu Alakangas, 2016. "Google Scholar, Scopus and the Web of Science: a longitudinal and cross-disciplinary comparison," Scientometrics, Springer;Akadémiai Kiadó, vol. 106(2), pages 787-804, February.
    25. Jordi Duch & Xiao Han T Zeng & Marta Sales-Pardo & Filippo Radicchi & Shayna Otis & Teresa K Woodruff & Luís A Nunes Amaral, 2012. "The Possible Role of Resource Requirements and Academic Career-Choice Risk on Gender Differences in Publication Rate and Impact," PLOS ONE, Public Library of Science, vol. 7(12), pages 1-11, December.
    26. Trino-Manuel Niguez & Ivan Paya & David Peel & Javier Perote, 2013. "Higher-order moments in the theory of diversification and portfolio composition," Working Papers 18297128, Lancaster University Management School, Economics Department.
    27. Hodgson, Geoffrey M & Rothman, Harry, 1999. "The Editors and Authors of Economics Journals: A Case of Institutional Oligopoly?," Economic Journal, Royal Economic Society, vol. 109(453), pages 165-186, February.
    28. Del Brio, Esther B. & Perote, Javier, 2012. "Gram–Charlier densities: Maximum likelihood versus the method of moments," Insurance: Mathematics and Economics, Elsevier, vol. 51(3), pages 531-537.
    29. Anne-Wil Harzing, 2014. "A longitudinal study of Google Scholar coverage between 2012 and 2013," Scientometrics, Springer;Akadémiai Kiadó, vol. 98(1), pages 565-575, January.
    30. Borokhovich, Kenneth A, et al, 1995. "Finance Research Productivity and Influence," Journal of Finance, American Finance Association, vol. 50(5), pages 1691-1717, December.
    31. Kaur, Jasleen & Radicchi, Filippo & Menczer, Filippo, 2013. "Universality of scholarly impact metrics," Journal of Informetrics, Elsevier, vol. 7(4), pages 924-932.
    32. Kaur, Jasleen & Ferrara, Emilio & Menczer, Filippo & Flammini, Alessandro & Radicchi, Filippo, 2015. "Quality versus quantity in scientific impact," Journal of Informetrics, Elsevier, vol. 9(4), pages 800-808.
    33. Finardi, Ugo, 2013. "Correlation between Journal Impact Factor and Citation Performance: An experimental study," Journal of Informetrics, Elsevier, vol. 7(2), pages 357-370.
    34. Day, Theodore Eugene, 2015. "The big consequences of small biases: A simulation of peer review," Research Policy, Elsevier, vol. 44(6), pages 1266-1270.

    Citations

    Cited by:

    1. Alfredo Trespalacios & Lina M. Cortés & Javier Perote, 2019. "Modeling the electricity spot price with switching regime semi-nonparametric distributions," Documentos de Trabajo de Valor Público 17618, Universidad EAFIT.
    2. Lina Cortés & Juan M. Lozada & Javier Perote, 2019. "Firm size and concentration inequality: A flexible extension of Gibrat’s law," Documentos de Trabajo de Valor Público 17205, Universidad EAFIT.
    3. Lina M. Cortés & Javier Perote & Andrés Mora-Valencia, 2017. "Implicit probability distribution for WTI options: The Black Scholes vs. the semi-nonparametric approach," Documentos de Trabajo de Valor Público 15923, Universidad EAFIT.
    4. Alfredo Trespalacios & Lina M. Cortés & Javier Perote, 2021. "Modeling Electricity Price and Quantity Uncertainty: An Application for Hedging with Forward Contracts," Energies, MDPI, vol. 14(11), pages 1-26, June.
    5. Lina M Cortés & Juan M Lozada & Javier Perote, 2021. "Firm size and economic concentration: An analysis from a lognormal expansion," PLOS ONE, Public Library of Science, vol. 16(7), pages 1-21, July.
    6. Trespalacios, Alfredo & Cortés, Lina M. & Perote, Javier, 2020. "Uncertainty in electricity markets from a semi-nonparametric approach," Energy Policy, Elsevier, vol. 137(C).
    7. Marek Kwiek, 2018. "High research productivity in vertically undifferentiated higher education systems: Who are the top performers?," Scientometrics, Springer;Akadémiai Kiadó, vol. 115(1), pages 415-462, April.
    8. Jiménez, Inés & Mora-Valencia, Andrés & Perote, Javier, 2023. "Multivariate dynamics between emerging markets and digital asset markets: An application of the SNP-DCC model," Emerging Markets Review, Elsevier, vol. 56(C).
    9. Robert A. Buckle & John Creedy, 2019. "An evaluation of metrics used by the Performance-based Research Fund process in New Zealand," New Zealand Economic Papers, Taylor & Francis Journals, vol. 53(3), pages 270-287, September.
    10. Cortés, Lina M. & Mora-Valencia, Andrés & Perote, Javier, 2020. "Retrieving the implicit risk neutral density of WTI options with a semi-nonparametric approach," The North American Journal of Economics and Finance, Elsevier, vol. 54(C).
    11. Cortés, Lina M. & Mora-Valencia, Andrés & Perote, Javier, 2017. "Measuring firm size distribution with semi-nonparametric densities," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 485(C), pages 35-47.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Kaur, Jasleen & Ferrara, Emilio & Menczer, Filippo & Flammini, Alessandro & Radicchi, Filippo, 2015. "Quality versus quantity in scientific impact," Journal of Informetrics, Elsevier, vol. 9(4), pages 800-808.
    2. Waltman, Ludo, 2016. "A review of the literature on citation impact indicators," Journal of Informetrics, Elsevier, vol. 10(2), pages 365-391.
    3. Del Brio, Esther B. & Perote, Javier, 2012. "Gram–Charlier densities: Maximum likelihood versus the method of moments," Insurance: Mathematics and Economics, Elsevier, vol. 51(3), pages 531-537.
    4. Trespalacios, Alfredo & Cortés, Lina M. & Perote, Javier, 2020. "Uncertainty in electricity markets from a semi-nonparametric approach," Energy Policy, Elsevier, vol. 137(C).
    5. Bouyssou, Denis & Marchant, Thierry, 2016. "Ranking authors using fractional counting of citations: An axiomatic approach," Journal of Informetrics, Elsevier, vol. 10(1), pages 183-199.
    6. Andrés Mora-Valencia & Trino-Manuel Ñíguez & Javier Perote, 2017. "Multivariate approximations to portfolio return distribution," Computational and Mathematical Organization Theory, Springer, vol. 23(3), pages 347-361, September.
    7. Del Brio, Esther B. & Mora-Valencia, Andrés & Perote, Javier, 2017. "The kidnapping of Europe: High-order moments' transmission between developed and emerging markets," Emerging Markets Review, Elsevier, vol. 31(C), pages 96-115.
    8. Bonaccorsi, Andrea & Haddawy, Peter & Cicero, Tindaro & Hassan, Saeed-Ul, 2017. "The solitude of stars. An analysis of the distributed excellence model of European universities," Journal of Informetrics, Elsevier, vol. 11(2), pages 435-454.
    9. Jiménez, Inés & Mora-Valencia, Andrés & Perote, Javier, 2022. "Semi-nonparametric risk assessment with cryptocurrencies," Research in International Business and Finance, Elsevier, vol. 59(C).
    10. Yin, Yian & Wang, Dashun, 2017. "The time dimension of science: Connecting the past to the future," Journal of Informetrics, Elsevier, vol. 11(2), pages 608-621.
    11. Lina Cortés & Juan M. Lozada & Javier Perote, 2019. "Firm size and concentration inequality: A flexible extension of Gibrat’s law," Documentos de Trabajo de Valor Público 17205, Universidad EAFIT.
    12. Victoria Anauati & Sebastian Galiani & Ramiro H. Gálvez, 2016. "Quantifying The Life Cycle Of Scholarly Articles Across Fields Of Economic Research," Economic Inquiry, Western Economic Association International, vol. 54(2), pages 1339-1355, April.
    13. Cortés, Lina M. & Mora-Valencia, Andrés & Perote, Javier, 2017. "Measuring firm size distribution with semi-nonparametric densities," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 485(C), pages 35-47.
    14. Ñíguez, Trino-Manuel & Perote, Javier, 2016. "Multivariate moments expansion density: Application of the dynamic equicorrelation model," Journal of Banking & Finance, Elsevier, vol. 72(S), pages 216-232.
    15. Sergio Copiello, 2019. "The open access citation premium may depend on the openness and inclusiveness of the indexing database, but the relationship is controversial because it is ambiguous where the open access boundary lies," Scientometrics, Springer;Akadémiai Kiadó, vol. 121(2), pages 995-1018, November.
    16. Tol, Richard S.J., 2013. "The Matthew effect for cohorts of economists," Journal of Informetrics, Elsevier, vol. 7(2), pages 522-527.
    17. Trino-Manuel Ñíguez & Javier Perote, 2012. "Forecasting Heavy-Tailed Densities with Positive Edgeworth and Gram-Charlier Expansions," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 74(4), pages 600-627, August.
    18. Del Brio, Esther B. & Mora-Valencia, Andrés & Perote, Javier, 2014. "Semi-nonparametric VaR forecasts for hedge funds during the recent crisis," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 401(C), pages 330-343.
    19. Trino-Manuel Niguez & Ivan Paya & David Peel & Javier Perote, 2013. "Higher-order moments in the theory of diversification and portfolio composition," Working Papers 18297128, Lancaster University Management School, Economics Department.
    20. John Mingers & Jesse R. O’Hanley & Musbaudeen Okunola, 2017. "Using Google Scholar institutional level data to evaluate the quality of university research," Scientometrics, Springer;Akadémiai Kiadó, vol. 113(3), pages 1627-1643, December.

    More about this item

    Keywords

    Research evaluation; Research productivity; Heavy tail distributions; Semi-nonparametric modeling

    JEL classification:

    • C14 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Semiparametric and Nonparametric Methods: General
    • C44 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods: Special Topics - - - Operations Research; Statistical Decision Theory
    • C53 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Forecasting and Prediction Models; Simulation Methods
