IDEAS home Printed from https://ideas.repec.org/a/spr/stmapp/v22y2013i2p269-283.html
   My bibliography  Save this article

Robust analysis of bibliometric data

Author

Listed:
  • Francesca De Battisti
  • Silvia Salini

Abstract

This work stems from the idea of describing the scientific productivity of Italian statisticians. There are several problems that must be addressed in achieving this goal: What data should be used? Have the data been cleaned? What techniques can be used? We propose the use of multiple sources and multiple metrics to get a complete information base. We check the correctness of the data using multivariate outlier identification techniques. We appropriately transform the data. We apply robust clustering to verify the existence of homogeneous groups. We suggest the use of forward search to establish a ranking among scholars. The proposed methodology, which, in this case, allowed us to group scholars into four homogeneous groups and sort them according to multidimensional data, can be applied to other similar applications in bibliometrics. Copyright Springer-Verlag Berlin Heidelberg 2013

Suggested Citation

  • Francesca De Battisti & Silvia Salini, 2013. "Robust analysis of bibliometric data," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 22(2), pages 269-283, June.
  • Handle: RePEc:spr:stmapp:v:22:y:2013:i:2:p:269-283
    DOI: 10.1007/s10260-012-0217-0
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1007/s10260-012-0217-0
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1007/s10260-012-0217-0?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to look for a different version below or search for a different version of it.

    Other versions of this item:

    References listed on IDEAS

    as
    1. Alfio Ferrara & Silvia Salini, 2012. "Ten challenges in modeling bibliographic data for bibliometric analysis," Scientometrics, Springer;Akadémiai Kiadó, vol. 93(3), pages 765-785, December.
    2. Norris, Michael & Oppenheim, Charles, 2007. "Comparing alternatives to the Web of Science for coverage of the social sciences’ literature," Journal of Informetrics, Elsevier, vol. 1(2), pages 161-169.
    3. Éric Archambault & David Campbell & Yves Gingras & Vincent Larivière, 2009. "Comparing bibliometric statistics obtained from the Web of Science and Scopus," Journal of the American Society for Information Science and Technology, Association for Information Science & Technology, vol. 60(7), pages 1320-1326, July.
    4. Benoît Godin, 2006. "On the origins of bibliometrics," Scientometrics, Springer;Akadémiai Kiadó, vol. 68(1), pages 109-133, July.
    5. Pablo D. Batista & Mônica G. Campiteli & Osame Kinouchi, 2006. "Is it possible to compare researchers with different scientific interests?," Scientometrics, Springer;Akadémiai Kiadó, vol. 68(1), pages 179-189, July.
    6. Baccini, Alberto & Barabesi, Lucio, 2011. "Seats at the table: The network of the editorial boards in information and library science," Journal of Informetrics, Elsevier, vol. 5(3), pages 382-391.
    7. Thierry Marchant, 2009. "An axiomatic characterization of the ranking based on the h-index and some other bibliometric rankings of authors," Scientometrics, Springer;Akadémiai Kiadó, vol. 80(2), pages 325-342, August.
    8. Filzmoser, Peter & Maronna, Ricardo & Werner, Mark, 2008. "Outlier identification in high dimensions," Computational Statistics & Data Analysis, Elsevier, vol. 52(3), pages 1694-1711, January.
    9. Giulia Rivellini & Ester Rizzi & Susanna Zaccarin, 2006. "The science network in Italian population research: An analysis according to the social network perspective," Scientometrics, Springer;Akadémiai Kiadó, vol. 67(3), pages 407-418, June.
    10. Massimo Franceschet, 2010. "A comparison of bibliometric indicators for computer science scholars and journals on Web of Science and Google Scholar," Scientometrics, Springer;Akadémiai Kiadó, vol. 83(1), pages 243-258, April.
    11. Atkinson, A.C. & Riani, M., 2007. "Exploratory tools for clustering multivariate data," Computational Statistics & Data Analysis, Elsevier, vol. 52(1), pages 272-285, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Claudio Giachetti & Giancarlo Manzi & Cinzia Colapinto, 2019. "Entry Mode Degree of Control, Firm Performance and Host Country Institutional Development: A Meta-Analysis," Management International Review, Springer, vol. 59(1), pages 3-39, February.
    2. Yajie Zhang & Qiang Yu, 2020. "What is the best article publishing strategy for early career scientists?," Scientometrics, Springer;Akadémiai Kiadó, vol. 122(1), pages 397-408, January.
    3. Waleed M. Sweileh & Sa’ed H. Zyoud & Samah W. Al-Jabi & Ansam F. Sawalha, 2014. "Bibliometric analysis of diabetes mellitus research output from Middle Eastern Arab countries during the period (1996–2012)," Scientometrics, Springer;Akadémiai Kiadó, vol. 101(1), pages 819-832, October.
    4. Silvia Salini & Andrea Cerioli & Fabrizio Laurini & Marco Riani, 2016. "Reliable Robust Regression Diagnostics," International Statistical Review, International Statistical Institute, vol. 84(1), pages 99-127, April.
    5. Chioma Okoro & Oluwatobi Mary Owojori & Nnedinma Umeokafor, 2022. "The Developmental Trajectory of a Decade of Research on Mental Health and Well-Being amongst Graduate Students: A Bibliometric Analysis," IJERPH, MDPI, vol. 19(9), pages 1-20, April.
    6. Lorna Wildgaard, 2015. "A comparison of 17 author-level bibliometric indicators for researchers in Astronomy, Environmental Science, Philosophy and Public Health in Web of Science and Google Scholar," Scientometrics, Springer;Akadémiai Kiadó, vol. 104(3), pages 873-906, September.
    7. Andrea Cerioli & Domenico Perrotta, 2014. "Robust clustering around regression lines with high density regions," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 8(1), pages 5-26, March.
    8. Cerioli, Andrea & Farcomeni, Alessio & Riani, Marco, 2014. "Strong consistency and robustness of the Forward Search estimator of multivariate location and scatter," Journal of Multivariate Analysis, Elsevier, vol. 126(C), pages 167-183.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Waltman, Ludo, 2016. "A review of the literature on citation impact indicators," Journal of Informetrics, Elsevier, vol. 10(2), pages 365-391.
    2. Francesca DE BATTISTI & Silvia SALINI, 2011. "Robust analysis of bibliometric data," Departmental Working Papers 2011-36, Department of Economics, Management and Quantitative Methods at Università degli Studi di Milano.
    3. Gaviria-Marin, Magaly & Merigó, José M. & Baier-Fuentes, Hugo, 2019. "Knowledge management: A global examination based on bibliometric analysis," Technological Forecasting and Social Change, Elsevier, vol. 140(C), pages 194-220.
    4. Basma Albanna & Julia Handl & Richard Heeks, 2021. "Publication outperformance among global South researchers: An analysis of individual-level and publication-level predictors of positive deviance," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(10), pages 8375-8431, October.
    5. Bar-Ilan, Judit, 2008. "Informetrics at the beginning of the 21st century—A review," Journal of Informetrics, Elsevier, vol. 2(1), pages 1-52.
    6. Muzammil Tahira & Rose Alinda Alias & Aryati Bakri, 2013. "Scientometric assessment of engineering in Malaysians universities," Scientometrics, Springer;Akadémiai Kiadó, vol. 96(3), pages 865-879, September.
    7. Maxim N. Kotsemir & Tatiana E. Kuznetsova & Elena G. Nasybulina & Anna G. Pikalova, 2015. "Empirical Analysis of Multinational S&T Collaboration Priorities –The Case of Russia," HSE Working papers WP BRP 53/STI/2015, National Research University Higher School of Economics.
    8. Luigi Aldieri & Gennaro Guida & Maxim Kotsemir & Concetto Paolo Vinci, 2019. "An investigation of impact of research collaboration on academic performance in Italy," Quality & Quantity: International Journal of Methodology, Springer, vol. 53(4), pages 2003-2040, July.
    9. Nga Le Thi Quynh & Groot, Wim & Tomini, Sonila M. & Tomini, Florian, 2017. "Effects of health insurance on labour supply: A systematic review," MERIT Working Papers 2017-017, United Nations University - Maastricht Economic and Social Research Institute on Innovation and Technology (MERIT).
    10. Antonio Cavacini, 2015. "What is the best database for computer science journal articles?," Scientometrics, Springer;Akadémiai Kiadó, vol. 102(3), pages 2059-2071, March.
    11. José Álvarez-García & Amador Durán-Sánchez & María de la Cruz Del Río-Rama & Diego Fernando García-Vélez, 2018. "Active Ageing: Mapping of Scientific Coverage," IJERPH, MDPI, vol. 15(12), pages 1-21, December.
    12. Mingers, John & Leydesdorff, Loet, 2015. "A review of theory and practice in scientometrics," European Journal of Operational Research, Elsevier, vol. 246(1), pages 1-19.
    13. H. Kent Baker & Satish Kumar & Kirti Goyal & Prashant Gupta, 2023. "International journal of finance and economics: A bibliometric overview," International Journal of Finance & Economics, John Wiley & Sons, Ltd., vol. 28(1), pages 9-46, January.
    14. Lal, Madan & Kumar, Satish & Pandey, Dharen Kumar & Rai, Varun Kumar & Lim, Weng Marc, 2023. "Exchange rate volatility and international trade," Journal of Business Research, Elsevier, vol. 167(C).
    15. Brandão, Luana Carneiro & Soares de Mello, João Carlos Correia Baptista, 2019. "A multi-criteria approach to the h-index," European Journal of Operational Research, Elsevier, vol. 276(1), pages 357-363.
    16. George Emm Halkos & Nickolaos G. Tzeremes, 2011. "Measuring economic journals’ citation efficiency: a data envelopment analysis approach," Scientometrics, Springer;Akadémiai Kiadó, vol. 88(3), pages 979-1001, September.
    17. Saïd Echchakoui, 2020. "Why and how to merge Scopus and Web of Science during bibliometric analysis: the case of sales force literature from 1912 to 2019," Journal of Marketing Analytics, Palgrave Macmillan, vol. 8(3), pages 165-184, September.
    18. Ana Paula dos Santos Rubem & Ariane Lima Moura & João Carlos Correia Baptista Soares de Mello, 2015. "Comparative analysis of some individual bibliometric indices when applied to groups of researchers," Scientometrics, Springer;Akadémiai Kiadó, vol. 102(1), pages 1019-1035, January.
    19. Saïd Echchakoui, 0. "Why and how to merge Scopus and Web of Science during bibliometric analysis: the case of sales force literature from 1912 to 2019," Journal of Marketing Analytics, Palgrave Macmillan, vol. 0, pages 1-20.
    20. Pandey, Dharen Kumar & Hunjra, Ahmed Imran & Hassan, M. Kabir & Rai, Varun Kumar, 2023. "Venture capital financing during crises: A bibliometric review," Research in International Business and Finance, Elsevier, vol. 64(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:stmapp:v:22:y:2013:i:2:p:269-283. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.