IDEAS home Printed from https://ideas.repec.org/a/eee/infome/v15y2021i1s1751157720306246.html
   My bibliography  Save this article

Use of classification trees and rule-based models to optimize the funding assignment to research projects: A case study of UTPL

Author

Listed:
  • Fernandez Martinez, Roberto
  • Lostado Lorza, Ruben
  • Santos Delgado, Ana Alexandra
  • Piedra, Nelson

Abstract

In the process of funding research projects, two important factors must be studied. First, experts judges the potential value of a project. Secondly, the research ability is judged by the applicants previous research activity. The most appropriate way to assign the appropriate amount of money to project proposals is always a difficult decision. This work focuses on the second factor based on classifying the researchers previous research activity on an automated logical classification (accepted, rejected) resolving conflicts of interests between administration and applicants and helping in the decision-making process. As the class in these kinds of studies is usually unbalanced, because there are fewer accepted projects than rejected projects, how the use of an imbalanced dataset or a balanced dataset affects to the models is investigated by using several resampling methods. Later, several trees and rule-based machine learning techniques are used to create classification models. This is based on information from the faculty members information of the “Technical Particular University of Loja (UTPL),” in cases, with balanced datasets and those with unbalanced datasets. Multivariate analysis, feature selection, algorithm parameter tuning and validation methods are used to achieve robust classification models. The most accurate results are obtained with a rules-based model and use of the C5.0 algorithm. As the latter provides acceptable accuracy, close to 95 % when predicting both classes and to 99 % when predicting the accepted projects class, both the methodology and final model are validated.

Suggested Citation

  • Fernandez Martinez, Roberto & Lostado Lorza, Ruben & Santos Delgado, Ana Alexandra & Piedra, Nelson, 2021. "Use of classification trees and rule-based models to optimize the funding assignment to research projects: A case study of UTPL," Journal of Informetrics, Elsevier, vol. 15(1).
  • Handle: RePEc:eee:infome:v:15:y:2021:i:1:s1751157720306246
    DOI: 10.1016/j.joi.2020.101107
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S1751157720306246
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.joi.2020.101107?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Kulczycki, Emanuel & Korzeń, Marcin & Korytkowski, Przemysław, 2017. "Toward an excellence-based research funding system: Evidence from Poland," Journal of Informetrics, Elsevier, vol. 11(1), pages 282-298.
    2. Mayra Z Rodriguez & Cesar H Comin & Dalcimar Casanova & Odemir M Bruno & Diego R Amancio & Luciano da F Costa & Francisco A Rodrigues, 2019. "Clustering algorithms: A comparative approach," PLOS ONE, Public Library of Science, vol. 14(1), pages 1-34, January.
    3. Győrffy, Balázs & Herman, Péter & Szabó, István, 2020. "Research funding: past performance is a stronger predictor of future scientific output than reviewer scores," Journal of Informetrics, Elsevier, vol. 14(3).
    4. Jefferson Seide Molléri & Kai Petersen & Emilia Mendes, 2018. "Towards understanding the relation between citations and research quality in software engineering studies," Scientometrics, Springer;Akadémiai Kiadó, vol. 117(3), pages 1453-1478, December.
    5. Jinseok Kim & Jenna Kim, 2018. "The impact of imbalanced training data on machine learning for author name disambiguation," Scientometrics, Springer;Akadémiai Kiadó, vol. 117(1), pages 511-526, October.
    6. Per Ahlgren & Cristian Colliander & Olle Persson, 2012. "Field normalized citation rates, field normalized journal impact and Norwegian weights for allocation of university research funds," Scientometrics, Springer;Akadémiai Kiadó, vol. 92(3), pages 767-780, September.
    7. Butler, Linda, 2003. "Explaining Australia's increased share of ISI publications--the effects of a funding formula based on publication counts," Research Policy, Elsevier, vol. 32(1), pages 143-155, January.
    8. Cruz-Castro, Laura & Sanz-Menéndez, Luis, 2016. "The effects of the economic crisis on public research: Spanish budgetary policies and research organizations," Technological Forecasting and Social Change, Elsevier, vol. 113(PB), pages 157-167.
    9. Canhoto, Ana Isabel & Clear, Fintan, 2020. "Artificial intelligence and machine learning as business tools: A framework for diagnosing value destruction potential," Business Horizons, Elsevier, vol. 63(2), pages 183-193.
    10. Braun, Dietmar, 1998. "The role of funding agencies in the cognitive development of science," Research Policy, Elsevier, vol. 27(8), pages 807-821, December.
    11. Ashkan Ebadi & Andrea Schiffauerova, 2016. "How to boost scientific production? A statistical analysis of research funding and other influencing factors," Scientometrics, Springer;Akadémiai Kiadó, vol. 106(3), pages 1093-1116, March.
    12. Grubinger, Thomas & Zeileis, Achim & Pfeiffer, Karl-Peter, 2014. "evtree: Evolutionary Learning of Globally Optimal Classification and Regression Trees in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 61(i01).
    13. Ebadi, Ashkan & Tremblay, Stéphane & Goutte, Cyril & Schiffauerova, Andrea, 2020. "Application of machine learning techniques to assess the trends and alignment of the funded research output," Journal of Informetrics, Elsevier, vol. 14(2).
    14. Simon Hirzel & Tim Hettesheimer & Peter Viebahn & Manfred Fischedick, 2018. "A Decision Support System for Public Funding of Experimental Development in Energy Research," Energies, MDPI, vol. 11(6), pages 1-18, May.
    15. Saarela, Mirka & Kärkkäinen, Tommi & Lahtonen, Tommi & Rossi, Tuomo, 2016. "Expert-based versus citation-based ranking of scholarly and scientific publication channels," Journal of Informetrics, Elsevier, vol. 10(3), pages 693-718.
    16. Saarela, Mirka & Kärkkäinen, Tommi, 2020. "Can we automate expert-based journal rankings? Analysis of the Finnish publication indicator," Journal of Informetrics, Elsevier, vol. 14(2).
    17. Sandström, Ulf & Van den Besselaar, Peter, 2018. "Funding, evaluation, and the performance of national research systems," Journal of Informetrics, Elsevier, vol. 12(1), pages 365-384.
    18. Subochev, Andrey & Aleskerov, Fuad & Pislyakov, Vladimir, 2018. "Ranking journals using social choice theory methods: A novel approach in bibliometrics," Journal of Informetrics, Elsevier, vol. 12(2), pages 416-429.
    19. Wei-dong Zhu & Fang Liu & Yu-wang Chen & Jian-bo Yang & Dong-ling Xu & Dong-peng Wang, 2015. "Research project evaluation and selection: an evidential reasoning rule-based method for aggregating peer review information with reliabilities," Scientometrics, Springer;Akadémiai Kiadó, vol. 105(3), pages 1469-1490, December.
    20. Wade D. Cook & Boaz Golany & Moshe Kress & Michal Penn & Tal Raviv, 2005. "Optimal Allocation of Proposals to Reviewers to Facilitate Effective Ranking," Management Science, INFORMS, vol. 51(4), pages 655-661, April.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Li, Heyang & Wu, Meijun & Wang, Yougui & Zeng, An, 2022. "Bibliographic coupling networks reveal the advantage of diversification in scientific projects," Journal of Informetrics, Elsevier, vol. 16(3).
    2. Wang, Zhenhua & Ren, Ming & Gao, Dong & Li, Zhuang, 2023. "A Zipf's law-based text generation approach for addressing imbalance in entity extraction," Journal of Informetrics, Elsevier, vol. 17(4).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Saarela, Mirka & Kärkkäinen, Tommi, 2020. "Can we automate expert-based journal rankings? Analysis of the Finnish publication indicator," Journal of Informetrics, Elsevier, vol. 14(2).
    2. Muhammad Dimyati & Adhi Indra Hermanu, 2023. "Evaluating Research Efficiency in Indonesian Higher Education Institution," Evaluation Review, , vol. 47(2), pages 155-181, April.
    3. Hladchenko, Myroslava & Moed, Henk F., 2021. "The effect of publication traditions and requirements in research assessment and funding policies upon the use of national journals in 28 post-socialist countries," Journal of Informetrics, Elsevier, vol. 15(4).
    4. Renata Kudaibergenova & Sandugash Uzakbay & Asselya Makanova & Kymbat Ramadinkyzy & Erlan Kistaubayev & Ruslan Dussekeev & Kadyrzhan Smagulov, 2022. "Managing publication change at Al-Farabi Kazakh National University: a case study," Scientometrics, Springer;Akadémiai Kiadó, vol. 127(1), pages 453-479, January.
    5. Li, Heyang & Wu, Meijun & Wang, Yougui & Zeng, An, 2022. "Bibliographic coupling networks reveal the advantage of diversification in scientific projects," Journal of Informetrics, Elsevier, vol. 16(3).
    6. Shahd Al-Janabi & Lee Wei Lim & Luca Aquili, 2021. "Development of a tool to accurately predict UK REF funding allocation," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(9), pages 8049-8062, September.
    7. Auranen, Otto & Nieminen, Mika, 2010. "University research funding and publication performance--An international comparison," Research Policy, Elsevier, vol. 39(6), pages 822-834, July.
    8. Tóth, Tamás & Demeter, Márton & Csuhai, Sándor & Major, Zsolt Balázs, 2024. "When career-boosting is on the line: Equity and inequality in grant evaluation, productivity, and the educational backgrounds of Marie Skłodowska-Curie Actions individual fellows in social sciences an," Journal of Informetrics, Elsevier, vol. 18(2).
    9. Simon Hirzel & Tim Hettesheimer & Peter Viebahn & Manfred Fischedick, 2018. "A Decision Support System for Public Funding of Experimental Development in Energy Research," Energies, MDPI, vol. 11(6), pages 1-18, May.
    10. Star X. Zhao & Wen Lou & Alice M. Tan & Shuang Yu, 2018. "Do funded papers attract more usage?," Scientometrics, Springer;Akadémiai Kiadó, vol. 115(1), pages 153-168, April.
    11. Hajibabaei, Anahita & Schiffauerova, Andrea & Ebadi, Ashkan, 2022. "Gender-specific patterns in the artificial intelligence scientific ecosystem," Journal of Informetrics, Elsevier, vol. 16(2).
    12. Buehling, Kilian, 2021. "Changing research topic trends as an effect of publication rankings – The case of German economists and the Handelsblatt Ranking," Journal of Informetrics, Elsevier, vol. 15(3).
    13. Ebadi, Ashkan & Tremblay, Stéphane & Goutte, Cyril & Schiffauerova, Andrea, 2020. "Application of machine learning techniques to assess the trends and alignment of the funded research output," Journal of Informetrics, Elsevier, vol. 14(2).
    14. Fernanda Morillo, 2019. "Collaboration and impact of research in different disciplines with international funding (from the EU and other foreign sources)," Scientometrics, Springer;Akadémiai Kiadó, vol. 120(2), pages 807-823, August.
    15. Saarela, Mirka & Kärkkäinen, Tommi & Lahtonen, Tommi & Rossi, Tuomo, 2016. "Expert-based versus citation-based ranking of scholarly and scientific publication channels," Journal of Informetrics, Elsevier, vol. 10(3), pages 693-718.
    16. Rongying Zhao & Xinlai Li & Zhisen Liang & Danyang Li, 2019. "Development strategy and collaboration preference in S&T of enterprises based on funded papers: a case study of Google," Scientometrics, Springer;Akadémiai Kiadó, vol. 121(1), pages 323-347, October.
    17. Fedderke, J.W. & Goldschmidt, M., 2015. "Does massive funding support of researchers work?: Evaluating the impact of the South African research chair funding initiative," Research Policy, Elsevier, vol. 44(2), pages 467-482.
    18. Przemysław Korytkowski & Emanuel Kulczycki, 2019. "Examining how country-level science policy shapes publication patterns: the case of Poland," Scientometrics, Springer;Akadémiai Kiadó, vol. 119(3), pages 1519-1543, June.
    19. Emanuel Kulczycki & Ying Huang & Alesia A. Zuccala & Tim C. E. Engels & Antonio Ferrara & Raf Guns & Janne Pölönen & Gunnar Sivertsen & Zehra Taşkın & Lin Zhang, 2022. "Uses of the Journal Impact Factor in national journal rankings in China and Europe," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 73(12), pages 1741-1754, December.
    20. Hren, Darko & Pina, David G. & Norman, Christopher R. & Marušić, Ana, 2022. "What makes or breaks competitive research proposals? A mixed-methods analysis of research grant evaluation reports," Journal of Informetrics, Elsevier, vol. 16(2).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:infome:v:15:y:2021:i:1:s1751157720306246. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/joi .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.