IDEAS home Printed from https://ideas.repec.org/a/spr/aodasc/v10y2023i4d10.1007_s40745-021-00350-z.html
   My bibliography  Save this article

The Zero-Inflated Negative Binomial Semiparametric Regression Model: Application to Number of Failing Grades Data

Author

Listed:
  • Elton G. Aráujo

    (Universidade Federal de Mato Grosso do Sul)

  • Julio C. S. Vasconcelos

    (Universidade de São Paulo)

  • Denize P. Santos

    (Universidade de São Paulo)

  • Edwin M. M. Ortega

    (Universidade de São Paulo)

  • Dalton Souza

    (Universidade Federal de Mato Grosso do Sul)

  • João P. F. Zanetoni

    (Universidade Federal de Mato Grosso do Sul)

Abstract

In this paper we study the performance of college students, measured by the number of failing grades, considering various covariables that can positively or negatively influence this performance. The students in the sample were undergraduate business majors studying at night at a federal public university in the state of Mato Grosso do Sul, Brazil. Among the factors considered are covariables that had a linear and nonlinear relationship with the students’ performance. We also observed a high percentage of zeros, the reason we used a zero-inflated semiparametric regression model based on the negative binomial distribution to analyze our dataset.We used the penalized maximum likelihood method along with analysis of the residuals to verify the model’s assumptions. We present results, discussion and conclusions about the number of subjects failed by the students.

Suggested Citation

  • Elton G. Aráujo & Julio C. S. Vasconcelos & Denize P. Santos & Edwin M. M. Ortega & Dalton Souza & João P. F. Zanetoni, 2023. "The Zero-Inflated Negative Binomial Semiparametric Regression Model: Application to Number of Failing Grades Data," Annals of Data Science, Springer, vol. 10(4), pages 991-1006, August.
  • Handle: RePEc:spr:aodasc:v:10:y:2023:i:4:d:10.1007_s40745-021-00350-z
    DOI: 10.1007/s40745-021-00350-z
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s40745-021-00350-z
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s40745-021-00350-z?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Feng Liu & Yong Shi, 2020. "Investigating Laws of Intelligence Based on AI IQ Research," Annals of Data Science, Springer, vol. 7(3), pages 399-416, September.
    2. Germán Ibacache-Pulgar & Gilberto Paula & Francisco Cysneiros, 2013. "Semiparametric additive models under symmetric distributions," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 22(1), pages 103-121, March.
    3. Md. Karimuzzaman & Nusrat Islam & Sabrina Afroz & Md. Moyazzem Hossain, 2021. "Predicting Stock Market Price of Bangladesh: A Comparative Study of Linear Classification Models," Annals of Data Science, Springer, vol. 8(1), pages 21-38, March.
    4. F. Belloc & A. Maruotti & L. Petrella, 2011. "How individual characteristics affect university students drop-out: a semiparametric mixed-effects model for an Italian case study," Journal of Applied Statistics, Taylor & Francis Journals, vol. 38(10), pages 2225-2239.
    5. Hashimoto, Elizabeth M. & Ortega, Edwin M.M. & Paula, Gilberto A. & Barreto, Mauricio L., 2011. "Regression models for grouped survival data: Estimation and sensitivity analysis," Computational Statistics & Data Analysis, Elsevier, vol. 55(2), pages 993-1007, February.
    6. Stasinopoulos, D. Mikis & Rigby, Robert A., 2007. "Generalized Additive Models for Location Scale and Shape (GAMLSS) in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 23(i07).
    7. Garay, Aldo M. & Hashimoto, Elizabeth M. & Ortega, Edwin M.M. & Lachos, Víctor H., 2011. "On estimation and influence diagnostics for zero-inflated negative binomial regression models," Computational Statistics & Data Analysis, Elsevier, vol. 55(3), pages 1304-1318, March.
    8. Xiaoli L. Etienne & Giancarlo Ferrara & Douglas Mugabe, 2019. "How efficient is maize production among smallholder farmers in Zimbabwe? A comparison of semiparametric and parametric frontier efficiency analyses," Applied Economics, Taylor & Francis Journals, vol. 51(26), pages 2855-2871, June.
    9. James M. Tien, 2017. "Internet of Things, Real-Time Decision Making, and Artificial Intelligence," Annals of Data Science, Springer, vol. 4(2), pages 149-178, June.
    10. Tsu‐Tan Fu & Cliff J. Huang & Yung‐Lieh Yang, 2011. "Quality And Economies Of Scale In Higher Education: A Semiparametric Smooth Coefficient Estimation," Contemporary Economic Policy, Western Economic Association International, vol. 29(1), pages 138-149, January.
    11. R. A. Rigby & D. M. Stasinopoulos, 2005. "Generalized additive models for location, scale and shape," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 54(3), pages 507-554, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Luis Vanegas & Gilberto Paula, 2015. "A semiparametric approach for joint modeling of median and skewness," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 24(1), pages 110-135, March.
    2. Yixuan Wang & Jianzhu Li & Ping Feng & Rong Hu, 2015. "A Time-Dependent Drought Index for Non-Stationary Precipitation Series," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 29(15), pages 5631-5647, December.
    3. Panayi, Efstathios & Peters, Gareth W. & Danielsson, Jon & Zigrand, Jean-Pierre, 2018. "Designating market maker behaviour in limit order book markets," Econometrics and Statistics, Elsevier, vol. 5(C), pages 20-44.
    4. Gauss Cordeiro & Josemar Rodrigues & Mário Castro, 2012. "The exponential COM-Poisson distribution," Statistical Papers, Springer, vol. 53(3), pages 653-664, August.
    5. Christian Kleiber & Achim Zeileis, 2016. "Visualizing Count Data Regressions Using Rootograms," The American Statistician, Taylor & Francis Journals, vol. 70(3), pages 296-303, July.
    6. Matteo Malavasi & Gareth W. Peters & Pavel V. Shevchenko & Stefan Truck & Jiwook Jang & Georgy Sofronov, 2021. "Cyber Risk Frequency, Severity and Insurance Viability," Papers 2111.03366, arXiv.org, revised Mar 2022.
    7. Tong, Edward N.C. & Mues, Christophe & Thomas, Lyn, 2013. "A zero-adjusted gamma model for mortgage loan loss given default," International Journal of Forecasting, Elsevier, vol. 29(4), pages 548-562.
    8. Xueyan Xu & Fusheng Yu & Runjun Wan, 2023. "A Determining Degree-Based Method for Classification Problems with Interval-Valued Attributes," Annals of Data Science, Springer, vol. 10(2), pages 393-413, April.
    9. D. Chiru Naik & Sagar Rohidas Chavan & P. Sonali, 2023. "Incorporating the climate oscillations in the computation of meteorological drought over India," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 117(3), pages 2617-2646, July.
    10. Kuntz, Laura-Chloé, 2020. "Beta dispersion and market timing," Journal of Empirical Finance, Elsevier, vol. 59(C), pages 235-256.
    11. I. Gijbels & I. Prosdocimi & G. Claeskens, 2010. "Nonparametric estimation of mean and dispersion functions in extended generalized linear models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 19(3), pages 580-608, November.
    12. Fábio Prataviera & Aline Martineli Batista & Edwin M. M. Ortega & Gauss M. Cordeiro & Bruno Montoani Silva, 2023. "The Logit Exponentiated Power Exponential Regression with Applications," Annals of Data Science, Springer, vol. 10(3), pages 713-735, June.
    13. Anda Tang & Pei Quan & Lingfeng Niu & Yong Shi, 2022. "A Survey for Sparse Regularization Based Compression Methods," Annals of Data Science, Springer, vol. 9(4), pages 695-722, August.
    14. Groll, Andreas & Hambuckers, Julien & Kneib, Thomas & Umlauf, Nikolaus, 2019. "LASSO-type penalization in the framework of generalized additive models for location, scale and shape," Computational Statistics & Data Analysis, Elsevier, vol. 140(C), pages 59-73.
    15. Westgate, Bradford S. & Woodard, Dawn B. & Matteson, David S. & Henderson, Shane G., 2016. "Large-network travel time distribution estimation for ambulances," European Journal of Operational Research, Elsevier, vol. 252(1), pages 322-333.
    16. Angelica Gianfreda & Derek Bunn, 2018. "A Stochastic Latent Moment Model for Electricity Price Formation," BEMPS - Bozen Economics & Management Paper Series BEMPS46, Faculty of Economics and Management at the Free University of Bozen.
    17. Komlos, John & Brabec, Marek, 2011. "The trend of BMI values of US adults by deciles, birth cohorts 1882-1986 stratified by gender and ethnicity," Economics & Human Biology, Elsevier, vol. 9(3), pages 234-250, July.
    18. Jussiane Nader Gonçalves & Wagner Barreto-Souza, 2020. "Flexible regression models for counts with high-inflation of zeros," METRON, Springer;Sapienza Università di Roma, vol. 78(1), pages 71-95, April.
    19. Jianzhu Li & Yuming Lei & Senming Tan & Colin D. Bell & Bernard A. Engel & Yixuan Wang, 2018. "Nonstationary Flood Frequency Analysis for Annual Flood Peak and Volume Series in Both Univariate and Bivariate Domain," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 32(13), pages 4239-4252, October.
    20. Uddameri, Venkatesh & Ghaseminejad, Ali & Hernandez, E. Annette, 2020. "A tiered stochastic framework for assessing crop yield loss risks due to water scarcity under different uncertainty levels," Agricultural Water Management, Elsevier, vol. 238(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:aodasc:v:10:y:2023:i:4:d:10.1007_s40745-021-00350-z. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.