IDEAS home Printed from https://ideas.repec.org/a/ntu/ntcmss/vol6-iss1-5-12.html
   My bibliography  Save this article

Appropriate statistical model for zero-inflated count data: simulation based study

Author

Listed:
  • Ayan CHOWDHURY

    (Department of Statistics, University of Chittagong, Bangladesh)

  • Soma Chowdhury BISWAS

    (Department of Statistics, University of Chittagong, Bangladesh)

Abstract

Zero-inflated count data is easily reached in real field. Over-dispersion is the consequence of zero inflation in count data. For modeling this kind of count data, several zero adjusted models such as Zero Inflated Poisson, Zero Inflated Negative Binomial, Hurdle Poisson and Hurdle Negative Binomial models are more suitable than basic statistical models. The best zero adjusted model selection is the key aim of this research. In this study, R code has been used to simulate datasets as well as to compare these models based on Akaike information criterion, Bayesian information criterion and Vuong test. The result of this study suggests that Hurdle Negative Binomial model has been preferred as the best fitted model for count data with excess of zero.

Suggested Citation

  • Ayan CHOWDHURY & Soma Chowdhury BISWAS, 2018. "Appropriate statistical model for zero-inflated count data: simulation based study," Computational Methods in Social Sciences (CMSS), "Nicolae Titulescu" University of Bucharest, Faculty of Economic Sciences, vol. 6(1), pages 5-12, June.
  • Handle: RePEc:ntu:ntcmss:vol6-iss1-5-12
    as

    Download full text from publisher

    File URL: http://cmss.univnt.ro/wp-content/uploads/vol/split/vol_VI_issue_1/CMSS_vol_VI_issue_1_art.001.pdf
    File Function: First version, 2018
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Vuong, Quang H, 1989. "Likelihood Ratio Tests for Model Selection and Non-nested Hypotheses," Econometrica, Econometric Society, vol. 57(2), pages 307-333, March.
    2. Zeileis, Achim & Kleiber, Christian & Jackman, Simon, 2008. "Regression Models for Count Data in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 27(i08).
    3. Mullahy, John, 1986. "Specification and testing of some modified count data models," Journal of Econometrics, Elsevier, vol. 33(3), pages 341-365, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Christian Balcells, 2022. "Determinants of firm boundaries and organizational performance: an empirical investigation of the Chilean truck market," Journal of Evolutionary Economics, Springer, vol. 32(2), pages 423-461, April.
    2. Addisu Jember Zeleke & Serena Moscato & Rossella Miglio & Lorenzo Chiari, 2022. "Length of Stay Analysis of COVID-19 Hospitalizations Using a Count Regression Model and Quantile Regression: A Study in Bologna, Italy," IJERPH, MDPI, vol. 19(4), pages 1-18, February.
    3. Silva João M. C. Santos & Tenreyro Silvana & Windmeijer Frank, 2015. "Testing Competing Models for Non-negative Data with Many Zeros," Journal of Econometric Methods, De Gruyter, vol. 4(1), pages 29-46, January.
    4. Greene, William, 2007. "Functional Form and Heterogeneity in Models for Count Data," Foundations and Trends(R) in Econometrics, now publishers, vol. 1(2), pages 113-218, August.
    5. Christopher J. W. Zorn, 1998. "An Analytic and Empirical Examination of Zero-Inflated and Hurdle Poisson Specifications," Sociological Methods & Research, , vol. 26(3), pages 368-400, February.
    6. Ajiferuke, Isola & Famoye, Felix, 2015. "Modelling count response variables in informetric studies: Comparison among count, linear, and lognormal regression models," Journal of Informetrics, Elsevier, vol. 9(3), pages 499-513.
    7. Niklas Elert, 2014. "What determines entry? Evidence from Sweden," The Annals of Regional Science, Springer;Western Regional Science Association, vol. 53(1), pages 55-92, August.
    8. Christian Kleiber & Achim Zeileis, 2016. "Visualizing Count Data Regressions Using Rootograms," The American Statistician, Taylor & Francis Journals, vol. 70(3), pages 296-303, July.
    9. Ulf‐ G. Gerdtham, 1997. "Equity in Health Care Utilization: Further Tests Based on Hurdle Models and Swedish Micro Data," Health Economics, John Wiley & Sons, Ltd., vol. 6(3), pages 303-319, May.
    10. Stefano Mainardi, 2003. "Testing convergence in life expectancies: count regression models on panel data," Prague Economic Papers, Prague University of Economics and Business, vol. 2003(4), pages 350-370.
    11. Samuel Muehlemann & Juerg Schweri & Rainer Winkelmann & Stefan C. Wolter, 2007. "An Empirical Analysis of the Decision to Train Apprentices," LABOUR, CEIS, vol. 21(3), pages 419-441, September.
    12. J. M. C. Santos Silva, 2001. "A score test for non-nested hypotheses with applications to discrete data models," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 16(5), pages 577-597.
    13. repec:fgv:epgrbe:v:66:n:1:a:3 is not listed on IDEAS
    14. Daniel Biftu Bekalo & Dufera Tejjeba Kebede, 2021. "Zero-Inflated Models for Count Data: An Application to Number of Antenatal Care Service Visits," Annals of Data Science, Springer, vol. 8(4), pages 683-708, December.
    15. Evgenii V. Gilenko & Elena A. Mironova, 2017. "Modern claim frequency and claim severity models: An application to the Russian motor own damage insurance market," Cogent Economics & Finance, Taylor & Francis Journals, vol. 5(1), pages 1311097-131, January.
    16. Livio Finos & Fortunato Pesarin, 2020. "On zero-inflated permutation testing and some related problems," Statistical Papers, Springer, vol. 61(5), pages 2157-2174, October.
    17. Andre Jungmittag, 2019. "Service trade restrictiveness and internationalisation of retail trade," International Economics and Economic Policy, Springer, vol. 16(2), pages 293-333, April.
    18. Melvyn Weeks & Sriya Iyer, 2004. "Multiple social interactions and reproductive externalities: An investigation of fertility behaviour in Kenya," Econometric Society 2004 Latin American Meetings 143, Econometric Society.
    19. William Greene, 2007. "Discrete Choice Modeling," Working Papers 07-6, New York University, Leonard N. Stern School of Business, Department of Economics.
    20. Bilgic, Abdulbaki & Florkowski, Wojciech J., 2003. "Explaning Anglers Behavior Using Count Data Models With Endogenous Switching Regime," 2003 Annual meeting, July 27-30, Montreal, Canada 22087, American Agricultural Economics Association (New Name 2008: Agricultural and Applied Economics Association).
    21. Antoni, Manfred, 2011. "Lifelong learning inequality? The relevance of family background for on-the-job training," IAB-Discussion Paper 201109, Institut für Arbeitsmarkt- und Berufsforschung (IAB), Nürnberg [Institute for Employment Research, Nuremberg, Germany].

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ntu:ntcmss:vol6-iss1-5-12. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Stefan Ciucu (email available below). General contact details of provider: https://edirc.repec.org/data/feuntro.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.