IDEAS home Printed from https://ideas.repec.org/a/jss/jstsof/v027i08.html
   My bibliography  Save this article

Regression Models for Count Data in R

Author

Listed:
  • Zeileis, Achim
  • Kleiber, Christian
  • Jackman, Simon

Abstract

The classical Poisson, geometric and negative binomial regression models for count data belong to the family of generalized linear models and are available at the core of the statistics toolbox in the R system for statistical computing. After reviewing the conceptual and computational features of these methods, a new implementation of hurdle and zero-inflated regression models in the functions hurdle() and zeroinfl() from the package pscl is introduced. It re-uses design and functionality of the basic R functions just as the underlying conceptual tools extend the classical models. Both hurdle and zero-inflated model, are able to incorporate over-dispersion and excess zeros-two problems that typically occur in count data sets in economics and the social sciences-better than their classical counterparts. Using cross-section data on the demand for medical care, it is illustrated how the classical as well as the zero-augmented models can be fitted, inspected and tested in practice.

Suggested Citation

  • Zeileis, Achim & Kleiber, Christian & Jackman, Simon, 2008. "Regression Models for Count Data in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 27(i08).
  • Handle: RePEc:jss:jstsof:v:027:i08
    DOI: http://hdl.handle.net/10.18637/jss.v027.i08
    as

    Download full text from publisher

    File URL: https://www.jstatsoft.org/index.php/jss/article/view/v027i08/v27i08.pdf
    Download Restriction: no

    File URL: https://www.jstatsoft.org/index.php/jss/article/downloadSuppFile/v027i08/pscl_1.00.tar.gz
    Download Restriction: no

    File URL: https://www.jstatsoft.org/index.php/jss/article/downloadSuppFile/v027i08/v27i08.R.zip
    Download Restriction: no

    File URL: https://www.jstatsoft.org/index.php/jss/article/downloadSuppFile/v027i08/DebTrivedi.rda.zip
    Download Restriction: no

    File URL: https://libkey.io/http://hdl.handle.net/10.18637/jss.v027.i08?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Leisch, Friedrich, 2004. "FlexMix: A General Framework for Finite Mixture Models and Latent Class Regression in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 11(i08).
    2. Mullahy, John, 1986. "Specification and testing of some modified count data models," Journal of Econometrics, Elsevier, vol. 33(3), pages 341-365, December.
    3. Cameron,A. Colin & Trivedi,Pravin K., 2005. "Microeconometrics," Cambridge Books, Cambridge University Press, number 9780521848053, September.
    4. Deb, Partha & Trivedi, Pravin K, 1997. "Demand for Medical Care by the Elderly: A Finite Mixture Approach," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 12(3), pages 313-336, May-June.
    5. Zeileis, Achim, 2004. "Econometric Computing with HC and HAC Covariance Matrix Estimators," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 11(i10).
    6. Zeileis, Achim, 2006. "Object-oriented Computation of Sandwich Estimators," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 16(i09).
    7. Stasinopoulos, D. Mikis & Rigby, Robert A., 2007. "Generalized Additive Models for Location Scale and Shape (GAMLSS) in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 23(i07).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. repec:jss:jstsof:27:i08 is not listed on IDEAS
    2. Christian Kleiber & Achim Zeileis, 2016. "Visualizing Count Data Regressions Using Rootograms," The American Statistician, Taylor & Francis Journals, vol. 70(3), pages 296-303, July.
    3. Zeileis, Achim, 2006. "Object-oriented Computation of Sandwich Estimators," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 16(i09).
    4. Sarah Harris & Wendy Anderson & Musa Kilinc & Liam Fogarty, 2012. "The relationship between fire behaviour measures and community loss: an exploratory analysis for developing a bushfire severity scale," Natural Hazards: Journal of the International Society for the Prevention and Mitigation of Natural Hazards, Springer;International Society for the Prevention and Mitigation of Natural Hazards, vol. 63(2), pages 391-415, September.
    5. Óscar Lourenço & Carlota Quintal & Pedro Lopes Ferreira & Pedro Pita Barros, 2007. "A equidade na utilização de cuidados de saúde em Portugal: Uma avaliação baseada em modelos de contagem," Notas Económicas, Faculty of Economics, University of Coimbra, issue 25, pages 6-26, June.
    6. Greene, William, 2007. "Functional Form and Heterogeneity in Models for Count Data," Foundations and Trends(R) in Econometrics, now publishers, vol. 1(2), pages 113-218, August.
    7. Niklas Elert, 2014. "What determines entry? Evidence from Sweden," The Annals of Regional Science, Springer;Western Regional Science Association, vol. 53(1), pages 55-92, August.
    8. Anikó Bíró, 2014. "Supplementary private health insurance and health care utilization of people aged 50+," Empirical Economics, Springer, vol. 46(2), pages 501-524, March.
    9. Ball, Laurence & Carvalho, Carlos & Evans, Christopher & Antonio Ricci, Luca, 2024. "Weighted Median Inflation Around the World: A Measure of Core Inflation," Journal of International Money and Finance, Elsevier, vol. 142(C).
    10. Harald Oberhofer & Michael Pfaffermayr, 2014. "Two-Part Models for Fractional Responses Defined as Ratios of Integers," Econometrics, MDPI, vol. 2(3), pages 1-22, September.
    11. Stefan Seifert & Christoph Kahle & Silke Hüttel, 2021. "Price Dispersion in Farmland Markets: What Is the Role of Asymmetric Information?," American Journal of Agricultural Economics, John Wiley & Sons, vol. 103(4), pages 1545-1568, August.
    12. Sviták, Jan & Tichem, Jan & Haasbeek, Stefan, 2021. "Price effects of search advertising restrictions," International Journal of Industrial Organization, Elsevier, vol. 77(C).
    13. Stefano Mainardi, 2003. "Testing convergence in life expectancies: count regression models on panel data," Prague Economic Papers, Prague University of Economics and Business, vol. 2003(4), pages 350-370.
    14. Timo Dimitriadis & Xiaochun Liu & Julie Schnaitmann, 2020. "Encompassing Tests for Value at Risk and Expected Shortfall Multi-Step Forecasts based on Inference on the Boundary," Papers 2009.07341, arXiv.org.
    15. Christopher F. Parmeter, 2018. "Estimation of the two-tiered stochastic frontier model with the scaling property," Journal of Productivity Analysis, Springer, vol. 49(1), pages 37-47, February.
    16. Hasler Mario, 2013. "Multiple Contrasts for Repeated Measures," The International Journal of Biostatistics, De Gruyter, vol. 9(1), pages 49-61, July.
    17. Mario Hasler, 2016. "Heteroscedasticity: multiple degrees of freedom vs. sandwich estimation," Statistical Papers, Springer, vol. 57(1), pages 55-68, March.
    18. Kleiber Christian & Zeileis Achim, 2010. "The Grunfeld Data at 50," German Economic Review, De Gruyter, vol. 11(4), pages 404-417, December.
    19. Fabbri, Daniele & Monfardini, Chiara, 2009. "Rationing the public provision of healthcare in the presence of private supplements: Evidence from the Italian NHS," Journal of Health Economics, Elsevier, vol. 28(2), pages 290-304, March.
    20. Lupi, Claudio, 2009. "Unit Root CADF Testing with R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 32(i02).
    21. Livio Finos & Fortunato Pesarin, 2020. "On zero-inflated permutation testing and some related problems," Statistical Papers, Springer, vol. 61(5), pages 2157-2174, October.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:jss:jstsof:v:027:i08. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Christopher F. Baum (email available below). General contact details of provider: http://www.jstatsoft.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.