IDEAS home Printed from https://ideas.repec.org/a/gam/jstats/v5y2022i1p4-69d719553.html
   My bibliography  Save this article

A Flexible Mixed Model for Clustered Count Data

Author

Listed:
  • Darcy Steeg Morris

    (Center for Statistical Research and Methodology, U.S. Census Bureau, Washington, DC 20233, USA)

  • Kimberly F. Sellers

    (Center for Statistical Research and Methodology, U.S. Census Bureau, Washington, DC 20233, USA
    Mathematics and Statistics Department, Georgetown University, Washington, DC 20057, USA)

Abstract

Clustered count data are commonly modeled using Poisson regression with random effects to account for the correlation induced by clustering. The Poisson mixed model allows for overdispersion via the nature of the within-cluster correlation, however, departures from equi-dispersion may also exist due to the underlying count process mechanism. We study the cross-sectional COM-Poisson regression model—a generalized regression model for count data in light of data dispersion—together with random effects for analysis of clustered count data. We demonstrate model flexibility of the COM-Poisson random intercept model, including choice of the random effect distribution, via simulated and real data examples. We find that COM-Poisson mixed models provide comparable model fit to well-known mixed models for associated special cases of clustered discrete data, and result in improved model fit for data with intermediate levels of over- or underdispersion in the count mechanism. Accordingly, the proposed models are useful for capturing dispersion not consistent with commonly used statistical models, and also serve as a practical diagnostic tool.

Suggested Citation

  • Darcy Steeg Morris & Kimberly F. Sellers, 2022. "A Flexible Mixed Model for Clustered Count Data," Stats, MDPI, vol. 5(1), pages 1-18, January.
  • Handle: RePEc:gam:jstats:v:5:y:2022:i:1:p:4-69:d:719553
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2571-905X/5/1/4/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2571-905X/5/1/4/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Seth D. Guikema & Jeremy P. Goffelt, 2008. "A Flexible Count Data Regression Model for Risk Analysis," Risk Analysis, John Wiley & Sons, vol. 28(1), pages 213-223, February.
    2. Hyoyoung Choo-Wosoba & Somnath Datta, 2018. "Analyzing clustered count data with a cluster-specific random effect zero-inflated Conway–Maxwell–Poisson distribution," Journal of Applied Statistics, Taylor & Francis Journals, vol. 45(5), pages 799-814, April.
    3. Galit Shmueli & Thomas P. Minka & Joseph B. Kadane & Sharad Borle & Peter Boatwright, 2005. "A useful distribution for fitting discrete data: revival of the Conway–Maxwell–Poisson distribution," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 54(1), pages 127-142, January.
    4. Bates, Douglas & Mächler, Martin & Bolker, Ben & Walker, Steve, 2015. "Fitting Linear Mixed-Effects Models Using lme4," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 67(i01).
    5. N. E. Breslow, 1984. "Extra‐Poisson Variation in Log‐Linear Models," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 33(1), pages 38-44, March.
    6. Kimberly F. Sellers & Sharad Borle & Galit Shmueli, 2012. "The COM‐Poisson model for count data: a survey of methods and applications," Applied Stochastic Models in Business and Industry, John Wiley & Sons, vol. 28(2), pages 104-116, March.
    7. Chatla, Suneel Babu & Shmueli, Galit, 2018. "Efficient estimation of COM–Poisson regression and a generalized additive model," Computational Statistics & Data Analysis, Elsevier, vol. 121(C), pages 71-88.
    8. Hyoyoung Choo-Wosoba & Steven M. Levy & Somnath Datta, 2016. "Marginal regression models for clustered count data based on zero-inflated Conway–Maxwell–Poisson distribution with applications," Biometrics, The International Biometric Society, vol. 72(2), pages 606-618, June.
    9. Dominique Lord & Srinivas Reddy Geedipally & Seth D. Guikema, 2010. "Extension of the Application of Conway‐Maxwell‐Poisson Models: Analyzing Traffic Crash Data Exhibiting Underdispersion," Risk Analysis, John Wiley & Sons, vol. 30(8), pages 1268-1276, August.
    10. Rainer Winkelmann, 2008. "Econometric Analysis of Count Data," Springer Books, Springer, edition 0, number 978-3-540-78389-3, December.
    11. Hausman, Jerry & Hall, Bronwyn H & Griliches, Zvi, 1984. "Econometric Models for Count Data with an Application to the Patents-R&D Relationship," Econometrica, Econometric Society, vol. 52(4), pages 909-938, July.
    12. William Greene, 2007. "Fixed and Random Effects Models for Count Data," Working Papers 07-15, New York University, Leonard N. Stern School of Business, Department of Economics.
    13. Fernanda B. Rizzato & Roseli A. Leandro & Clarice G.B. Demétrio & Geert Molenberghs, 2016. "A Bayesian approach to analyse overdispersed longitudinal count data," Journal of Applied Statistics, Taylor & Francis Journals, vol. 43(11), pages 2085-2109, August.
    14. Kimberly F. Sellers & Darcy S. Morris, 2017. "Underdispersion models: Models that are “under the radar”," Communications in Statistics - Theory and Methods, Taylor & Francis Journals, vol. 46(24), pages 12075-12086, December.
    15. A. Huang & A. S. I. Kim, 2021. "Bayesian Conway–Maxwell–Poisson regression models for overdispersed and underdispersed counts," Communications in Statistics - Theory and Methods, Taylor & Francis Journals, vol. 50(13), pages 3094-3105, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Douglas Toledo & Cristiane Akemi Umetsu & Antonio Fernando Monteiro Camargo & Idemauro Antonio Rodrigues Lara, 2022. "Flexible models for non-equidispersed count data: comparative performance of parametric models to deal with underdispersion," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 106(3), pages 473-497, September.
    2. Xun-Jian Li & Guo-Liang Tian & Mingqian Zhang & George To Sum Ho & Shuang Li, 2023. "Modeling Under-Dispersed Count Data by the Generalized Poisson Distribution via Two New MM Algorithms," Mathematics, MDPI, vol. 11(6), pages 1-24, March.
    3. Nasim Vahabi & Anoshirvan Kazemnejad & Somnath Datta, 2018. "A Marginalized Overdispersed Location Scale Model for Clustered Ordinal Data," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 80(1), pages 103-134, December.
    4. S. Hadi Khazraee & Antonio Jose Sáez‐Castillo & Srinivas Reddy Geedipally & Dominique Lord, 2015. "Application of the Hyper‐Poisson Generalized Linear Model for Analyzing Motor Vehicle Crashes," Risk Analysis, John Wiley & Sons, vol. 35(5), pages 919-930, May.
    5. Royce A. Francis & Srinivas Reddy Geedipally & Seth D. Guikema & Soma Sekhar Dhavala & Dominique Lord & Sarah LaRocca, 2012. "Characterizing the Performance of the Conway‐Maxwell Poisson Generalized Linear Model," Risk Analysis, John Wiley & Sons, vol. 32(1), pages 167-183, January.
    6. Seng Huat Ong & Shin Zhu Sim & Shuangzhe Liu & Hari M. Srivastava, 2023. "A Family of Finite Mixture Distributions for Modelling Dispersion in Count Data," Stats, MDPI, vol. 6(3), pages 1-14, September.
    7. Meena Badade & T. V. Ramanathan, 2022. "Probabilistic Frontier Regression Models for Count Type Output Data," Journal of Quantitative Economics, Springer;The Indian Econometric Society (TIES), vol. 20(1), pages 235-260, September.
    8. Bedbur, S. & Kamps, U., 2023. "Uniformly most powerful unbiased tests for the dispersion parameter of the Conway–Maxwell–Poisson distribution," Statistics & Probability Letters, Elsevier, vol. 196(C).
    9. Kara M. Reynolds, 2009. "Overcoming Free Riding: A Cross Country Analysis of Firm Participation in Antidumping Petitions," Working Papers 2009-01, American University, Department of Economics.
    10. Lluís Bermúdez & Dimitris Karlis & Isabel Morillo, 2020. "Modelling Unobserved Heterogeneity in Claim Counts Using Finite Mixture Models," Risks, MDPI, vol. 8(1), pages 1-13, January.
    11. Lord, Dominique & Mannering, Fred, 2010. "The statistical analysis of crash-frequency data: A review and assessment of methodological alternatives," Transportation Research Part A: Policy and Practice, Elsevier, vol. 44(5), pages 291-305, June.
    12. Burda, Martin & Harding, Matthew & Hausman, Jerry, 2012. "A Poisson mixture model of discrete choice," Journal of Econometrics, Elsevier, vol. 166(2), pages 184-203.
    13. repec:hal:spmain:info:hdl:2441/eu4vqp9ompqllr09j0031f620 is not listed on IDEAS
    14. Joan Costa‐Font & Cristina Vilaplana‐Prieto, 2020. "‘More than one red herring'? Heterogeneous effects of ageing on health care utilisation," Health Economics, John Wiley & Sons, Ltd., vol. 29(S1), pages 8-29, October.
    15. Karkinsky, Tom & Riedel, Nadine, 2012. "Corporate taxation and the choice of patent location within multinational firms," Journal of International Economics, Elsevier, vol. 88(1), pages 176-185.
    16. Antoine Dechezleprêtre & Matthieu Glachant, 2014. "Does Foreign Environmental Policy Influence Domestic Innovation? Evidence from the Wind Industry," Environmental & Resource Economics, Springer;European Association of Environmental and Resource Economists, vol. 58(3), pages 391-413, July.
    17. Costantini, Valeria & Crespi, Francesco & Palma, Alessandro, 2017. "Characterizing the policy mix and its impact on eco-innovation: A patent analysis of energy-efficient technologies," Research Policy, Elsevier, vol. 46(4), pages 799-819.
    18. Rodica Gilles & Seik Kim, 2013. "Distribution-Free Estimation of Zero-Inflated Models with Unobserved Heterogeneity," Working Papers UWEC-2013-03, University of Washington, Department of Economics.
    19. Rainer Winkelmann, 2015. "Counting on count data models," IZA World of Labor, Institute of Labor Economics (IZA), pages 148-148, May.
    20. Zhou, Can & Jiao, Yan & Browder, Joan, 2019. "K-aggregated transformation of discrete distributions improves modeling count data with excess ones," Ecological Modelling, Elsevier, vol. 407(C), pages 1-1.
    21. Pérez-Granja, Ubay & Pérez-Sánchez, José María & Pérez-Rodríguez, Jorge V., 2024. "Assessing economic performance and aviation accidents using zero-inflated and over-dispersed panel data models," Journal of Air Transport Management, Elsevier, vol. 118(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jstats:v:5:y:2022:i:1:p:4-69:d:719553. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.