IDEAS home Printed from https://ideas.repec.org/a/sae/somere/v50y2021i1p365-400.html
   My bibliography  Save this article

Generalized Inflated Discrete Models: A Strategy to Work with Multimodal Discrete Distributions

Author

Listed:
  • Tianji Cai
  • Yiwei Xia
  • Yisu Zhou

Abstract

Analysts of discrete data often face the challenge of managing the tendency of inflation on certain values. When treated improperly, such phenomenon may lead to biased estimates and incorrect inferences. This study extends the existing literature on single-value inflated models and develops a general framework to handle variables with more than one inflated value. To assess the performance of the proposed maximum likelihood estimator, we conducted Monte Carlo experiments under several scenarios for different levels of inflated probabilities under multinomial, ordinal, Poisson, and zero-truncated Poisson outcomes with covariates. We found that ignoring the inflations leads to substantial bias and poor inference of the inflations—not only for the intercept(s) of the inflated categories but other coefficients as well. Specifically, higher values of inflated probabilities are associated with larger biases. By contrast, the generalized inflated discrete models (GIDMs) perform well with unbiased estimates and satisfactory coverages even when the number of parameters that need to be estimated is quite large. We showed that model fit criteria, such as Akaike information criterion, could be used in selecting the appropriate specifications of inflated models. Lastly, the GIDM was implemented using large-scale health survey data as a comparison to conventional modeling approaches such as various Poisson and Ordered Logit models. We showed that the GIDM fits the data better in general. The current work provides a practical approach to analyze multimodal data that exists in many fields, such as heaping in self-reported behavioral outcomes, inflated categories of indifference and neutral in attitude surveys, large amounts of zero, and low occurrences of delinquent behaviors.

Suggested Citation

  • Tianji Cai & Yiwei Xia & Yisu Zhou, 2021. "Generalized Inflated Discrete Models: A Strategy to Work with Multimodal Discrete Distributions," Sociological Methods & Research, , vol. 50(1), pages 365-400, February.
  • Handle: RePEc:sae:somere:v:50:y:2021:i:1:p:365-400
    DOI: 10.1177/0049124118782535
    as

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/0049124118782535
    Download Restriction: no

    File URL: https://libkey.io/10.1177/0049124118782535?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Lisa Farrell & Tim R. L. Fry & Mark N. Harris, 2003. "“A Pack A Day For Twenty Years”:Smoking And Cigarette Pack Sizes," Department of Economics - Working Papers Series 887, The University of Melbourne.
    2. Harris, Mark N. & Zhao, Xueyan, 2007. "A zero-inflated ordered probit model, with an application to modelling tobacco consumption," Journal of Econometrics, Elsevier, vol. 141(2), pages 1073-1099, December.
    3. Bagozzi, Benjamin E. & Mukherjee, Bumba, 2012. "A Mixture Model for Middle Category Inflation in Ordered Survey Responses," Political Analysis, Cambridge University Press, vol. 20(3), pages 369-386, July.
    4. Vuong, Quang H, 1989. "Likelihood Ratio Tests for Model Selection and Non-nested Hypotheses," Econometrica, Econometric Society, vol. 57(2), pages 307-333, March.
    5. Moghimbeigi, Abbas & Eshraghian, Mohammad Reza & Mohammad, Kazem & McArdle, Brian, 2009. "A score test for zero-inflation in multilevel count data," Computational Statistics & Data Analysis, Elsevier, vol. 53(4), pages 1239-1248, February.
    6. Martin Ridout & John Hinde & Clarice G. B. Demétrio, 2001. "A Score Test for Testing a Zero‐Inflated Poisson Regression Model Against Zero‐Inflated Negative Binomial Alternatives," Biometrics, The International Biometric Society, vol. 57(1), pages 219-223, March.
    7. Michael Luca & Georgios Zervas, 2016. "Fake It Till You Make It: Reputation, Competition, and Yelp Review Fraud," Management Science, INFORMS, vol. 62(12), pages 3412-3427, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Pusch, Natasha, 2024. "Estimating the prevalence and frequency of adolescent substance use using zero-inflated models and variables associated with social learning, social bond, and opportunity," Journal of Criminal Justice, Elsevier, vol. 94(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Cai, Tianji & Xia, Yiwei & Zhou, Yisu, 2017. "Generalized Inflated Discrete Models: A Strategy to Work with Multimodal Discrete Distributions," SocArXiv 4r2j3, Center for Open Science.
    2. David Dale & Andrei Sirchenko, 2021. "Estimation of nested and zero-inflated ordered probit models," Stata Journal, StataCorp LP, vol. 21(1), pages 3-38, March.
    3. Benjamin E. Bagozzi, 2016. "The baseline-inflated multinomial logit model for international relations research," Conflict Management and Peace Science, Peace Science Society (International), vol. 33(2), pages 174-197, April.
    4. repec:fgv:epgrbe:v:66:n:1:a:3 is not listed on IDEAS
    5. Hantzsche, Arno, 2022. "Fiscal uncertainty and sovereign credit risk," European Economic Review, Elsevier, vol. 148(C).
    6. Lim, Hwa Kyung & Song, Juwon & Jung, Byoung Cheol, 2013. "Score tests for zero-inflation and overdispersion in two-level count data," Computational Statistics & Data Analysis, Elsevier, vol. 61(C), pages 67-82.
    7. Baksh, M. Fazil & Böhning, Dankmar & Lerdsuwansri, Rattana, 2011. "An extension of an over-dispersion test for count data," Computational Statistics & Data Analysis, Elsevier, vol. 55(1), pages 466-474, January.
    8. Schröder, Bruno, 2012. "Práticas restritivas, barreiras à entrada e concorrência no mercado brasileiro de exibição cinematográfica," Revista Brasileira de Economia - RBE, EPGE Brazilian School of Economics and Finance - FGV EPGE (Brazil), vol. 66(1), March.
    9. Derek S. Young & Andrew M. Raim & Nancy R. Johnson, 2017. "Zero-inflated modelling for characterizing coverage errors of extracts from the US Census Bureau's Master Address File," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 180(1), pages 73-97, January.
    10. Zhen Xu & G. Cornelis van Kooten, 2013. "Count Models and Wildfire in British Columbia," Working Papers 2013-06, University of Victoria, Department of Economics, Resource Economics and Policy Analysis Research Group.
    11. El-Shagi, Makram & von Schweinitz, Gregor, 2017. "Why they keep missing: An empirical investigation of rational inattention of rating agencies," IWH Discussion Papers 1/2017, Halle Institute for Economic Research (IWH), revised 2017.
    12. J Paul Dunne & Nan Tian, 2016. "Determinants of Civil War and Excess Zeroes," SALDRU Working Papers 191, Southern Africa Labour and Development Research Unit, University of Cape Town.
    13. David ARISTEI & Manuela Gallo, 2012. "The Drivers of Household Over-Indebtedness and Delinquency on Mortgage Loans: Evidence from Italian Microdata," Quaderni del Dipartimento di Economia, Finanza e Statistica 105/2012, Università di Perugia, Dipartimento Economia.
    14. William Greene, 2009. "Models for count data with endogenous participation," Empirical Economics, Springer, vol. 36(1), pages 133-173, February.
    15. Hadi Saboori & Mahdi Doostparast, 2024. "Control Charts Based on Zero to k Inflated Power Series Regression Models and Their Applications," Sankhya B: The Indian Journal of Statistics, Springer;Indian Statistical Institute, vol. 86(2), pages 442-476, November.
    16. William Greene, 2007. "Discrete Choice Modeling," Working Papers 07-6, New York University, Leonard N. Stern School of Business, Department of Economics.
    17. Bilgic, Abdulbaki & Florkowski, Wojciech J. & Yen, Steven T. & Akbay, Cuma, 2013. "Tobacco spending patterns and their health-related implications in Turkey," Journal of Policy Modeling, Elsevier, vol. 35(1), pages 1-15.
    18. Sirchenko Andrei, 2012. "A model for ordinal responses with an application to policy interest rate," EERC Working Paper Series 12/13e, EERC Research Network, Russia and CIS.
    19. Makram El‐Shagi & Gregor von Schweinitz, 2022. "Why they keep missing: An empirical investigation of sovereign bond ratings and their timing," Scottish Journal of Political Economy, Scottish Economic Society, vol. 69(2), pages 186-224, May.
    20. Mark N. Harris & Xueyan Zhao, 2004. "Modelling Tobacco Consumption with a Zero-Inflated Ordered Probit Model," Monash Econometrics and Business Statistics Working Papers 14/04, Monash University, Department of Econometrics and Business Statistics.
    21. Fumagalli, Elena & Fumagalli, Laura, 2022. "Subjective well-being and the gender composition of the reference group: Evidence from a survey experiment," Journal of Economic Behavior & Organization, Elsevier, vol. 194(C), pages 196-219.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sae:somere:v:50:y:2021:i:1:p:365-400. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: SAGE Publications (email available below). General contact details of provider: .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.