IDEAS home Printed from https://ideas.repec.org/a/spr/metrik/v79y2016i4d10.1007_s00184-015-0563-7.html
   My bibliography  Save this article

Semiparametric estimation of a zero-inflated Poisson regression model with missing covariates

Author

Listed:
  • T. Martin Lukusa

    (Feng Chia University)

  • Shen-Ming Lee

    (Feng Chia University)

  • Chin-Shang Li

    (University of California)

Abstract

Zero-inflated Poisson (ZIP) regression models have been widely used to study the effects of covariates in count data sets that have many zeros. However, often some covariates involved in ZIP regression modeling have missing values. Assuming that the selection probability is known or unknown and estimated via a non-parametric method, we propose the inverse probability weighting (IPW) method to estimate the parameters of the ZIP regression model with covariates missing at random. The asymptotic properties of the proposed estimators are studied in detail under certain regularity conditions. Both theoretical analysis and simulation results show that the semiparametric IPW estimator is more efficient than the true weight IPW estimator. The practical use of the proposed methodology is illustrated with data from a motorcycle survey of traffic regulations conducted in 2007 in Taiwan by the Ministry of Transportation and Communication.

Suggested Citation

  • T. Martin Lukusa & Shen-Ming Lee & Chin-Shang Li, 2016. "Semiparametric estimation of a zero-inflated Poisson regression model with missing covariates," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 79(4), pages 457-483, May.
  • Handle: RePEc:spr:metrik:v:79:y:2016:i:4:d:10.1007_s00184-015-0563-7
    DOI: 10.1007/s00184-015-0563-7
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00184-015-0563-7
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00184-015-0563-7?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Chen, Xue-Dong & Fu, Ying-Zi, 2011. "Model selection for zero-inflated regression with missing covariates," Computational Statistics & Data Analysis, Elsevier, vol. 55(1), pages 765-773, January.
    2. Daniel B. Hall & Jing Shen, 2010. "Robust Estimation for Zero‐Inflated Poisson Regression," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 37(2), pages 237-252, June.
    3. Vuong, Quang H, 1989. "Likelihood Ratio Tests for Model Selection and Non-nested Hypotheses," Econometrica, Econometric Society, vol. 57(2), pages 307-333, March.
    4. Shen-Ming Lee & Chin-Shang Li & Shu-Hui Hsieh & Li-Hui Huang, 2012. "Semiparametric estimation of logistic regression model with missing covariates and outcome," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 75(5), pages 621-653, July.
    5. Jansakul, N. & Hinde, J. P., 2002. "Score Tests for Zero-Inflated Poisson Models," Computational Statistics & Data Analysis, Elsevier, vol. 40(1), pages 75-96, July.
    6. Wang, Suojin & Wang, C. Y., 2001. "A note on kernel assisted estimators in missing covariate regression," Statistics & Probability Letters, Elsevier, vol. 55(4), pages 439-449, December.
    7. Mullahy, John, 1986. "Specification and testing of some modified count data models," Journal of Econometrics, Elsevier, vol. 33(3), pages 341-365, December.
    8. D. Böhning & E. Dietz & P. Schlattmann & L. Mendonça & U. Kirchner, 1999. "The zero‐inflated Poisson model and the decayed, missing and filled teeth index in dental epidemiology," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 162(2), pages 195-209.
    9. Creemers, An & Aerts, Marc & Hens, Niel & Molenberghs, Geert, 2012. "A nonparametric approach to weighted estimating equations for regression analysis with missing covariates," Computational Statistics & Data Analysis, Elsevier, vol. 56(1), pages 100-113, January.
    10. C. Y. Wang & Hua Yun Chen, 2001. "Augmented Inverse Probability Weighted Estimator for Cox Missing Covariate Regression," Biometrics, The International Biometric Society, vol. 57(2), pages 414-419, June.
    11. Hsieh, S.H. & Lee, S.M. & Shen, P.S., 2009. "Semiparametric analysis of randomized response data with missing covariates in logistic regression," Computational Statistics & Data Analysis, Elsevier, vol. 53(7), pages 2673-2692, May.
    12. Shou-En Lu & Yong Lin & Wei-Chung Joe Shih, 2004. "Analyzing Excessive No Changes in Clinical Trials with Clustered Data," Biometrics, The International Biometric Society, vol. 60(1), pages 257-267, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Lukusa, Martin T. & Phoa, Frederick Kin Hing, 2020. "A note on the weighting-type estimations of the zero-inflated Poisson regression model with missing data in covariates," Statistics & Probability Letters, Elsevier, vol. 158(C).
    2. Eric Han & Majid Mojirsheibani, 2021. "On histogram-based regression and classification with incomplete data," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 84(5), pages 635-662, July.
    3. Buu-Chau Truong & Nguyen Van Thuan & Nguyen Huu Hau & Michael McAleer, 2019. "Applications of the Newton-Raphson Method in Decision Sciences and Education," Advances in Decision Sciences, Asia University, Taiwan, vol. 23(4), pages 52-80, December.
    4. Shen-Ming Lee & Truong-Nhat Le & Phuoc-Loc Tran & Chin-Shang Li, 2023. "Estimation of logistic regression with covariates missing separately or simultaneously via multiple imputation methods," Computational Statistics, Springer, vol. 38(2), pages 899-934, June.
    5. Shen-Ming Lee & T. Martin Lukusa & Chin-Shang Li, 2020. "Estimation of a zero-inflated Poisson regression model with missing covariates via nonparametric multiple imputation methods," Computational Statistics, Springer, vol. 35(2), pages 725-754, June.
    6. Kim-Hung Pho & Tuan-Kiet Tran & Thi Diem-Chinh Ho & Wing-Keung Wong, 2019. "Optimal Solution Techniques in Decision Sciences A Review," Advances in Decision Sciences, Asia University, Taiwan, vol. 23(1), pages 114-161, March.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Shen-Ming Lee & T. Martin Lukusa & Chin-Shang Li, 2020. "Estimation of a zero-inflated Poisson regression model with missing covariates via nonparametric multiple imputation methods," Computational Statistics, Springer, vol. 35(2), pages 725-754, June.
    2. Abbas Moghimbeigi & Mohammed Reza Eshraghian & Kazem Mohammad & Brian Mcardle, 2008. "Multilevel zero-inflated negative binomial regression modeling for over-dispersed count data with extra zeros," Journal of Applied Statistics, Taylor & Francis Journals, vol. 35(10), pages 1193-1202.
    3. Lukusa, Martin T. & Phoa, Frederick Kin Hing, 2020. "A note on the weighting-type estimations of the zero-inflated Poisson regression model with missing data in covariates," Statistics & Probability Letters, Elsevier, vol. 158(C).
    4. L. Elbakidze & Y. H. Jin, 2015. "Are Economic Development and Education Improvement Associated with Participation in Transnational Terrorism?," Risk Analysis, John Wiley & Sons, vol. 35(8), pages 1520-1535, August.
    5. K. F. Lam & Hongqi Xue & Yin Bun Cheung, 2006. "Semiparametric Analysis of Zero-Inflated Count Data," Biometrics, The International Biometric Society, vol. 62(4), pages 996-1003, December.
    6. Cindy Xin Feng, 2021. "A comparison of zero-inflated and hurdle models for modeling zero-inflated count data," Journal of Statistical Distributions and Applications, Springer, vol. 8(1), pages 1-19, December.
    7. Moghimbeigi, Abbas & Eshraghian, Mohammad Reza & Mohammad, Kazem & McArdle, Brian, 2009. "A score test for zero-inflation in multilevel count data," Computational Statistics & Data Analysis, Elsevier, vol. 53(4), pages 1239-1248, February.
    8. Silva João M. C. Santos & Tenreyro Silvana & Windmeijer Frank, 2015. "Testing Competing Models for Non-negative Data with Many Zeros," Journal of Econometric Methods, De Gruyter, vol. 4(1), pages 29-46, January.
    9. Greene, William, 2007. "Functional Form and Heterogeneity in Models for Count Data," Foundations and Trends(R) in Econometrics, now publishers, vol. 1(2), pages 113-218, August.
    10. Christopher J. W. Zorn, 1998. "An Analytic and Empirical Examination of Zero-Inflated and Hurdle Poisson Specifications," Sociological Methods & Research, , vol. 26(3), pages 368-400, February.
    11. Ajiferuke, Isola & Famoye, Felix, 2015. "Modelling count response variables in informetric studies: Comparison among count, linear, and lognormal regression models," Journal of Informetrics, Elsevier, vol. 9(3), pages 499-513.
    12. Niklas Elert, 2014. "What determines entry? Evidence from Sweden," The Annals of Regional Science, Springer;Western Regional Science Association, vol. 53(1), pages 55-92, August.
    13. Ulf‐ G. Gerdtham, 1997. "Equity in Health Care Utilization: Further Tests Based on Hurdle Models and Swedish Micro Data," Health Economics, John Wiley & Sons, Ltd., vol. 6(3), pages 303-319, May.
    14. Stefano Mainardi, 2003. "Testing convergence in life expectancies: count regression models on panel data," Prague Economic Papers, Prague University of Economics and Business, vol. 2003(4), pages 350-370.
    15. Samuel Muehlemann & Juerg Schweri & Rainer Winkelmann & Stefan C. Wolter, 2007. "An Empirical Analysis of the Decision to Train Apprentices," LABOUR, CEIS, vol. 21(3), pages 419-441, September.
    16. J. M. C. Santos Silva, 2001. "A score test for non-nested hypotheses with applications to discrete data models," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 16(5), pages 577-597.
    17. David Todem & Wei‐Wen Hsu & KyungMann Kim, 2023. "Nonparametric scanning tests of homogeneity for hierarchical models with continuous covariates," Biometrics, The International Biometric Society, vol. 79(3), pages 2063-2075, September.
    18. repec:fgv:epgrbe:v:66:n:1:a:3 is not listed on IDEAS
    19. Daniel Biftu Bekalo & Dufera Tejjeba Kebede, 2021. "Zero-Inflated Models for Count Data: An Application to Number of Antenatal Care Service Visits," Annals of Data Science, Springer, vol. 8(4), pages 683-708, December.
    20. Tousifur Rahman & Partha Jyoti Hazarika & M. Masoom Ali & Manash Pratim Barman, 2022. "Three-Inflated Poisson Distribution and its Application in Suicide Cases of India During Covid-19 Pandemic," Annals of Data Science, Springer, vol. 9(5), pages 1103-1127, October.
    21. Melvyn Weeks & Sriya Iyer, 2004. "Multiple social interactions and reproductive externalities: An investigation of fertility behaviour in Kenya," Econometric Society 2004 Latin American Meetings 143, Econometric Society.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:metrik:v:79:y:2016:i:4:d:10.1007_s00184-015-0563-7. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.