
Robust model selection with covariables missing at random

Author

Listed:
  • Zhongqi Liang

    (Zhejiang Gongshang University)

  • Qihua Wang

    (Zhejiang Gongshang University
    Academy of Mathematics and Systems Science, Chinese Academy of Sciences)

  • Yuting Wei

    (University of Science and Technology of China)

Abstract

Let $$f_{Y|X,Z}(y|x,z)$$ be the conditional probability function of Y given (X, Z), where Y is a scalar response variable and (X, Z) is the covariable vector. This paper proposes a robust model selection criterion for $$f_{Y|X,Z}(y|x,z)$$ when X is missing at random. The proposed method is built on a set of assumed models for the selection probability function. However, consistency of the model selection does not require these models to be correctly specified; it requires only that the selection probability function be a function of the assumed selection probability functions. Under some conditions, it is proved that model selection by the proposed method is consistent and that the estimator of the population parameter vector is consistent and asymptotically normal. A Monte Carlo study evaluates the finite-sample performance of the proposal, and a real data analysis illustrates its practical application.
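
The two conditions named in the abstract can be written out as follows. This is only a sketch: the notation is an assumption introduced here for illustration and is not taken from the paper. Here \delta indicates whether X is observed, \pi is the selection probability function, \pi_1, ..., \pi_J are the assumed working models, and g is an unspecified function.

```latex
% Hypothetical notation (not from the paper): \delta, \pi, \pi_j, g, J.
\begin{align*}
% X missing at random: given (Y, Z), missingness does not depend on X itself.
\Pr(\delta = 1 \mid Y, X, Z) &= \Pr(\delta = 1 \mid Y, Z) =: \pi(Y, Z), \\
% Robustness condition: \pi is some function of the assumed models,
% so no single \pi_j needs to equal \pi.
\pi(y, z) &= g\bigl(\pi_1(y, z), \dots, \pi_J(y, z)\bigr).
\end{align*}
```

Under this reading, correct specification of one working model is the special case in which g simply returns one of its arguments, so the stated requirement is weaker than correct specification.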

Suggested Citation

  • Zhongqi Liang & Qihua Wang & Yuting Wei, 2022. "Robust model selection with covariables missing at random," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 74(3), pages 539-557, June.
  • Handle: RePEc:spr:aistmt:v:74:y:2022:i:3:d:10.1007_s10463-021-00806-2
    DOI: 10.1007/s10463-021-00806-2

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10463-021-00806-2
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wei, Yuting & Wang, Qihua & Duan, Xiaogang & Qin, Jing, 2021. "Bias-corrected Kullback–Leibler distance criterion based model selection with covariables missing at random," Computational Statistics & Data Analysis, Elsevier, vol. 160(C).
    2. Edvard Bakhitov, 2020. "Frequentist Shrinkage under Inequality Constraints," Papers 2001.10586, arXiv.org.
    3. Bo E. Honoré & Luojia Hu, 2023. "The COVID-19 pandemic and Asian American employment," Empirical Economics, Springer, vol. 64(5), pages 2053-2083, May.
    4. Patrick Gagliardini & Christian Gouriéroux, 2011. "Approximate Derivative Pricing for Large Classes of Homogeneous Assets with Systematic Risk," Journal of Financial Econometrics, Oxford University Press, vol. 9(2), pages 237-280, Spring.
    5. Gerhard, Frank & Hess, Dieter & Pohlmeier, Winfried, 1998. "What a Difference a Day Makes: On the Common Market Microstructure of Trading Days," CoFE Discussion Papers 98/01, University of Konstanz, Center of Finance and Econometrics (CoFE).
    6. Shapiro, Dmitry & Shi, Xianwen & Zillante, Artie, 2014. "Level-k reasoning in a generalized beauty contest," Games and Economic Behavior, Elsevier, vol. 86(C), pages 308-329.
    7. Prosper Dovonon & Alastair Hall & Frank Kleibergen, 2018. "Inference in Second-Order Identified Models," CIRANO Working Papers 2018s-36, CIRANO.
    8. Samuele Centorrino & María Pérez‐Urdiales & Boris Bravo‐Ureta & Alan Wall, 2024. "Binary endogenous treatment in stochastic frontier models with an application to soil conservation in El Salvador," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 39(3), pages 365-382, April.
    9. Detering, Nils & Packham, Natalie, 2018. "Model risk of contingent claims," IRTG 1792 Discussion Papers 2018-036, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    10. Aronsson, Thomas & Jenderny, Katharina & Lanot, Gauthier, 2021. "Maximum Likelihood Bunching Estimators of the ETI," Umeå Economic Studies 987, Umeå University, Department of Economics.
    11. Jean-François Richard, 2011. "Book Review: Econometric Modeling and Inference," Econometric Reviews, Taylor & Francis Journals, vol. 30(5), pages 577-581, October.
    12. Brosig, Stephan, 2000. "A model of household type specific food demand behaviour in Hungary," IAMO Discussion Papers 30, Leibniz Institute of Agricultural Development in Transition Economies (IAMO).
    13. Yuting Wei & Qihua Wang & Wei Liu, 2021. "Model averaging for linear models with responses missing at random," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 73(3), pages 535-553, June.
    14. Jiang, Wei & Josse, Julie & Lavielle, Marc, 2020. "Logistic regression with missing covariates—Parameter estimation, model selection and prediction within a joint-modeling framework," Computational Statistics & Data Analysis, Elsevier, vol. 145(C).
    15. Blevins, Jason R. & Kim, Minhae, 2024. "Nested Pseudo likelihood estimation of continuous-time dynamic discrete games," Journal of Econometrics, Elsevier, vol. 238(2).
    16. Gouriéroux, Christian & Monfort, Alain & Renne, Jean-Paul, 2017. "Statistical inference for independent component analysis: Application to structural VAR models," Journal of Econometrics, Elsevier, vol. 196(1), pages 111-126.
    17. Antonio Diez de los Rios, 2017. "Optimal Estimation of Multi-Country Gaussian Dynamic Term Structure Models Using Linear Regressions," Staff Working Papers 17-33, Bank of Canada.
    18. Kasahara, Hiroyuki & Shimotsu, Katsumi, 2019. "Asymptotic properties of the maximum likelihood estimator in regime switching econometric models," Journal of Econometrics, Elsevier, vol. 208(2), pages 442-467.
    19. Denis Fougère & Thierry Kamionka, 2003. "Bayesian inference for the mover-stayer model in continuous time with an application to labour market transition data," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 18(6), pages 697-723.
    20. Antonio Diez de Los Rios, 2015. "A New Linear Estimator for Gaussian Dynamic Term Structure Models," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 33(2), pages 282-295, April.
