
Robust model selection with covariables missing at random

Author

Listed:
  • Zhongqi Liang

    (Zhejiang Gongshang University)

  • Qihua Wang

    (Zhejiang Gongshang University;
    Academy of Mathematics and Systems Science, Chinese Academy of Sciences)

  • Yuting Wei

    (University of Science and Technology of China)

Abstract

Let $f_{Y|X,Z}(y|x,z)$ be the conditional probability function of Y given (X, Z), where Y is the scalar response variable and (X, Z) is the covariable vector. This paper proposes a robust model selection criterion for $f_{Y|X,Z}(y|x,z)$ with X missing at random. The proposed method is based on a set of assumed models for the selection probability function. Consistency of the model selection does not require these models to be correctly specified; it requires only that the selection probability function be a function of the assumed selection probability functions. Under some conditions, it is proved that model selection by the proposed method is consistent and that the estimator of the population parameter vector is consistent and asymptotically normal. A Monte Carlo study evaluates the finite-sample performance of the proposal, and a real data analysis illustrates its practical application.
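
To make the missing-at-random setting concrete, the following is a minimal simulation sketch, not the paper's criterion. It assumes a hypothetical data-generating process and a hypothetical, known selection probability that depends only on the always-observed Y (and, in general, Z). All function names, coefficients, and the target quantity E[X] are illustrative assumptions. It shows why the selection probability matters: the complete-case average of X is biased, while weighting each observed case by the inverse of its selection probability approximately recovers E[X].

```python
import math
import random


def selection_prob(y, z):
    """Hypothetical selection probability P(delta = 1 | Y = y, Z = z).

    It depends only on observed quantities, so X is missing at random.
    z is accepted for generality but unused in this toy model.
    """
    return 1.0 / (1.0 + math.exp(-0.5 * y))


def simulate(n, seed=0):
    """Draw (y, x, z, delta) rows; delta is True when X is observed."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        z = 1.0 if rng.random() < 0.5 else 0.0
        x = rng.gauss(z, 1.0)            # E[X] = 0.5 under this assumed design
        y = x + z + rng.gauss(0.0, 1.0)  # response depends on both covariables
        delta = rng.random() < selection_prob(y, z)
        rows.append((y, x, z, delta))
    return rows


def complete_case_and_ipw_means(rows):
    """Return (complete-case mean, inverse-probability-weighted mean) of X."""
    observed = [(y, x, z) for (y, x, z, delta) in rows if delta]
    cc_mean = sum(x for _, x, _ in observed) / len(observed)
    # Weight each observed case by 1 / P(observed | y, z).
    weights = [1.0 / selection_prob(y, z) for y, _, z in observed]
    weighted_sum = sum(w * x for w, (_, x, _) in zip(weights, observed))
    ipw_mean = weighted_sum / sum(weights)
    return cc_mean, ipw_mean


if __name__ == "__main__":
    cc, ipw = complete_case_and_ipw_means(simulate(20000))
    print(f"complete-case mean of X: {cc:.3f}")
    print(f"IPW mean of X:           {ipw:.3f}")
```

In practice the selection probability is unknown and must be modeled. The abstract's robustness claim concerns exactly that step: the assumed models may each be misspecified, provided the true selection probability is a function of them.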

Suggested Citation

  • Zhongqi Liang & Qihua Wang & Yuting Wei, 2022. "Robust model selection with covariables missing at random," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 74(3), pages 539-557, June.
  • Handle: RePEc:spr:aistmt:v:74:y:2022:i:3:d:10.1007_s10463-021-00806-2
    DOI: 10.1007/s10463-021-00806-2

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10463-021-00806-2
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.



