IDEAS home Printed from https://ideas.repec.org/a/bpj/ijbist/v11y2015i1p51-67n10.html
   My bibliography  Save this article

A Universal Approximate Cross-Validation Criterion for Regular Risk Functions

Author

Listed:
  • Commenges Daniel

    (INSERM, ISPED, Centre INSERM U-897-Epidemiologie-Biostatistique, Bordeaux F-33000, France ISPED, Centre INSERM U-897-Epidemiologie-Biostatistique, Universite de Bordeaux, Bordeaux F-33000, France)

  • Proust-Lima Cécile

    (INSERM, ISPED, Centre INSERM U-897-Epidemiologie-Biostatistique, Bordeaux F-33000, France ISPED, Centre INSERM U-897-Epidemiologie-Biostatistique, Universite de Bordeaux, Bordeaux F-33000, France)

  • Samieri Cécilia

    (INSERM, ISPED, Centre INSERM U-897-Epidemiologie-Biostatistique, Bordeaux F-33000, France ISPED, Centre INSERM U-897-Epidemiologie-Biostatistique, Universite de Bordeaux, Bordeaux F-33000, France)

  • Liquet Benoit

    (INSERM, ISPED, Centre INSERM U-897-Epidemiologie-Biostatistique, Bordeaux F-33000, France ISPED, Centre INSERM U-897-Epidemiologie-Biostatistique, Universite de Bordeaux, Bordeaux F-33000, France School of Mathematics and Physics, The University of Queensland, St Lucia, Brisbane, Queensland 4066, Australia)

Abstract

Selection of estimators is an essential task in modeling. A general framework is that the estimators of a distribution are obtained by minimizing a function (the estimating function) and assessed using another function (the assessment function). A classical case is that both functions estimate an information risk (specifically cross-entropy); this corresponds to using maximum likelihood estimators and assessing them by Akaike information criterion (AIC). In more general cases, the assessment risk can be estimated by leave-one-out cross-validation. Since leave-one-out cross-validation is computationally very demanding, we propose in this paper a universal approximate cross-validation criterion under regularity conditions (UACVR). This criterion can be adapted to different types of estimators, including penalized likelihood and maximum a posteriori estimators, and also to different assessment risk functions, including information risk functions and continuous rank probability score (CRPS). UACVR reduces to Takeuchi information criterion (TIC) when cross-entropy is the risk for both estimation and assessment. We provide the asymptotic distributions of UACVR and of a difference of UACVR values for two estimators. We validate UACVR using simulations and provide an illustration on real data both in the psychometric context where estimators of the distributions of ordered categorical data derived from threshold models and models based on continuous approximations are compared.

Suggested Citation

  • Commenges Daniel & Proust-Lima Cécile & Samieri Cécilia & Liquet Benoit, 2015. "A Universal Approximate Cross-Validation Criterion for Regular Risk Functions," The International Journal of Biostatistics, De Gruyter, vol. 11(1), pages 51-67, May.
  • Handle: RePEc:bpj:ijbist:v:11:y:2015:i:1:p:51-67:n:10
    DOI: 10.1515/ijb-2015-0004
    as

    Download full text from publisher

    File URL: https://doi.org/10.1515/ijb-2015-0004
    Download Restriction: For access to full text, subscription to the journal or payment for the individual article is required.

    File URL: https://libkey.io/10.1515/ijb-2015-0004?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Vuong, Quang H, 1989. "Likelihood Ratio Tests for Model Selection and Non-nested Hypotheses," Econometrica, Econometric Society, vol. 57(2), pages 307-333, March.
    2. Gneiting, Tilmann & Raftery, Adrian E., 2007. "Strictly Proper Scoring Rules, Prediction, and Estimation," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 359-378, March.
    3. Daniel Commenges & Pierre Joly & Anne Gégout‐Petit & Benoit Liquet, 2007. "Choice between Semi‐parametric Estimators of Markov and Non‐Markov Multi‐state Models from Coarsened Observations," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 34(1), pages 33-52, March.
    4. Cécile Proust & Hélène Jacqmin-Gadda & Jeremy M. G. Taylor & Julien Ganiayre & Daniel Commenges, 2006. "A Nonlinear Model with Latent Process for Cognitive Evolution Using Multivariate Longitudinal Data," Biometrics, The International Biometric Society, vol. 62(4), pages 1014-1024, December.
    5. Florin Vaida & Suzette Blanchard, 2005. "Conditional Akaike information for mixed-effects models," Biometrika, Biometrika Trust, vol. 92(2), pages 351-370, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Simona Buscemi & Antonella Plaia, 2020. "Model selection in linear mixed-effect models," AStA Advances in Statistical Analysis, Springer;German Statistical Society, vol. 104(4), pages 529-575, December.
    2. Breitmoser, Yves & Tan, Jonathan H.W., 2020. "Why should majority voting be unfair?," Journal of Economic Behavior & Organization, Elsevier, vol. 175(C), pages 281-295.
    3. Braun, Julia & Sabanés Bové, Daniel & Held, Leonhard, 2014. "Choice of generalized linear mixed models using predictive crossvalidation," Computational Statistics & Data Analysis, Elsevier, vol. 75(C), pages 190-202.
    4. Diks, Cees & Panchenko, Valentyn & Sokolinskiy, Oleg & van Dijk, Dick, 2014. "Comparing the accuracy of multivariate density forecasts in selected regions of the copula support," Journal of Economic Dynamics and Control, Elsevier, vol. 48(C), pages 79-94.
    5. Diks, Cees & Fang, Hao, 2020. "Comparing density forecasts in a risk management context," International Journal of Forecasting, Elsevier, vol. 36(2), pages 531-551.
    6. Daniel Commenges & Benoit Liquet & Cécile Proust-Lima, 2012. "Choice of Prognostic Estimators in Joint Models by Estimating Differences of Expected Conditional Kullback–Leibler Risks," Biometrics, The International Biometric Society, vol. 68(2), pages 380-387, June.
    7. Cees Diks & Valentyn Panchenko & Oleg Sokolinskiy, & Dick van Dijk, 2013. "Comparing the Accuracy of Copula-Based Multivariate Density Forecasts in Selected Regions of Support," Tinbergen Institute Discussion Papers 13-061/III, Tinbergen Institute.
    8. Andrew Martinez, 2017. "Testing for Differences in Path Forecast Accuracy: Forecast-Error Dynamics Matter," Working Papers (Old Series) 1717, Federal Reserve Bank of Cleveland.
    9. Claudia Czado & Tilmann Gneiting & Leonhard Held, 2009. "Predictive Model Assessment for Count Data," Biometrics, The International Biometric Society, vol. 65(4), pages 1254-1261, December.
    10. Paul Doukhan & Joseph Rynkiewicz & Yahia Salhi, 2021. "Optimal Neighborhood Selection for AR-ARCH Random Fields with Application to Mortality," Stats, MDPI, vol. 5(1), pages 1-26, December.
    11. Daniel Commenges, 2019. "Dealing with death when studying disease or physiological marker: the stochastic system approach to causality," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 25(3), pages 381-405, July.
    12. Corradi, Valentina & Fosten, Jack & Gutknecht, Daniel, 2023. "Out-of-sample tests for conditional quantile coverage an application to Growth-at-Risk," Journal of Econometrics, Elsevier, vol. 236(2).
    13. Julia Braun & Leonhard Held & Bruno Ledergerber, 2012. "Predictive Cross-validation for the Choice of Linear Mixed-Effects Models with Application to Data from the Swiss HIV Cohort Study," Biometrics, The International Biometric Society, vol. 68(1), pages 53-61, March.
    14. Breitmoser, Yves & Tan, Jonathan H.W., 2014. "Reference Dependent Altruism," MPRA Paper 52774, University Library of Munich, Germany.
    15. Oh, Dong Hwan & Patton, Andrew J., 2016. "High-dimensional copula-based distributions with mixed frequency data," Journal of Econometrics, Elsevier, vol. 193(2), pages 349-366.
    16. Almeida, Carlos & Czado, Claudia, 2012. "Efficient Bayesian inference for stochastic time-varying copula models," Computational Statistics & Data Analysis, Elsevier, vol. 56(6), pages 1511-1527.
    17. Fabrice Gilles & Sabina Issehnane & Florent Sari, 2022. "Using short-term jobs as a way to find a regular job. What kind of role for local context?," TEPP Working Paper 2022-07, TEPP.
    18. Bijlsma Ineke & van den Brakel Jan & van der Velden Rolf & Allen Jim, 2020. "Estimating Literacy Levels at a Detailed Regional Level: an Application Using Dutch Data," Journal of Official Statistics, Sciendo, vol. 36(2), pages 251-274, June.
    19. Jie Huang & Haiming Zhou & Nader Ebrahimi, 2022. "Bayesian Bivariate Cure Rate Models Using Copula Functions," International Journal of Statistics and Probability, Canadian Center of Science and Education, vol. 11(3), pages 1-9, May.
    20. Paulo M. D. C. Parente & Richard J. Smith, 2021. "Quasi‐maximum likelihood and the kernel block bootstrap for nonlinear dynamic models," Journal of Time Series Analysis, Wiley Blackwell, vol. 42(4), pages 377-405, July.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bpj:ijbist:v:11:y:2015:i:1:p:51-67:n:10. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.degruyter.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.