IDEAS home Printed from https://ideas.repec.org/a/eee/jmvana/v168y2018icp63-74.html
   My bibliography  Save this article

Classified mixed logistic model prediction

Author

Listed:
  • Sun, Hanmei
  • Nguyen, Thuan
  • Luan, Yihui
  • Jiang, Jiming

Abstract

We develop a classified mixed logistic model prediction (CMLMP) method for clustered binary data by extending a method proposed by Jiang et al. (2018) for continuous outcome data. By identifying a class, or cluster, that the new observations belong to, we are able to improve the prediction accuracy of a probabilistic mixed effect associated with a future observation over the traditional method of logistic regression and mixed model prediction without matching the class. Furthermore, we develop a new strategy for identifying the class for the new observations by utilizing covariates information, which improves accuracy of the class identification. In addition, we develop a method of obtaining second-order unbiased estimators of the mean squared prediction errors (MSPEs) for CMLMP, which are used to provide measures of uncertainty. We prove consistency of CMLMP, and demonstrate finite-sample performance of CMLMP via simulation studies. Our results show that the proposed CMLMP method outperforms the traditional methods in terms of predictive performance. An application to medical data is discussed.

Suggested Citation

  • Sun, Hanmei & Nguyen, Thuan & Luan, Yihui & Jiang, Jiming, 2018. "Classified mixed logistic model prediction," Journal of Multivariate Analysis, Elsevier, vol. 168(C), pages 63-74.
  • Handle: RePEc:eee:jmvana:v:168:y:2018:i:c:p:63-74
    DOI: 10.1016/j.jmva.2018.06.004
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0047259X18300769
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jmva.2018.06.004?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Jiming Jiang & J. Sunil Rao & Jie Fan & Thuan Nguyen, 2018. "Classified Mixed Model Prediction," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 113(521), pages 269-279, January.
    2. Karl W. Broman & Terence P. Speed, 2002. "A model selection approach for the identification of quantitative trait loci in experimental crosses," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 64(4), pages 641-656, October.
    3. Jiming Jiang & P. Lahiri, 2001. "Empirical Best Prediction for Small Area Inference with Binary Data," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 53(2), pages 217-243, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. K. Shuvo Bakar & Nicholas Biddle & Philip Kokic & Huidong Jin, 2020. "A Bayesian spatial categorical model for prediction to overlapping geographical areas in sample surveys," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 183(2), pages 535-563, February.
    2. Frommlet, Florian & Ruhaltinger, Felix & Twaróg, Piotr & Bogdan, Małgorzata, 2012. "Modified versions of Bayesian Information Criterion for genome-wide association studies," Computational Statistics & Data Analysis, Elsevier, vol. 56(5), pages 1038-1051.
    3. Zak-Szatkowska, Malgorzata & Bogdan, Malgorzata, 2011. "Modified versions of the Bayesian Information Criterion for sparse Generalized Linear Models," Computational Statistics & Data Analysis, Elsevier, vol. 55(11), pages 2908-2924, November.
    4. Miguel Boubeta & María José Lombardía & Domingo Morales, 2016. "Empirical best prediction under area-level Poisson mixed models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 25(3), pages 548-569, September.
    5. Ralf Münnich & Jan Pablo Burgard & Siegfried Gabler & Matthias Ganninger & Jan-Philipp Kolb, 2016. "Small Area Estimation In The German Census 2011," Statistics in Transition New Series, Polish Statistical Association, vol. 17(1), pages 25-40, March.
    6. Angelo Moretti, 2023. "Estimation of small area proportions under a bivariate logistic mixed model," Quality & Quantity: International Journal of Methodology, Springer, vol. 57(4), pages 3663-3684, August.
    7. Marit Ackermann & Mathieu Clément-Ziza & Jacob J Michaelson & Andreas Beyer, 2012. "Teamwork: Improved eQTL Mapping Using Combinations of Machine Learning Methods," PLOS ONE, Public Library of Science, vol. 7(7), pages 1-8, July.
    8. Jiming Jiang & P. Lahiri, 2006. "Mixed model prediction and small area estimation," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 15(1), pages 1-96, June.
    9. María Dolores Esteban & María José Lombardía & Esther López‐Vizcaíno & Domingo Morales & Agustín Pérez, 2022. "Empirical best prediction of small area bivariate parameters," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 49(4), pages 1699-1727, December.
    10. Boubeta, Miguel & Lombardía, María José & Morales, Domingo, 2017. "Poisson mixed models for studying the poverty in small areas," Computational Statistics & Data Analysis, Elsevier, vol. 107(C), pages 32-47.
    11. Jerry J. Maples, 2017. "Improving small area estimates of disability: combining the American Community Survey with the Survey of Income and Program Participation," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 180(4), pages 1211-1227, October.
    12. Tomáš Hobza & Domingo Morales & Laureano Santamaría, 2018. "Small area estimation of poverty proportions under unit-level temporal binomial-logit mixed models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 27(2), pages 270-294, June.
    13. Flores-Agreda, Daniel & Cantoni, Eva, 2019. "Bootstrap estimation of uncertainty in prediction for generalized linear mixed models," Computational Statistics & Data Analysis, Elsevier, vol. 130(C), pages 1-17.
    14. Malay Ghosh, 2020. "Small area estimation: its evolution in five decades," Statistics in Transition New Series, Polish Statistical Association, vol. 21(4), pages 1-22, August.
    15. Anders Skrondal & Sophia Rabe‐Hesketh, 2009. "Prediction in multilevel generalized linear models," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 172(3), pages 659-687, June.
    16. C. M. Kendziorski & M. Chen & M. Yuan & H. Lan & A. D. Attie, 2006. "Statistical Methods for Expression Quantitative Trait Loci (eQTL) Mapping," Biometrics, The International Biometric Society, vol. 62(1), pages 19-27, March.
    17. Merfeld,Joshua David & Newhouse,David Locke & Weber,Michael & Lahiri,Partha, 2022. "Combining Survey and Geospatial Data Can Significantly Improve Gender-DisaggregatedEstimates of Labor Market Outcomes," Policy Research Working Paper Series 10077, The World Bank.
    18. Yawei He & Zehua Chen, 2016. "The EBIC and a sequential procedure for feature selection in interactive linear models with high-dimensional data," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 68(1), pages 155-180, February.
    19. Hao Sun & Emily Berg & Zhengyuan Zhu, 2022. "Bivariate small‐area estimation for binary and gaussian variables based on a conditionally specified model," Biometrics, The International Biometric Society, vol. 78(4), pages 1555-1565, December.
    20. Chun Wang, 2021. "Using Penalized EM Algorithm to Infer Learning Trajectories in Latent Transition CDM," Psychometrika, Springer;The Psychometric Society, vol. 86(1), pages 167-189, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jmvana:v:168:y:2018:i:c:p:63-74. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.