IDEAS home Printed from https://ideas.repec.org/a/spr/metrik/v85y2022i4d10.1007_s00184-021-00837-y.html
   My bibliography  Save this article

$$\ell _2$$ ℓ 2 -penalized approximate likelihood inference in logit mixed models for regional prevalence estimation under covariate rank-deficiency

Author

Listed:
  • Joscha Krause

    (Trier University)

  • Jan Pablo Burgard

    (Trier University)

  • Domingo Morales

    (University Miguel Hernández de Elche)

Abstract

Regional prevalence estimation requires the use of suitable statistical methods on epidemiologic data with substantial local detail. Small area estimation with medical treatment records as covariates marks a promising combination for this purpose. However, medical routine data often has strong internal correlation due to diagnosis-related grouping in the records. Depending on the strength of the correlation, the space spanned by the covariates can become rank-deficient. In this case, prevalence estimates suffer from unacceptable uncertainty as the individual contributions of the covariates to the model cannot be identified properly. We propose an area-level logit mixed model for regional prevalence estimation with a new fitting algorithm to solve this problem. We extend the Laplace approximation to the log-likelihood by an $$\ell _2$$ ℓ 2 -penalty in order to stabilize the estimation process in the presence of covariate rank-deficiency. Empirical best predictors under the model and a parametric bootstrap for mean squared error estimation are presented. A Monte Carlo simulation study is conducted to evaluate the properties of our methodology in a controlled environment. We further provide an empirical application where the district-level prevalence of multiple sclerosis in Germany is estimated using health insurance records.

Suggested Citation

  • Joscha Krause & Jan Pablo Burgard & Domingo Morales, 2022. "$$\ell _2$$ ℓ 2 -penalized approximate likelihood inference in logit mixed models for regional prevalence estimation under covariate rank-deficiency," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 85(4), pages 459-489, May.
  • Handle: RePEc:spr:metrik:v:85:y:2022:i:4:d:10.1007_s00184-021-00837-y
    DOI: 10.1007/s00184-021-00837-y
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s00184-021-00837-y
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s00184-021-00837-y?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Gonzalez-Manteiga, W. & Lombardia, M.J. & Molina, I. & Morales, D. & Santamaria, L., 2007. "Estimation of the mean squared error of predictors of small area linear parameters under a logistic mixed model," Computational Statistics & Data Analysis, Elsevier, vol. 51(5), pages 2720-2733, February.
    2. Isabel Molina & Ayoub Saei & M. José Lombardía, 2007. "Small area estimates of labour force participation under a multinomial logit mixed model," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 170(4), pages 975-1000, October.
    3. Ray Chambers & Nicola Salvati & Nikos Tzavidis, 2016. "Semiparametric small area estimation for binary outcomes with application to unemployment estimation for local authorities in the UK," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 179(2), pages 453-479, February.
    4. Jiming Jiang & P. Lahiri, 2001. "Empirical Best Prediction for Small Area Inference with Binary Data," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 53(2), pages 217-243, June.
    5. M. Giovanna Ranalli & Giorgio E. Montanari & Cecilia Vicarelli, 2018. "Estimation of small area counts with the benchmarking property," METRON, Springer;Sapienza Università di Roma, vol. 76(3), pages 349-378, December.
    6. A. F. Militino & M. D. Ugarte & T. Goicoa, 2015. "Deriving small area estimates from information technology business surveys," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 178(4), pages 1051-1067, October.
    7. Miguel Boubeta & María José Lombardía & Domingo Morales, 2016. "Empirical best prediction under area-level Poisson mixed models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 25(3), pages 548-569, September.
    8. Boubeta, Miguel & Lombardía, María José & Morales, Domingo, 2017. "Poisson mixed models for studying the poverty in small areas," Computational Statistics & Data Analysis, Elsevier, vol. 107(C), pages 32-47.
    9. Esther López-Vizcaíno & María José Lombardía & Domingo Morales, 2015. "Small area estimation of labour force indicators under a multinomial model with correlated time and area effects," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 178(3), pages 535-565, June.
    10. Tomáš Hobza & Domingo Morales & Laureano Santamaría, 2018. "Small area estimation of poverty proportions under unit-level temporal binomial-logit mixed models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 27(2), pages 270-294, June.
    11. Adam Branscum & Timothy Hanson & Ian Gardner, 2008. "Bayesian non-parametric models for regional prevalence estimation," Journal of Applied Statistics, Taylor & Francis Journals, vol. 35(5), pages 567-582.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. María Dolores Esteban & María José Lombardía & Esther López-Vizcaíno & Domingo Morales & Agustín Pérez, 2020. "Small area estimation of proportions under area-level compositional mixed models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 29(3), pages 793-818, September.
    2. Domingo Morales & Joscha Krause & Jan Pablo Burgard, 2022. "On the Use of Aggregate Survey Data for Estimating Regional Major Depressive Disorder Prevalence," Psychometrika, Springer;The Psychometric Society, vol. 87(1), pages 344-368, March.
    3. María Dolores Esteban & María José Lombardía & Esther López-Vizcaíno & Domingo Morales & Agustín Pérez, 2023. "Small area estimation of average compositions under multivariate nested error regression models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 32(2), pages 651-676, June.
    4. Tomáš Hobza & Domingo Morales & Laureano Santamaría, 2018. "Small area estimation of poverty proportions under unit-level temporal binomial-logit mixed models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 27(2), pages 270-294, June.
    5. Angelo Moretti, 2023. "Estimation of small area proportions under a bivariate logistic mixed model," Quality & Quantity: International Journal of Methodology, Springer, vol. 57(4), pages 3663-3684, August.
    6. María Dolores Esteban & María José Lombardía & Esther López‐Vizcaíno & Domingo Morales & Agustín Pérez, 2022. "Empirical best prediction of small area bivariate parameters," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 49(4), pages 1699-1727, December.
    7. Chandra, Hukum & Salvati, Nicola & Chambers, Ray, 2018. "Small area estimation under a spatially non-linear model," Computational Statistics & Data Analysis, Elsevier, vol. 126(C), pages 19-38.
    8. Joscha Krause & Jan Pablo Burgard & Domingo Morales, 2022. "Robust prediction of domain compositions from uncertain data using isometric logratio transformations in a penalized multivariate Fay–Herriot model," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 76(1), pages 65-96, February.
    9. M. Giovanna Ranalli & Giorgio E. Montanari & Cecilia Vicarelli, 2018. "Estimation of small area counts with the benchmarking property," METRON, Springer;Sapienza Università di Roma, vol. 76(3), pages 349-378, December.
    10. Boubeta, Miguel & Lombardía, María José & Morales, Domingo, 2017. "Poisson mixed models for studying the poverty in small areas," Computational Statistics & Data Analysis, Elsevier, vol. 107(C), pages 32-47.
    11. Roberto Benavent & Domingo Morales, 2021. "Small area estimation under a temporal bivariate area-level linear mixed model with independent time effects," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 30(1), pages 195-222, March.
    12. Miguel Boubeta & María José Lombardía & Domingo Morales, 2016. "Empirical best prediction under area-level Poisson mixed models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 25(3), pages 548-569, September.
    13. James Dawber & Nicola Salvati & Enrico Fabrizi & Nikos Tzavidis, 2022. "Expectile regression for multi‐category outcomes with application to small area estimation of labour force participation," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(S2), pages 590-619, December.
    14. Merfeld,Joshua David & Newhouse,David Locke & Weber,Michael & Lahiri,Partha, 2022. "Combining Survey and Geospatial Data Can Significantly Improve Gender-DisaggregatedEstimates of Labor Market Outcomes," Policy Research Working Paper Series 10077, The World Bank.
    15. Hao Sun & Emily Berg & Zhengyuan Zhu, 2022. "Bivariate small‐area estimation for binary and gaussian variables based on a conditionally specified model," Biometrics, The International Biometric Society, vol. 78(4), pages 1555-1565, December.
    16. Isabel Molina & Ewa Strzalkowska‐Kominiak, 2020. "Estimation of proportions in small areas: application to the labour force using the Swiss Census Structural Survey," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 183(1), pages 281-310, January.
    17. Adam Chwila & Tomasz Żądło, 2020. "On the choice of the number of Monte Carlo iterations and bootstrap replicates in Empirical Best Prediction," Statistics in Transition New Series, Polish Statistical Association, vol. 21(2), pages 35-60, June.
    18. Domingo Morales & María del Mar Rueda & Dolores Esteban, 2018. "Model-Assisted Estimation of Small Area Poverty Measures: An Application within the Valencia Region in Spain," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 138(3), pages 873-900, August.
    19. Chwila Adam & Żądło Tomasz, 2020. "On the choice of the number of Monte Carlo iterations and bootstrap replicates in Empirical Best Prediction," Statistics in Transition New Series, Polish Statistical Association, vol. 21(2), pages 35-60, June.
    20. Ralf Münnich & Jan Pablo Burgard & Siegfried Gabler & Matthias Ganninger & Jan-Philipp Kolb, 2016. "Small Area Estimation In The German Census 2011," Statistics in Transition New Series, Polish Statistical Association, vol. 17(1), pages 25-40, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:metrik:v:85:y:2022:i:4:d:10.1007_s00184-021-00837-y. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.