IDEAS home Printed from https://ideas.repec.org/a/spr/testjl/v28y2019i2d10.1007_s11749-018-0594-2.html
   My bibliography  Save this article

A generalized mixed model for skewed distributions applied to small area estimation

Author

Listed:
  • Monique Graf

    (Institut de Statistique, Université de Neuchâtel
    Elpacos Statistics)

  • J. Miguel Marín

    (Universidad Carlos III de Madrid)

  • Isabel Molina

    (Universidad Carlos III de Madrid)

Abstract

Models with random (or mixed) effects are commonly used for panel data, in microarrays, small area estimation and many other applications. When the variable of interest is continuous, normality is commonly assumed, either in the original scale or after some transformation. However, the normal distribution is not always well suited for modeling data on certain variables, such as those found in Econometrics, which often show skewness even at the log scale. Finding the correct transformation to achieve normality is not straightforward since the true distribution is not known in practice. As an alternative, we propose to consider a much more flexible distribution called generalized beta of the second kind (GB2). The GB2 distribution contains four parameters with two of them controlling the shape of each tail, which makes it very flexible to accommodate different forms of skewness. Based on a multivariate extension of the GB2 distribution, we propose a new model with random effects designed for skewed response variables that extends the usual log-normal-nested error model. Under this new model, we find empirical best predictors of linear and nonlinear characteristics, including poverty indicators, in small areas. Simulation studies illustrate the good properties, in terms of bias and efficiency, of the estimators based on the proposed multivariate GB2 model. Results from an application to poverty mapping in Spanish provinces also indicate efficiency gains with respect to the conventional log-normal-nested error model used for poverty mapping.

Suggested Citation

  • Monique Graf & J. Miguel Marín & Isabel Molina, 2019. "A generalized mixed model for skewed distributions applied to small area estimation," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 28(2), pages 565-597, June.
  • Handle: RePEc:spr:testjl:v:28:y:2019:i:2:d:10.1007_s11749-018-0594-2
    DOI: 10.1007/s11749-018-0594-2
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11749-018-0594-2
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11749-018-0594-2?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. González-Manteiga, W. & Lombardi­a, M.J. & Molina, I. & Morales, D. & Santamari­a, L., 2008. "Analytic and bootstrap approximations of prediction errors under a multivariate Fay-Herriot model," Computational Statistics & Data Analysis, Elsevier, vol. 52(12), pages 5242-5252, August.
    2. Parker, Simon C, 1997. "The Distribution of Self-Employment Income in the United Kingdom, 1976-1991," Economic Journal, Royal Economic Society, vol. 107(441), pages 455-466, March.
    3. McDonald, James B. & Xu, Yexiao J., 1995. "A generalization of the beta distribution with applications," Journal of Econometrics, Elsevier, vol. 66(1-2), pages 133-152.
    4. McDonald, James B & Butler, Richard J, 1987. "Some Generalized Mixture Distributions with an Application to Unemployment Duration," The Review of Economics and Statistics, MIT Press, vol. 69(2), pages 232-240, May.
    5. Chris Elbers & Jean O. Lanjouw & Peter Lanjouw, 2003. "Micro--Level Estimation of Poverty and Inequality," Econometrica, Econometric Society, vol. 71(1), pages 355-364, January.
    6. Theodossiou, Panayiotis & McDonald, James B. & Hansen, Christian B., 2007. "Some Flexible Parametric Models for Partially Adaptive Estimators of Econometric Models," Economics - The Open-Access, Open-Assessment E-Journal (2007-2020), Kiel Institute for the World Economy (IfW Kiel), vol. 1, pages 1-20.
    7. Pfeffermann, Danny & Sverchkov, Michail, 2007. "Small-Area Estimation Under Informative Probability Sampling of Areas and Within the Selected Areas," Journal of the American Statistical Association, American Statistical Association, vol. 102, pages 1427-1439, December.
    8. Samuel Dastrup & Rachel Hartshorn & James McDonald, 2007. "The impact of taxes and transfer payments on the distribution of income: A parametric comparison," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 5(3), pages 353-369, December.
    9. James B. McDonald, 2008. "Some Generalized Functions for the Size Distribution of Income," Economic Studies in Inequality, Social Exclusion, and Well-Being, in: Duangkamon Chotikapanich (ed.), Modeling Income Distributions and Lorenz Curves, chapter 3, pages 37-55, Springer.
    10. Arne Henningsen & Ott Toomet, 2011. "maxLik: A package for maximum likelihood estimation in R," Computational Statistics, Springer, vol. 26(3), pages 443-458, September.
    11. Tomáš Hobza & Domingo Morales & Laureano Santamaría, 2018. "Small area estimation of poverty proportions under unit-level temporal binomial-logit mixed models," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 27(2), pages 270-294, June.
    12. Yang, Xipei & Frees, Edward W. & Zhang, Zhengjun, 2011. "A generalized beta copula with applications in modeling multivariate long-tailed data," Insurance: Mathematics and Economics, Elsevier, vol. 49(2), pages 265-284, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Juan Manuel Espejo Benítez & José María Millán Tapia, 2023. "Población en riesgo de pobreza y/o exclusión social. Propuesta metodológica para la estimación del indicador AROPE en los municipios de Andalucía," Hacienda Pública Española / Review of Public Economics, IEF, vol. 246(3), pages 101-135, September.
    2. Molina Isabel, 2020. "Discussion of “Small area estimation: its evolution in five decades”, by Malay Ghosh," Statistics in Transition New Series, Statistics Poland, vol. 21(4), pages 40-44, August.
    3. Guadarrama, María & Morales, Domingo & Molina, Isabel, 2021. "Time stable empirical best predictors under a unit-level model," Computational Statistics & Data Analysis, Elsevier, vol. 160(C).
    4. María José Lombardía & Esther López‐Vizcaíno & Cristina Rueda, 2022. "A new approach to the gender pay gap decomposition by economic activity," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(1), pages 219-245, January.
    5. Patrick Krennmair & Timo Schmid, 2022. "Flexible domain prediction using mixed effects random forests," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 71(5), pages 1865-1894, November.
    6. Marchetti Stefano & Tzavidis Nikos, 2021. "Robust Estimation of the Theil Index and the Gini Coeffient for Small Areas," Journal of Official Statistics, Sciendo, vol. 37(4), pages 955-979, December.
    7. Isabel Molina, 2020. "Discussion of "Small area estimation: its evolution in five decades", by Malay Ghosh," Statistics in Transition New Series, Polish Statistical Association, vol. 21(4), pages 40-44, August.
    8. María Dolores Esteban & María José Lombardía & Esther López‐Vizcaíno & Domingo Morales & Agustín Pérez, 2022. "Empirical best prediction of small area bivariate parameters," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 49(4), pages 1699-1727, December.
    9. Natalia Rojas‐Perilla & Sören Pannier & Timo Schmid & Nikos Tzavidis, 2020. "Data‐driven transformations in small area estimation," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 183(1), pages 121-148, January.
    10. Aldo Gardini & Enrico Fabrizi & Carlo Trivisano, 2022. "Poverty and inequality mapping based on a unit‐level log‐normal mixture model," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(4), pages 2073-2096, October.
    11. Isabel Molina & Paul Corral & Minh Nguyen, 2022. "Estimation of poverty and inequality in small areas: review and discussion," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 31(4), pages 1143-1166, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. José María Sarabia & Vanesa Jordá & Faustino Prieto & Montserrat Guillén, 2020. "Multivariate Classes of GB2 Distributions with Applications," Mathematics, MDPI, vol. 9(1), pages 1-21, December.
    2. Vladimir Hlasny, 2021. "Parametric representation of the top of income distributions: Options, historical evidence, and model selection," Journal of Economic Surveys, Wiley Blackwell, vol. 35(4), pages 1217-1256, September.
    3. Michał Brzeziński, 2013. "Parametric Modelling of Income Distribution in Central and Eastern Europe," Central European Journal of Economic Modelling and Econometrics, Central European Journal of Economic Modelling and Econometrics, vol. 5(3), pages 207-230, September.
    4. Callealta Barroso, Francisco Javier & García-Pérez, Carmelo & Prieto-Alaiz, Mercedes, 2020. "Modelling income distribution using the log Student’s t distribution: New evidence for European Union countries," Economic Modelling, Elsevier, vol. 89(C), pages 512-522.
    5. Sung Y. Park & Anil K. Bera, 2018. "Information theoretic approaches to income density estimation with an application to the U.S. income data," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 16(4), pages 461-486, December.
    6. Shi, Peng & Valdez, Emiliano A., 2011. "A copula approach to test asymmetric information with applications to predictive modeling," Insurance: Mathematics and Economics, Elsevier, vol. 49(2), pages 226-239, September.
    7. Peng Shi & Wei Zhang, 2011. "A copula regression model for estimating firm efficiency in the insurance industry," Journal of Applied Statistics, Taylor & Francis Journals, vol. 38(10), pages 2271-2287.
    8. Higbee, Joshua D. & Jensen, Jonathan E. & McDonald, James B., 2019. "The asymmetric log-Laplace distribution as a limiting case of the generalized beta distribution," Statistics & Probability Letters, Elsevier, vol. 151(C), pages 73-78.
    9. Isabel Molina & Paul Corral & Minh Nguyen, 2022. "Estimation of poverty and inequality in small areas: review and discussion," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 31(4), pages 1143-1166, December.
    10. Sarabia, José María & Jordá, Vanesa, 2014. "Explicit expressions of the Pietra index for the generalized function for the size distribution of income," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 416(C), pages 582-595.
    11. Walter, Paul & Weimer, Katja, 2018. "Estimating poverty and inequality indicators using interval censored income data from the German microcensus," Discussion Papers 2018/10, Free University Berlin, School of Business & Economics.
    12. Monique Graf & Desislava Nedyalkova, 2014. "Modeling of Income and Indicators of Poverty and Social Exclusion Using the Generalized Beta Distribution of the Second Kind," Review of Income and Wealth, International Association for Research in Income and Wealth, vol. 60(4), pages 821-842, December.
    13. Boccanfuso, Dorothée & Richard, Patrick & Savard, Luc, 2013. "Parametric and nonparametric income distribution estimators in CGE micro-simulation modeling," Economic Modelling, Elsevier, vol. 35(C), pages 892-899.
    14. Andrew M. Jones & James Lomas & Nigel Rice, 2014. "Applying Beta‐Type Size Distributions To Healthcare Cost Regressions," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 29(4), pages 649-670, June.
    15. Higbee, Joshua D. & McDonald, James B., 2024. "A comparison of the GB2 and skewed generalized log-t distributions with an application in finance," Journal of Econometrics, Elsevier, vol. 240(2).
    16. BenSaïda, Ahmed & Slim, Skander, 2016. "Highly flexible distributions to fit multiple frequency financial returns," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 442(C), pages 203-213.
    17. Chotikapanich, Duangkamon & Griffiths, William E, 2002. "Estimating Lorenz Curves Using a Dirichlet Distribution," Journal of Business & Economic Statistics, American Statistical Association, vol. 20(2), pages 290-295, April.
    18. Chotikapanich, Duangkamon & Griffiths, William E. & Rao, D.S. Prasada & Karunarathne, Wasana, 2014. "Income Distributions, Inequality, and Poverty in Asia, 1992–2010," ADBI Working Papers 468, Asian Development Bank Institute.
    19. Hajargasht, Gholamreza & Griffiths, William E., 2013. "Pareto–lognormal distributions: Inequality, poverty, and estimation from grouped income data," Economic Modelling, Elsevier, vol. 33(C), pages 593-604.
    20. Saissi Hassani, Samir & Dionne, Georges, 2021. "The New International Regulation of Market Risk: Roles of VaR and CVaR in Model Validation," Working Papers 21-1, HEC Montreal, Canada Research Chair in Risk Management.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:testjl:v:28:y:2019:i:2:d:10.1007_s11749-018-0594-2. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.