IDEAS home Printed from https://ideas.repec.org/a/csb/stintr/v16y2015i4p541-562.html
   My bibliography  Save this article

Small area estimation for skewed data in the presence of zeroes

Author

Listed:
  • Forough Karlberg

Abstract

Skewed distributions with representative outliers pose a problem in many surveys. Various small area prediction approaches for skewed data based on transformation models have been proposed. However, in certain applications of those predictors, the fact that the survey data also contain a non-negligible number of zero-valued observations is sometimes dealt with rather crudely, for instance by arbitrarily adding a constant to each value (to allow zeroes to be considered as “positive observations, only smaller”, instead of acknowledging their qualitatively different nature). On the other hand, while a lognormal-logistic model has been proposed (to incorporate skewed distributions as well as zeroes), that model does not include any hierarchical aspects, and is therefore not explicitly adapted to small area prediction. In this paper, we consolidate the two approaches by extending one of the already established log-transformation mixed small area prediction models to incorporate a logistic component. This allows for the simultaneous, systematic treatment of domain effects, outliers and zero-valued observations in a single framework. We benchmark the resulting model-based predictors (against relevant alternatives) in applications to simulated data as well as empirical data from the Australian Agricultural and Grazing Industries Survey.

Suggested Citation

  • Forough Karlberg, 2015. "Small area estimation for skewed data in the presence of zeroes," Statistics in Transition new series, Główny Urząd Statystyczny (Polska), vol. 16(4), pages 541-562, December.
  • Handle: RePEc:csb:stintr:v:16:y:2015:i:4:p:541-562
    as

    Download full text from publisher

    File URL: http://index.stat.gov.pl/repec/files/csb/stintr/csb_stintr_v16_2015_i4_n6.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Ray Chambers & Nikos Tzavidis, 2006. "M-quantile models for small area estimation," Biometrika, Biometrika Trust, vol. 93(2), pages 255-268, June.
    2. Huggett, Mark, 1996. "Wealth distribution in life-cycle economies," Journal of Monetary Economics, Elsevier, vol. 38(3), pages 469-494, December.
    3. Ray Chambers & Hukum Chandra & Nicola Salvati & Nikos Tzavidis, 2014. "Outlier robust small area estimation," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 76(1), pages 47-69, January.
    4. Forough Karlberg, 2000. "Population total prediction under a lognormal superpopulation model," Metron - International Journal of Statistics, Dipartimento di Statistica, Probabilità e Statistiche Applicate - University of Rome, vol. 0(3-4), pages 53-80.
    5. Mincer, Jacob, 1970. "The Distribution of Labor Incomes: A Survey with Special Reference to the Human Capital Approach," Journal of Economic Literature, American Economic Association, vol. 8(1), pages 1-26, March.
    6. Eric V. Slud & Tapabrata Maiti, 2006. "Mean‐squared error estimation in transformed Fay–Herriot models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 68(2), pages 239-257, April.
    7. Wang, Junyuan & Fuller, Wayne A., 2003. "The Mean Squared Error of Small Area Predictors Constructed With Estimated Area Variances," Journal of the American Statistical Association, American Statistical Association, vol. 98, pages 716-723, January.
    8. Jiang, Jiming & Lahiri, P., 2006. "Estimation of Finite Population Domain Means: A Model-Assisted Empirical Best Prediction Approach," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 301-311, March.
    9. Molina, Isabel, 2009. "Uncertainty under a multivariate nested-error regression model with logarithmic transformation," Journal of Multivariate Analysis, Elsevier, vol. 100(5), pages 963-980, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Forough Karlberg, 2015. "Small Area Estimation For Skewed Data In The Presence Of Zeroes," Statistics in Transition New Series, Polish Statistical Association, vol. 16(4), pages 541-562, December.
    2. Karlberg Forough, 2015. "Small Area Estimation for Skewed Data in the Presence of Zeroes," Statistics in Transition New Series, Statistics Poland, vol. 16(4), pages 541-562, December.
    3. Berg, Emily & Chandra, Hukum, 2014. "Small area prediction for a unit-level lognormal model," Computational Statistics & Data Analysis, Elsevier, vol. 78(C), pages 159-175.
    4. J. N. K. Rao, 2015. "Inferential issues in model-based small area estimation: some new developments," Statistics in Transition new series, Główny Urząd Statystyczny (Polska), vol. 16(4), pages 491-510, December.
    5. J. N. K. Rao, 2015. "Inferential Issues In Model-Based Small Area Estimation: Some New Developments," Statistics in Transition New Series, Polish Statistical Association, vol. 16(4), pages 491-510, December.
    6. Rao J. N. K., 2015. "Inferential Issues in Model-Based Small Area Estimation: Some New Developments," Statistics in Transition New Series, Statistics Poland, vol. 16(4), pages 491-510, December.
    7. Otto-Sobotka, Fabian & Salvati, Nicola & Ranalli, Maria Giovanna & Kneib, Thomas, 2019. "Adaptive semiparametric M-quantile regression," Econometrics and Statistics, Elsevier, vol. 11(C), pages 116-129.
    8. Dian Handayani & Henk Folmer & Anang Kurnia & Khairil Anwar Notodiputro, 2018. "The spatial empirical Bayes predictor of the small area mean for a lognormal variable of interest and spatially correlated random effects," Empirical Economics, Springer, vol. 55(1), pages 147-167, August.
    9. G. Bertarelli & R. Chambers & N. Salvati, 2021. "Outlier robust small domain estimation via bias correction and robust bootstrapping," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 30(1), pages 331-357, March.
    10. Valéry Dongmo Jiongo & Pierre Nguimkeu, 2018. "Bootstrapping Mean Squared Errors of Robust Small-Area Estimators: Application to the Method-of-Payments Data," Staff Working Papers 18-28, Bank of Canada.
    11. Fabrizi, Enrico & Salvati, Nicola & Trivisano, Carlo, 2020. "Robust Bayesian small area estimation based on quantile regression," Computational Statistics & Data Analysis, Elsevier, vol. 145(C).
    12. Stefano Marchetti & Maciej Beręsewicz & Nicola Salvati & Marcin Szymkowiak & Łukasz Wawrowski, 2018. "The use of a three‐level M‐quantile model to map poverty at local administrative unit 1 in Poland," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 181(4), pages 1077-1104, October.
    13. Bhuiyan, M. Kamruj Jaman & Hossain, M. Jamal & Islam, Mohammad Amirul & Imam, M. Farouq & Quddus, Md. Abdul, 2020. "Small Area Estimation Of Nutritional Status Of Under-Five Children In Sylhet Division: An M-Quantile Approach," Bangladesh Journal of Agricultural Economics, Bangladesh Agricultural University, vol. 41(1), July.
    14. Merlo, Luca & Petrella, Lea & Salvati, Nicola & Tzavidis, Nikos, 2022. "Marginal M-quantile regression for multivariate dependent data," Computational Statistics & Data Analysis, Elsevier, vol. 173(C).
    15. Baldermann, Claudia & Salvati, Nicola & Schmid, Timo, 2016. "Robust small area estimation under spatial non-stationarity," Discussion Papers 2016/5, Free University Berlin, School of Business & Economics.
    16. Sugasawa, Shonosuke & Kubokawa, Tatsuya, 2015. "Parametric transformed Fay–Herriot model for small area estimation," Journal of Multivariate Analysis, Elsevier, vol. 139(C), pages 295-311.
    17. Fernando A. S. Moura & André Felipe Neves & Denise Britz do N. Silva, 2017. "Small area models for skewed Brazilian business survey data," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 180(4), pages 1039-1055, October.
    18. N. Salvati & E. Fabrizi & M. G. Ranalli & R. L. Chambers, 2021. "Small area estimation with linked data," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 83(1), pages 78-107, February.
    19. Dehnel Grażyna & Wawrowski Łukasz, 2020. "Robust estimation of wages in small enterprises: the application to Poland’s districts," Statistics in Transition New Series, Statistics Poland, vol. 21(1), pages 137-157, March.
    20. Paul A. Smith & Chiara Bocci & Nikos Tzavidis & Sabine Krieg & Marc J. E. Smeets, 2021. "Robust estimation for small domains in business surveys," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 70(2), pages 312-334, March.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:csb:stintr:v:16:y:2015:i:4:p:541-562. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Beata Witek (email available below). General contact details of provider: https://edirc.repec.org/data/gusgvpl.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.