IDEAS home Printed from https://ideas.repec.org/a/eee/jmvana/v184y2021ics0047259x21000154.html
   My bibliography  Save this article

Heckman selection-t model: Parameter estimation via the EM-algorithm

Author

Listed:
  • Lachos, Victor H.
  • Prates, Marcos O.
  • Dey, Dipak K.

Abstract

The Heckman selection model is perhaps the most popular econometric model in the analysis of data with sample selection. The analyses of this model are based on the normality assumption for the error terms, however, in some applications, the distribution of the error term departs significantly from normality, for instance, in the presence of heavy tails and/or atypical observation. In this paper, we explore the Heckman selection-t model where the random errors follow a bivariate Student’s-t distribution. We develop an analytically tractable and efficient EM-type algorithm for iteratively computing maximum likelihood estimates of the parameters, with standard errors as a by-product. The algorithm has closed-form expressions at the E-step, that rely on formulas for the mean and variance of the truncated Student’s-t distributions. Simulation studies show the vulnerability of the Heckman selection-normal model, as well as the robustness aspects of the Heckman selection-t model. Two real examples are analyzed, illustrating the usefulness of the proposed methods. The proposed algorithms and methods are implemented in the new R package HeckmanEM.

Suggested Citation

  • Lachos, Victor H. & Prates, Marcos O. & Dey, Dipak K., 2021. "Heckman selection-t model: Parameter estimation via the EM-algorithm," Journal of Multivariate Analysis, Elsevier, vol. 184(C).
  • Handle: RePEc:eee:jmvana:v:184:y:2021:i:c:s0047259x21000154
    DOI: 10.1016/j.jmva.2021.104737
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0047259X21000154
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jmva.2021.104737?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. William H. Rogers & John W. Tukey, 1972. "Understanding some long‐tailed symmetrical distributions," Statistica Neerlandica, Netherlands Society for Statistics and Operations Research, vol. 26(3), pages 211-226, September.
    2. Ding, Peng, 2014. "Bayesian robust inference of sample selection using selection-t models," Journal of Multivariate Analysis, Elsevier, vol. 124(C), pages 451-464.
    3. Ahn, Hyungtaik & Powell, James L., 1993. "Semiparametric estimation of censored selection models with a nonparametric selection mechanism," Journal of Econometrics, Elsevier, vol. 58(1-2), pages 3-29, July.
    4. Arellano-Valle, Reinaldo B. & Bolfarine, Heleno, 1995. "On some characterizations of the t-distribution," Statistics & Probability Letters, Elsevier, vol. 25(1), pages 79-85, October.
    5. Matos, Larissa A. & Lachos, Victor H. & Balakrishnan, N. & Labra, Filidor V., 2013. "Influence diagnostics in linear and nonlinear mixed-effects models with censored data," Computational Statistics & Data Analysis, Elsevier, vol. 57(1), pages 450-464.
    6. Lee, Lung-Fei, 1983. "Generalized Econometric Models with Selectivity," Econometrica, Econometric Society, vol. 51(2), pages 507-512, March.
    7. Heckman, James, 2013. "Sample selection bias as a specification error," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 31(3), pages 129-137.
    8. Jeffrey M Wooldridge, 2010. "Econometric Analysis of Cross Section and Panel Data," MIT Press Books, The MIT Press, edition 2, volume 1, number 0262232588, April.
    9. Harald Tauchmann, 2010. "Consistency of Heckman-type two-step estimators for the multivariate sample-selection model," Applied Economics, Taylor & Francis Journals, vol. 42(30), pages 3895-3902.
    10. L. Zhao & C. Dorea & C. Gonçalves, 2001. "On Determination of the Order of a Markov Chain," Statistical Inference for Stochastic Processes, Springer, vol. 4(3), pages 273-282, October.
    11. Heckman, James J, 1974. "Shadow Prices, Market Wages, and Labor Supply," Econometrica, Econometric Society, vol. 42(4), pages 679-694, July.
    12. Mitali Das & Whitney K. Newey & Francis Vella, 2003. "Nonparametric Estimation of Sample Selection Models," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 70(1), pages 33-58.
    13. Kai, Li, 1998. "Bayesian inference in a simultaneous equation model with limited dependent variables," Journal of Econometrics, Elsevier, vol. 85(2), pages 387-400, August.
    14. A. Azzalini & A. Capitanio, 1999. "Statistical applications of the multivariate skew normal distribution," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 61(3), pages 579-602.
    15. Zhao, Jun & Kim, Hea-Jung & Kim, Hyoung-Moon, 2020. "New EM-type algorithms for the Heckman selection model," Computational Statistics & Data Analysis, Elsevier, vol. 146(C).
    16. Yulia V. Marchenko & Marc G. Genton, 2012. "A Heckman Selection- t Model," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(497), pages 304-317, March.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Helton Saulo & Roberto Vila & Shayane S. Cordeiro, 2022. "Symmetric generalized Heckman models," Papers 2206.10054, arXiv.org.
    2. Saulo, Helton & Vila, Roberto & Cordeiro, Shayane S. & Leiva, Víctor, 2023. "Bivariate symmetric Heckman models and their characterization," Journal of Multivariate Analysis, Elsevier, vol. 193(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Victor Chernozhukov & Ivan Fernandez-Val & Siyi Luo, 2018. "Distribution regression with sample selection, with an application to wage decompositions in the UK," CeMMAP working papers CWP68/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    2. Emmanuel O. Ogundimu & Jane L. Hutton, 2016. "A Sample Selection Model with Skew-normal Distribution," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 43(1), pages 172-190, March.
    3. Ding, Peng, 2014. "Bayesian robust inference of sample selection using selection-t models," Journal of Multivariate Analysis, Elsevier, vol. 124(C), pages 451-464.
    4. Liu, Ruixuan & Yu, Zhengfei, 2022. "Sample selection models with monotone control functions," Journal of Econometrics, Elsevier, vol. 226(2), pages 321-342.
    5. Yulia V. Marchenko & Marc G. Genton, 2012. "A Heckman Selection- t Model," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(497), pages 304-317, March.
    6. Breunig, Christoph & Mammen, Enno & Simoni, Anna, 2018. "Nonparametric estimation in case of endogenous selection," Journal of Econometrics, Elsevier, vol. 202(2), pages 268-285.
    7. Lewbel, Arthur, 2007. "Endogenous selection or treatment model estimation," Journal of Econometrics, Elsevier, vol. 141(2), pages 777-806, December.
    8. Wiemann, Paul F.V. & Klein, Nadja & Kneib, Thomas, 2022. "Correcting for sample selection bias in Bayesian distributional regression models," Computational Statistics & Data Analysis, Elsevier, vol. 168(C).
    9. Qi Li & Jeffrey Scott Racine, 2006. "Nonparametric Econometrics: Theory and Practice," Economics Books, Princeton University Press, edition 1, volume 1, number 8355.
    10. Manuel Arellano & Stéphane Bonhomme, 2017. "Quantile Selection Models With an Application to Understanding Changes in Wage Inequality," Econometrica, Econometric Society, vol. 85, pages 1-28, January.
    11. Martin Huber, 2012. "Identification of Average Treatment Effects in Social Experiments Under Alternative Forms of Attrition," Journal of Educational and Behavioral Statistics, , vol. 37(3), pages 443-474, June.
    12. Casey B. Mulligan & Yona Rubinstein, 2004. "The Closing of the Gender Gap as a Roy Model Illusion," NBER Working Papers 10892, National Bureau of Economic Research, Inc.
    13. Gayle, George-Levi & Viauroux, Christelle, 2007. "Root-N consistent semiparametric estimators of a dynamic panel-sample-selection model," Journal of Econometrics, Elsevier, vol. 141(1), pages 179-212, November.
    14. Richard Blundell & Amanda Gosling & Hidehiko Ichimura & Costas Meghir, 2007. "Changes in the Distribution of Male and Female Wages Accounting for Employment Composition Using Bounds," Econometrica, Econometric Society, vol. 75(2), pages 323-363, March.
    15. Martin Huber & Giovanni Mellace, 2014. "Testing exclusion restrictions and additive separability in sample selection models," Empirical Economics, Springer, vol. 47(1), pages 75-92, August.
    16. Malikov, Emir & Kumbhakar, Subal C. & Sun, Yiguo, 2016. "Varying coefficient panel data model in the presence of endogenous selectivity and fixed effects," Journal of Econometrics, Elsevier, vol. 190(2), pages 233-251.
    17. Emmanuel O. Ogundimu, 2022. "Regularization and variable selection in Heckman selection model," Statistical Papers, Springer, vol. 63(2), pages 421-439, April.
    18. Victor Chernozhukov & Ivan Fernandez-Val & Siyi Luo, 2023. "Distribution regression with sample selection and UK wage decomposition," CeMMAP working papers 09/23, Institute for Fiscal Studies.
    19. Wojtyś, Małgorzata & Marra, Giampiero & Radice, Rosalba, 2018. "Copula based generalized additive models for location, scale and shape with non-random sample selection," Computational Statistics & Data Analysis, Elsevier, vol. 127(C), pages 1-14.
    20. Manuel Arellano & Stéphane Bonhomme, 2017. "Sample Selection in Quantile Regression: A Survey," Working Papers wp2018_1702, CEMFI.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jmvana:v:184:y:2021:i:c:s0047259x21000154. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/622892/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.