IDEAS home Printed from https://ideas.repec.org/a/gam/jecnmx/v10y2022i2p13-d777879.html
   My bibliography  Save this article

A Binary Choice Model with Sample Selection and Covariate-Related Misclassification

Author

Listed:
  • Jorge González Chapela

    (Academia General Militar, Centro Universitario de la Defensa de Zaragoza, 50090 Zaragoza, Spain)

Abstract

Misclassification of a binary response variable and nonrandom sample selection are data issues frequently encountered by empirical researchers. For cases in which both issues feature simultaneously in a data set, we formulate a sample selection model for a misclassified binary outcome in which the conditional probabilities of misclassification are allowed to depend on covariates. Assuming the availability of validation data, the pseudo-maximum likelihood technique can be used to estimate the model. The performance of the estimator accounting for misclassification and sample selection is compared to that of estimators offering partial corrections. An empirical example illustrates the proposed framework.

Suggested Citation

  • Jorge González Chapela, 2022. "A Binary Choice Model with Sample Selection and Covariate-Related Misclassification," Econometrics, MDPI, vol. 10(2), pages 1-20, March.
  • Handle: RePEc:gam:jecnmx:v:10:y:2022:i:2:p:13-:d:777879
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2225-1146/10/2/13/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2225-1146/10/2/13/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Thomas Dohmen & Armin Falk & David Huffman & Uwe Sunde, 2010. "Are Risk Aversion and Impatience Related to Cognitive Ability?," American Economic Review, American Economic Association, vol. 100(3), pages 1238-1260, June.
    2. Jean-Louis Arcand & Linguère M'Baye, 2013. "Braving the waves: the role of time and risk preferences in illegal migration from Senegal," CERDI Working papers halshs-00855937, HAL.
    3. Gary S. Becker & Casey B. Mulligan, 1997. "The Endogenous Determination of Time Preference," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 112(3), pages 729-758.
    4. William W. Gould & Jeffrey Pitblado & Brian Poi, 2010. "Maximum Likelihood Estimation with Stata," Stata Press books, StataCorp LP, edition 4, number ml4, March.
    5. Aller, Carlos & González Chapela, Jorge, 2013. "Misclassification of the dependent variable in a debt–repayment behavior context," Journal of Empirical Finance, Elsevier, vol. 23(C), pages 162-172.
    6. Jonathan Cohen & Keith Marzilli Ericson & David Laibson & John Myles White, 2020. "Measuring Time Preferences," Journal of Economic Literature, American Economic Association, vol. 58(2), pages 299-347, June.
    7. Maria Felice Arezzo & Giuseppina Guagnano, 2019. "Misclassification in Binary Choice Models with Sample Selection," Econometrics, MDPI, vol. 7(3), pages 1-19, July.
    8. Gibson, John & McKenzie, David, 2011. "The microeconomic determinants of emigration and return migration of the best and brightest: Evidence from the Pacific," Journal of Development Economics, Elsevier, vol. 95(1), pages 18-29, May.
    9. Gourieroux, Christian & Monfort, Alain & Trognon, Alain, 1984. "Pseudo Maximum Likelihood Methods: Theory," Econometrica, Econometric Society, vol. 52(3), pages 681-700, May.
    10. Raven Molloy & Christopher L. Smith & Abigail Wozniak, 2011. "Internal Migration in the United States," Journal of Economic Perspectives, American Economic Association, vol. 25(3), pages 173-196, Summer.
    11. Train,Kenneth E., 2009. "Discrete Choice Methods with Simulation," Cambridge Books, Cambridge University Press, number 9780521766555, September.
    12. Ramalho, Esmeralda A., 2002. "Regression models for choice-based samples with misclassification in the response variable," Journal of Econometrics, Elsevier, vol. 106(1), pages 171-201, January.
    13. Bruce Meyer & Nikolas Mittag, 2013. "Misclassification In Binary Choice Models," Working Papers 13-27, Center for Economic Studies, U.S. Census Bureau.
    14. Poterba, James M & Summers, Lawrence H, 1995. "Unemployment Benefits and Labor Market Transitions: A Multinomial Logit Model with Errors in Classification," The Review of Economics and Statistics, MIT Press, vol. 77(2), pages 207-216, May.
    15. Bollinger, Christopher R & David, Martin H, 2001. "Estimation with Response Error and Nonresponse: Food-Stamp Participation in the SIPP," Journal of Business & Economic Statistics, American Statistical Association, vol. 19(2), pages 129-141, April.
    16. Butler, J S, 1996. "Estimating the Correlation in Censored Probit Models," The Review of Economics and Statistics, MIT Press, vol. 78(2), pages 356-358, May.
    17. Bound, John & Brown, Charles & Mathiowetz, Nancy, 2001. "Measurement error in survey data," Handbook of Econometrics, in: J.J. Heckman & E.E. Leamer (ed.), Handbook of Econometrics, edition 1, volume 5, chapter 59, pages 3705-3843, Elsevier.
    18. Meyer, Bruce D. & Mittag, Nikolas, 2017. "Misclassification in binary choice models," Journal of Econometrics, Elsevier, vol. 200(2), pages 295-311.
    19. Aline Bütikofer & Giovanni Peri, 2021. "How Cognitive Ability and Personality Traits Affect Geographic Mobility," Journal of Labor Economics, University of Chicago Press, vol. 39(2), pages 559-595.
    20. Van de Ven, Wynand P. M. M. & Van Praag, Bernard M. S., 1981. "The demand for deductibles in private health insurance : A probit model with sample selection," Journal of Econometrics, Elsevier, vol. 17(2), pages 229-252, November.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jorge González Chapela, 2022. "Is there a patience premium on migration?," Empirical Economics, Springer, vol. 63(4), pages 2025-2055, October.
    2. González Chapela, Jorge, 2020. "Patience goes a long way: Evidence from Spain," MPRA Paper 98711, University Library of Munich, Germany.
    3. Lin, Zhongjian & Hu, Yingyao, 2024. "Binary choice with misclassification and social interactions, with an application to peer effects in attitude," Journal of Econometrics, Elsevier, vol. 238(1).
    4. Bruckmeier, Kerstin & Riphahn, Regina T. & Wiemers, Jürgen, 2019. "Benefit underreporting in survey data and its consequences for measuring non-take-up: new evidence from linked administrative and survey data," IAB-Discussion Paper 201906, Institut für Arbeitsmarkt- und Berufsforschung (IAB), Nürnberg [Institute for Employment Research, Nuremberg, Germany].
    5. Zhang, Han, 2021. "How Using Machine Learning Classification as a Variable in Regression Leads to Attenuation Bias and What to Do About It," SocArXiv 453jk, Center for Open Science.
    6. Massimiliano Bratti & Alfonso Miranda, 2010. "Non‐pecuniary returns to higher education: the effect on smoking intensity in the UK," Health Economics, John Wiley & Sons, Ltd., vol. 19(8), pages 906-920, August.
    7. Kureishi, Wataru & Paule-Paludkiewicz, Hannah & Tsujiyama, Hitoshi & Wakabayashi, Midori, 2021. "Time preferences over the life cycle and household saving puzzles," Journal of Monetary Economics, Elsevier, vol. 124(C), pages 123-139.
    8. Aronsson, Thomas & Hetschko, Clemens & Schöb, Ronnie, 2023. "Populism and Impatience," Umeå Economic Studies 1019, Umeå University, Department of Economics.
    9. Meyer, Bruce D. & Mittag, Nikolas, 2021. "An empirical total survey error decomposition using data combination," Journal of Econometrics, Elsevier, vol. 224(2), pages 286-305.
    10. Molinari, Francesca, 2008. "Partial identification of probability distributions with misclassified data," Journal of Econometrics, Elsevier, vol. 144(1), pages 81-117, May.
    11. Meyer, Bruce D. & Mittag, Nikolas, 2017. "Using Linked Survey and Administrative Data to Better Measure Income: Implications for Poverty, Program Effectiveness and Holes in the Safety Net," IZA Discussion Papers 10943, Institute of Labor Economics (IZA).
    12. Ha Trong Nguyen & Huong Thu Le & Luke Connelly & Francis Mitrou, 2023. "Accuracy of self‐reported private health insurance coverage," Health Economics, John Wiley & Sons, Ltd., vol. 32(12), pages 2709-2729, December.
    13. Yingyao Hu & Zhongjian Lin, 2018. "Misclassification and the hidden silent rivalry," CeMMAP working papers CWP12/18, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    14. Bruce Meyer & Nikolas Mittag, 2017. "Using Linked Survey and Administrative Data to Better Measure Income: Implications for Poverty, Program Effectiveness and Holes in the Safety Net," Working Papers 2017-075, Human Capital and Economic Opportunity Working Group.
    15. Meyer, Bruce D. & Mittag, Nikolas, 2017. "Misclassification in binary choice models," Journal of Econometrics, Elsevier, vol. 200(2), pages 295-311.
    16. Florens Pfann & Gerard Pfann, 2024. "Can trust explain patience? A cross-country analysis," French Stata Users' Group Meetings 2024 02, Stata Users Group.
    17. Watanabe, Hajime & Maruyama, Takuya, 2024. "A Bayesian sample selection model with a binary outcome for handling residential self-selection in individual car ownership," Journal of choice modelling, Elsevier, vol. 51(C).
    18. Ida, Takanori & Goto, Rei & Takahashi, Yuko & Nishimura, Shuzo, 2011. "Can economic-psychological parameters predict successful smoking cessation?," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 40(3), pages 285-295, May.
    19. Samuel Bazzi & Lisa Cameron & Simone Schaner & Firman Witoelar, 2021. "Information, Intermediaries, and International Migration," Melbourne Institute Working Paper Series wp2021n30, Melbourne Institute of Applied Economic and Social Research, The University of Melbourne.
    20. Finn, Claire & Harmon, Colm, 2006. "A dynamic model of demand for private health insurance in Ireland," Papers HRBWP17, Economic and Social Research Institute (ESRI).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jecnmx:v:10:y:2022:i:2:p:13-:d:777879. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.