Printed from https://ideas.repec.org/a/sae/somere/v18y1989i2-3p360-390.html

Selection Bias in Linear Regression, Logit and Probit Models

Authors

  • Jeffrey A. Dubin (California Institute of Technology)
  • Douglas Rivers (Stanford University)

Abstract

Missing data are common in observational studies due to self-selection of subjects. Missing data can bias estimates of linear regression and related models. The nature of selection bias and econometric methods for correcting it are described. The econometric approach relies upon a specification of the selection mechanism. We extend this approach to binary logit and probit models and provide a simple test for selection bias in these models. An analysis of candidate preference in the 1984 U.S. presidential election illustrates the technique.
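The bias described in the abstract, and a correction based on a specified selection mechanism, can be illustrated with a small simulation. The following is a minimal sketch of a Heckman-style correction for the linear regression case only, not the authors' own procedure for logit and probit models. All names and parameter values (`simulate`, `compare_estimates`, `beta`, `rho`, the selection rule `x + z + u > 0`) are hypothetical, and for brevity the true selection index `x + z` is treated as known rather than estimated by a first-stage probit.

```python
import math
import random
from statistics import NormalDist

STD_NORMAL = NormalDist()  # standard normal, used for phi and Phi


def inverse_mills(index):
    """Inverse Mills ratio phi(index) / Phi(index)."""
    return STD_NORMAL.pdf(index) / STD_NORMAL.cdf(index)


def ols(rows, y):
    """Least-squares coefficients from the normal equations (Gauss-Jordan)."""
    k = len(rows[0])
    # Augmented matrix [X'X | X'y].
    a = [
        [sum(r[i] * r[j] for r in rows) for j in range(k)]
        + [sum(r[i] * yi for r, yi in zip(rows, y))]
        for i in range(k)
    ]
    for col in range(k):
        # Partial pivoting, then clear this column from every other row.
        pivot = max(range(col, k), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        for r in range(k):
            if r != col:
                factor = a[r][col] / a[col][col]
                a[r] = [v - factor * w for v, w in zip(a[r], a[col])]
    return [a[i][k] / a[i][i] for i in range(k)]


def simulate(n=20000, beta=1.0, rho=0.8, seed=0):
    """Draw (x, z, y) and keep only the self-selected cases.

    Outcome:   y = beta * x + e
    Selection: a case is observed only if x + z + u > 0, corr(e, u) = rho.
    These equations are assumptions made for this illustration.
    """
    rng = random.Random(seed)
    sample = []
    for _ in range(n):
        x, z, u = rng.gauss(0, 1), rng.gauss(0, 1), rng.gauss(0, 1)
        e = rho * u + math.sqrt(1 - rho ** 2) * rng.gauss(0, 1)
        if x + z + u > 0:
            sample.append((x, z, beta * x + e))
    return sample


def compare_estimates(sample):
    """Return (naive slope, corrected slope, coefficient on the Mills ratio)."""
    y = [yi for _, _, yi in sample]
    # Naive regression of y on x, ignoring how cases entered the sample.
    naive = ols([[1.0, x] for x, _, _ in sample], y)
    # Corrected regression adds the inverse Mills ratio of the selection index.
    corrected = ols([[1.0, x, inverse_mills(x + z)] for x, z, _ in sample], y)
    return naive[1], corrected[1], corrected[2]


if __name__ == "__main__":
    naive_slope, corrected_slope, mills_coef = compare_estimates(simulate())
    print(f"naive slope:       {naive_slope:.3f}")
    print(f"corrected slope:   {corrected_slope:.3f}")
    print(f"Mills coefficient: {mills_coef:.3f}")
```

Under these assumptions the naive slope falls short of the true value of 1, while the regression that includes the inverse Mills ratio recovers it approximately. In this linear setting, a coefficient on the Mills ratio that is clearly different from zero is the usual informal signal that selection is affecting the estimates.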

Suggested Citation

  • Jeffrey A. Dubin & Douglas Rivers, 1989. "Selection Bias in Linear Regression, Logit and Probit Models," Sociological Methods & Research, vol. 18(2-3), pages 360-390, November.
  • Handle: RePEc:sae:somere:v:18:y:1989:i:2-3:p:360-390
    DOI: 10.1177/0049124189018002006

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/0049124189018002006
    Download Restriction: no


    References listed on IDEAS

    1. Dubin, Jeffrey A & McFadden, Daniel L, 1984. "An Econometric Analysis of Residential Electric Appliance Holdings and Consumption," Econometrica, Econometric Society, vol. 52(2), pages 345-362, March.
    2. Zvi Griliches, 1957. "Specification Bias in Estimates of Production Functions," American Journal of Agricultural Economics, Agricultural and Applied Economics Association, vol. 39(1), pages 8-20.
    3. Chamberlain, Gary, 1986. "Asymptotic efficiency in semi-parametric models with censoring," Journal of Econometrics, Elsevier, vol. 32(2), pages 189-218, July.
    4. Gourieroux, Christian & Monfort, Alain & Renault, Eric & Trognon, Alain, 1987. "Generalised residuals," Journal of Econometrics, Elsevier, vol. 34(1-2), pages 5-32.
    5. Heckman, James, 2013. "Sample selection bias as a specification error," Applied Econometrics, Russian Presidential Academy of National Economy and Public Administration (RANEPA), vol. 31(3), pages 129-137.
    6. Squire, Peverill & Wolfinger, Raymond E. & Glass, David P., 1987. "Residential Mobility and Voter Turnout," American Political Science Review, Cambridge University Press, vol. 81(1), pages 45-65, March.
    7. Lung-Fei Lee, 1982. "Some Approaches to the Correction of Selectivity Bias," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 49(3), pages 355-372.
    8. Heckman, James J. & Robb, Richard Jr., 1985. "Alternative methods for evaluating the impact of interventions : An overview," Journal of Econometrics, Elsevier, vol. 30(1-2), pages 239-267.
    9. Duncan, Gregory M., 1987. "A simplified approach to M-estimation with application to two-stage estimators," Journal of Econometrics, Elsevier, vol. 34(3), pages 373-389, March.
    10. Powell, James L., 1984. "Least absolute deviations estimation for the censored regression model," Journal of Econometrics, Elsevier, vol. 25(3), pages 303-325, July.
    11. Stoker, Thomas M, 1986. "Consistent Estimation of Scaled Coefficients," Econometrica, Econometric Society, vol. 54(6), pages 1461-1481, November.
    12. Arabmazar, Abbas & Schmidt, Peter, 1981. "Further evidence on the robustness of the Tobit estimator to heteroskedasticity," Journal of Econometrics, Elsevier, vol. 17(2), pages 253-258, November.
    13. Amemiya, Takeshi, 1984. "Tobit models: A survey," Journal of Econometrics, Elsevier, vol. 24(1-2), pages 3-61.
    14. Powell, James L., 1987. "Semiparametric Estimation Of Bivariate Latent Variable Models," SSRI Workshop Series 292689, University of Wisconsin-Madison, Social Systems Research Institute.

    Citations



    Cited by:

    1. Kailitz, Steffen & Tanneberg, Dag, 2015. "Legitimation, Kooptation, Repression und das Überleben von Autokratien „im Umfeld autokratischer Wahlen". Eine Replik auf den Beitrag von Hans Lueders und Aurel Croissant," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 9(1/2), pages 73-82.
    2. Maria Felice Arezzo & Giuseppina Guagnano, 2019. "Misclassification in Binary Choice Models with Sample Selection," Econometrics, MDPI, vol. 7(3), pages 1-19, July.
    3. Fieger, Peter, 2013. "Using Student Outcome Survey Data for institutional performance measurement," MPRA Paper 76839, University Library of Munich, Germany.
    4. Mac an Bhaird, Ciarán, 2013. "Demand for debt and equity before and after the financial crisis," Research in International Business and Finance, Elsevier, vol. 28(C), pages 105-117.
    5. Becker, Rolf, 2000. "Determinanten der Studierbereitschaft in Ostdeutschland : eine empirische Anwendung der Humankapital- und Werterwartungstheorie am Beispiel sächsicher Abiturienten in den Jahren 1996 und 1998 (Determi," Mitteilungen aus der Arbeitsmarkt- und Berufsforschung, Institut für Arbeitsmarkt- und Berufsforschung (IAB), Nürnberg [Institute for Employment Research, Nuremberg, Germany], vol. 33(2), pages 261-276.
    6. Birhanu, Mulugeta Yitayih & Girma, Anteneh & Puskur, Ranjitha, 2017. "Determinants of success and intensity of livestock feed technologies use in Ethiopia: Evidence from a positive deviance perspective," Technological Forecasting and Social Change, Elsevier, vol. 115(C), pages 15-25.
    7. Joshua Clinton & John Lapinski, 2004. "Targeted advertising and voter turnout: An experimental study of the 2000 presidential election," Natural Field Experiments 00226, The Field Experiments Website.
    8. Maria Felice Arezzo & Giuseppina Guagnano, 2018. "Response-Based Sampling for Binary Choice Models With Sample Selection," Econometrics, MDPI, vol. 6(1), pages 1-17, March.
    9. Cheng, Tyrone C. & Lo, Celia C., 2018. "Racial disparities in the proportion of needed services maltreated children received," Children and Youth Services Review, Elsevier, vol. 94(C), pages 72-81.
    10. Jessica Pearlman & Lisa D. Pearce & Dirgha J. Ghimire & Prem Bhandari & Taylor Hargrove, 2017. "Postmarital Living Arrangements in Historically Patrilocal Settings: Integrating Household Fission and Migration Perspectives," Demography, Springer;Population Association of America (PAA), vol. 54(4), pages 1425-1449, August.
    11. Giampiero Marra & Rosalba Radice & Till Bärnighausen & Simon N. Wood & Mark E. McGovern, 2017. "A Simultaneous Equation Approach to Estimating HIV Prevalence With Nonignorable Missing Responses," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(518), pages 484-496, April.
    12. Valérie Revest & Alessandro Sapio, 2016. "Graduation and sell-out strategies in the Alternative Investment Market," Discussion Papers 4_2016, CRISEI, University of Naples "Parthenope", Italy.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Gordon B. Dahl, 2002. "Mobility and the Return to Education: Testing a Roy Model with Multiple Markets," Econometrica, Econometric Society, vol. 70(6), pages 2367-2420, November.
    2. Angrist, Joshua D., 1997. "Conditional independence in sample selection models," Economics Letters, Elsevier, vol. 54(2), pages 103-112, February.
    3. Donald, Stephen G., 1995. "Two-step estimation of heteroskedastic sample selection models," Journal of Econometrics, Elsevier, vol. 65(2), pages 347-380, February.
    4. James J. Heckman, 2005. "Micro Data, Heterogeneity and the Evaluation of Public Policy Part 2," The American Economist, Sage Publications, vol. 49(1), pages 16-44, March.
    5. Bhat, Chandra R. & Eluru, Naveen, 2009. "A copula-based approach to accommodate residential self-selection effects in travel behavior modeling," Transportation Research Part B: Methodological, Elsevier, vol. 43(7), pages 749-765, August.
    6. Claudia PIGINI, 2012. "Of Butterflies and Caterpillars: Bivariate Normality in the Sample Selection Model," Working Papers 377, Universita' Politecnica delle Marche (I), Dipartimento di Scienze Economiche e Sociali.
    7. Qi Li & Jeffrey Scott Racine, 2006. "Nonparametric Econometrics: Theory and Practice," Economics Books, Princeton University Press, edition 1, volume 1, number 8355.
    8. Lanot, Gauthier & Walker, Ian, 1998. "The union/non-union wage differential: An application of semi-parametric methods," Journal of Econometrics, Elsevier, vol. 84(2), pages 327-349, June.
    9. Paul Ellickson & Sanjog Misra, 2012. "Enriching interactions: Incorporating outcome data into static discrete games," Quantitative Marketing and Economics (QME), Springer, vol. 10(1), pages 1-26, March.
    10. Hajivassiliou, Vassilis A. & Ruud, Paul A., 1986. "Classical estimation methods for LDV models using simulation," Handbook of Econometrics, in: R. F. Engle & D. McFadden (ed.), Handbook of Econometrics, edition 1, volume 4, chapter 40, pages 2383-2441, Elsevier.
    11. Simons, Andrew M., 2022. "What is the optimal locus of control for social assistance programs? Evidence from the Productive Safety Net Program in Ethiopia," Journal of Development Economics, Elsevier, vol. 158(C).
    12. Golan, Amos & Judge, George & Perloff, Jeffrey, 1997. "Estimation and inference with censored and ordered multinomial response data," Journal of Econometrics, Elsevier, vol. 79(1), pages 23-51, July.
    13. Patrick Kline & Christopher R. Walters, 2019. "On Heckits, LATE, and Numerical Equivalence," Econometrica, Econometric Society, vol. 87(2), pages 677-696, March.
    14. Gilpin, Gregory A., 2011. "Reevaluating the effect of non-teaching wages on teacher attrition," Economics of Education Review, Elsevier, vol. 30(4), pages 598-616, August.
    15. Jochmans, Koen, 2015. "Multiplicative-error models with sample selection," Journal of Econometrics, Elsevier, vol. 184(2), pages 315-327.
    16. Yu, Ping & Phillips, Peter C.B., 2018. "Threshold regression with endogeneity," Journal of Econometrics, Elsevier, vol. 203(1), pages 50-68.
    17. Kamel Jedidi & Carl F. Mela & Sunil Gupta, 1999. "Managing Advertising and Promotion for Long-Run Profitability," Marketing Science, INFORMS, vol. 18(1), pages 1-22.
    18. Ana Fernandez Sainz & Juan Rodriguez-Poo & Inmaculada Villanua Martin, 2002. "Finite sample behavior of two step estimators in selection models," Computational Statistics, Springer, vol. 17(1), pages 1-16, March.
    19. Randall A. Lewis & James B. McDonald, 2014. "Partially Adaptive Estimation of the Censored Regression Model," Econometric Reviews, Taylor & Francis Journals, vol. 33(7), pages 732-750, October.
    20. François Bourguignon & Martin Fournier & Marc Gurgand, 2007. "Selection Bias Corrections Based On The Multinomial Logit Model: Monte Carlo Comparisons," Journal of Economic Surveys, Wiley Blackwell, vol. 21(1), pages 174-205, February.
