IDEAS home Printed from https://ideas.repec.org/p/syb/wpbsba/2123-18063.html
   My bibliography  Save this paper

Consistent Estimation of Linear Regression Models Using Matched Data

Author

Listed:
  • Hirukawa, Masayuki
  • Prokhorov, Artem

Abstract

Economists often use matched samples, especially when dealing with earnings data where a number of missing observations need to be imputed. In this paper, we demonstrate that the ordinary least squares estimator of the linear regression model using matched samples is inconsistent and has a nonstandard convergence rate to its probability limit. If only a few variables are used to impute the missing data, then it is possible to correct for the bias. We propose two semiparametric bias-corrected estimators and explore their asymptotic properties. The estimators have an indirect-inference interpretation and they attain the parametric convergence rate if the number of matching variables is no greater than three. Monte Carlo simulations confirm that the bias correction works very well in such cases.

Suggested Citation

  • Hirukawa, Masayuki & Prokhorov, Artem, 2017. "Consistent Estimation of Linear Regression Models Using Matched Data," Working Papers 2123/18063, University of Sydney Business School, Discipline of Business Analytics.
  • Handle: RePEc:syb:wpbsba:2123/18063
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/2123/18063
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. A. Smith, Jeffrey & E. Todd, Petra, 2005. "Does matching overcome LaLonde's critique of nonexperimental estimators?," Journal of Econometrics, Elsevier, vol. 125(1-2), pages 305-353.
    2. Currie, Janet & Yelowitz, Aaron, 2000. "Are public housing projects good for kids?," Journal of Public Economics, Elsevier, vol. 75(1), pages 99-124, January.
    3. Robinson, Peter M, 1988. "Root- N-Consistent Semiparametric Regression," Econometrica, Econometric Society, vol. 56(4), pages 931-954, July.
    4. Peter C. B. Phillips & Bruce E. Hansen, 1990. "Statistical Inference in Instrumental Variables Regression with I(1) Processes," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 57(1), pages 99-125.
    5. Ridder, Geert & Moffitt, Robert, 2007. "The Econometrics of Data Combination," Handbook of Econometrics, in: J.J. Heckman & E.E. Leamer (ed.), Handbook of Econometrics, edition 1, volume 6, chapter 75, Elsevier.
    6. Judith K. Hellerstein & Guido W. Imbens, 1999. "Imposing Moment Restrictions From Auxiliary Data By Weighting," The Review of Economics and Statistics, MIT Press, vol. 81(1), pages 1-14, February.
    7. Borjas, George J., 2004. "Food insecurity and public assistance," Journal of Public Economics, Elsevier, vol. 88(7-8), pages 1421-1443, July.
    8. Jacob A. Mincer, 1974. "Introduction to "Schooling, Experience, and Earnings"," NBER Chapters, in: Schooling, Experience, and Earnings, pages 1-4, National Bureau of Economic Research, Inc.
    9. Matias Busso & John DiNardo & Justin McCrary, 2014. "New Evidence on the Finite Sample Properties of Propensity Score Reweighting and Matching Estimators," The Review of Economics and Statistics, MIT Press, vol. 96(5), pages 885-897, December.
    10. McKinley Blackburn & David Neumark, 1992. "Unobserved Ability, Efficiency Wages, and Interindustry Wage Differentials," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 107(4), pages 1421-1436.
    11. Alberto Abadie & Guido W. Imbens, 2016. "Matching on the Estimated Propensity Score," Econometrica, Econometric Society, vol. 84, pages 781-807, March.
    12. Angrist, Joshua D & Krueger, Alan B, 1995. "Split-Sample Instrumental Variables Estimates of the Return to Schooling," Journal of Business & Economic Statistics, American Statistical Association, vol. 13(2), pages 225-235, April.
    13. Tomoki Fujii, 2008. "Two-sample estimation of poverty rates for disabled people: an application to Tanzania," Working Papers 02-2008, Singapore Management University, School of Economics.
    14. Bostic, Raphael & Gabriel, Stuart & Painter, Gary, 2009. "Housing wealth, financial wealth, and consumption: New evidence from micro data," Regional Science and Urban Economics, Elsevier, vol. 39(1), pages 79-89, January.
    15. Carrasco, Marine & Florens, Jean-Pierre, 2002. "Simulation-Based Method of Moments and Efficiency," Journal of Business & Economic Statistics, American Statistical Association, vol. 20(4), pages 482-492, October.
    16. Manuel Arellano & Costas Meghir, 1992. "Female Labour Supply and On-the-Job Search: An Empirical Model Estimated Using Complementary Data Sets," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 59(3), pages 537-559.
    17. Christopher R. Bollinger & Barry T. Hirsch, 2006. "Match Bias from Earnings Imputation in the Current Population Survey: The Case of Imperfect Matching," Journal of Labor Economics, University of Chicago Press, vol. 24(3), pages 483-520, July.
    18. Alberto Abadie & Guido W. Imbens, 2012. "A Martingale Representation for Matching Estimators," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(498), pages 833-843, June.
    19. Abadie, Alberto & Imbens, Guido W., 2011. "Bias-Corrected Matching Estimators for Average Treatment Effects," Journal of Business & Economic Statistics, American Statistical Association, vol. 29(1), pages 1-11.
    20. Jacob A. Mincer, 1974. "Schooling, Experience, and Earnings," NBER Books, National Bureau of Economic Research, Inc, number minc74-1.
    21. Olympia Bover, 2005. "Wealth effects on consumption: microeconometric estimates from the Spanish survey of household finances," Working Papers 0522, Banco de España.
    22. Horowitz, Joel L & Spokoiny, Vladimir G, 2001. "An Adaptive, Rate-Optimal Test of a Parametric Mean-Regression Model against a Nonparametric Alternative," Econometrica, Econometric Society, vol. 69(3), pages 599-631, May.
    23. Prokhorov, Artem & Schmidt, Peter, 2009. "GMM redundancy results for general missing data problems," Journal of Econometrics, Elsevier, vol. 151(1), pages 47-55, July.
    24. Yatchew, A., 1997. "An elementary estimator of the partial linear model," Economics Letters, Elsevier, vol. 57(2), pages 135-143, December.
    25. Joshua D. Angrist & Alan B. Krueger, 1993. "Split Sample Instrumental Variables," Working Papers 699, Princeton University, Department of Economics, Industrial Relations Section..
    26. Smith, A A, Jr, 1993. "Estimating Nonlinear Time-Series Models Using Simulated Vector Autoregressions," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 8(S), pages 63-84, Suppl. De.
    27. Guido W. Imbens & Tony Lancaster, 1994. "Combining Micro and Macro Data in Microeconometric Models," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 61(4), pages 655-680.
    28. White, Halbert, 1980. "A Heteroskedasticity-Consistent Covariance Matrix Estimator and a Direct Test for Heteroskedasticity," Econometrica, Econometric Society, vol. 48(4), pages 817-838, May.
    29. Barry T. Hirsch & Edward J. Schumacher, 2004. "Match Bias in Wage Gap Estimates Due to Earnings Imputation," Journal of Labor Economics, University of Chicago Press, vol. 22(3), pages 689-722, July.
    30. Atsushi Inoue & Gary Solon, 2010. "Two-Sample Instrumental Variables Estimators," The Review of Economics and Statistics, MIT Press, vol. 92(3), pages 557-561, August.
    31. James J. Heckman & Justin L. Tobias & Edward Vytlacil, 2000. "Simple Estimators for Treatment Parameters in a Latent Variable Framework with an Application to Estimating the Returns to Schooling," NBER Working Papers 7950, National Bureau of Economic Research, Inc.
    32. Chen J. & Shao J., 2001. "Jackknife Variance Estimation for Nearest-Neighbor Imputation," Journal of the American Statistical Association, American Statistical Association, vol. 96, pages 260-269, March.
    33. Irina Murtazashvili & Di Liu & Artem Prokhorov, 2015. "Two-sample nonparametric estimation of intergenerational income mobility in the United States and Sweden," Canadian Journal of Economics, Canadian Economics Association, vol. 48(5), pages 1733-1761, December.
    34. Thomas S. Dee & William N. Evans, 2003. "Teen Drinking and Educational Attainment: Evidence from Two-Sample Instrumental Variables Estimates," Journal of Labor Economics, University of Chicago Press, vol. 21(1), pages 178-209, January.
    35. Jacob A. Mincer, 1974. "Schooling and Earnings," NBER Chapters, in: Schooling, Experience, and Earnings, pages 41-63, National Bureau of Economic Research, Inc.
    36. Pagan, Adrian, 1984. "Econometric Issues in the Analysis of Regressions with Generated Regressors," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 25(1), pages 221-247, February.
    37. Lusardi, Annamaria, 1996. "Permanent Income, Current Income, and Consumption: Evidence from Two Panel Data Sets," Journal of Business & Economic Statistics, American Statistical Association, vol. 14(1), pages 81-90, January.
    38. Alberto Abadie & Guido W. Imbens, 2006. "Large Sample Properties of Matching Estimators for Average Treatment Effects," Econometrica, Econometric Society, vol. 74(1), pages 235-267, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Irina Murtazashvili & Di Liu & Artem Prokhorov, 2015. "Two‐sample nonparametric estimation of intergenerational income mobility in the United States and Sweden," Canadian Journal of Economics/Revue canadienne d'économique, John Wiley & Sons, vol. 48(5), pages 1733-1761, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Benjamin Scharadin & Edward C. Jaenicke, 2020. "Time spent on childcare and the household Healthy Eating Index," Review of Economics of the Household, Springer, vol. 18(2), pages 357-386, June.
    2. Thomas F. Crossley & Peter Levell & Stavros Poupakis, 2022. "Regression with an imputed dependent variable," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 37(7), pages 1277-1294, November.
    3. Julie L. Hotchkiss & Anil Rupasingha & Thor Watson, 2022. "In-migration and Dilution of Community Social Capital," International Regional Science Review, , vol. 45(1), pages 36-57, January.
    4. Pacini, David & Windmeijer, Frank, 2016. "Robust inference for the Two-Sample 2SLS estimator," Economics Letters, Elsevier, vol. 146(C), pages 50-54.
    5. Julie L. Hotchkiss, 2019. "US Decennial Census return rates: the role of social capital," International Journal of Social Economics, Emerald Group Publishing Limited, vol. 46(5), pages 648-668, January.
    6. Ashworth, Jared & Ransom, Tyler, 2019. "Has the college wage premium continued to rise? Evidence from multiple U.S. surveys," Economics of Education Review, Elsevier, vol. 69(C), pages 149-154.
    7. Barry T. Hirsch, 2008. "Wage Gaps Large and Small," Southern Economic Journal, John Wiley & Sons, vol. 74(4), pages 914-933, April.
    8. Prokhorov, Artem & Schmidt, Peter, 2009. "GMM redundancy results for general missing data problems," Journal of Econometrics, Elsevier, vol. 151(1), pages 47-55, July.
    9. Justin L. Tobias, 2003. "Are Returns to Schooling Concentrated Among the Most Able? A Semiparametric Analysis of the Ability–earnings Relationships," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 65(1), pages 1-29, February.
    10. Xiang, Di & Zhan, Lue & Bordignon, Massimo, 2020. "A reconsideration of the sugar sweetened beverage tax in a household production model," Food Policy, Elsevier, vol. 95(C).
    11. Timothy B. Armstrong & Michal Kolesár, 2021. "Finite‐Sample Optimal Estimation and Inference on Average Treatment Effects Under Unconfoundedness," Econometrica, Econometric Society, vol. 89(3), pages 1141-1177, May.
    12. Faqin Lin & Can Huang & Xiaobo He & Chao Zhang, 2013. "Do more highly educated entrepreneurs matter?," Asian-Pacific Economic Literature, The Crawford School, The Australian National University, vol. 27(2), pages 104-116, November.
    13. Rodríguez-Oreggia, Eduardo & López-Videla, Bruno, 2015. "Imputación de ingresos laborales. Una aplicación con encuestas de empleo en México," El Trimestre Económico, Fondo de Cultura Económica, vol. 0(325), pages .117-146, enero-mar.
    14. Neumann, George & Olitsky, Neal & Robbins, Steve, 2009. "Job congruence, academic achievement, and earnings," Labour Economics, Elsevier, vol. 16(5), pages 503-509, October.
    15. Wojan, Timothy R. & Crown, Daniel & Rupasingha, Anil, 2018. "Varieties of innovation and business survival: Does pursuit of incremental or far-ranging innovation make manufacturing establishments more resilient?," Research Policy, Elsevier, vol. 47(9), pages 1801-1810.
    16. Macpherson, David A. & Hirsch, Barry, 2021. "Five Decades of Union Wages, Nonunion Wages, and Union Wage Gaps at Unionstats.com," IZA Discussion Papers 14398, Institute of Labor Economics (IZA).
    17. Ferman, Bruno, 2021. "Matching estimators with few treated and many control observations," Journal of Econometrics, Elsevier, vol. 225(2), pages 295-307.
    18. Mai Seki, 2013. "Heterogeneous Returns to U.S. College Selectivity and the Value of Graduate Degree Attainment," Staff Working Papers 13-46, Bank of Canada.
    19. Niels-Hugo Blunch & Nabanita Datta Gupta, 2014. "Social Networks and Health Knowledge in India: Who You Know or Who You Are?," Economics Working Papers 2014-24, Department of Economics and Business Economics, Aarhus University.
    20. Atsushi Inoue & Gary Solon, 2010. "Two-Sample Instrumental Variables Estimators," The Review of Economics and Statistics, MIT Press, vol. 92(3), pages 557-561, August.

    More about this item

    Keywords

    measurement error bias; matching estimation; linear regression; indirect inference; Bias correction;
    All these keywords.

    JEL classification:

    • C13 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Estimation: General
    • C14 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Semiparametric and Nonparametric Methods: General
    • C31 - Mathematical and Quantitative Methods - - Multiple or Simultaneous Equation Models; Multiple Variables - - - Cross-Sectional Models; Spatial Models; Treatment Effect Models; Quantile Regressions; Social Interaction Models

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:syb:wpbsba:2123/18063. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Artem Prokhorov (email available below). General contact details of provider: https://edirc.repec.org/data/sbsydau.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.