IDEAS home Printed from https://ideas.repec.org/p/arx/papers/1010.4345.html
   My bibliography  Save this paper

Sparse Models and Methods for Optimal Instruments with an Application to Eminent Domain

Author

Listed:
  • Alexandre Belloni
  • Daniel Chen
  • Victor Chernozhukov
  • Christian Hansen

Abstract

We develop results for the use of Lasso and Post-Lasso methods to form first-stage predictions and estimate optimal instruments in linear instrumental variables (IV) models with many instruments, $p$. Our results apply even when $p$ is much larger than the sample size, $n$. We show that the IV estimator based on using Lasso or Post-Lasso in the first stage is root-n consistent and asymptotically normal when the first-stage is approximately sparse; i.e. when the conditional expectation of the endogenous variables given the instruments can be well-approximated by a relatively small set of variables whose identities may be unknown. We also show the estimator is semi-parametrically efficient when the structural error is homoscedastic. Notably our results allow for imperfect model selection, and do not rely upon the unrealistic "beta-min" conditions that are widely used to establish validity of inference following model selection. In simulation experiments, the Lasso-based IV estimator with a data-driven penalty performs well compared to recently advocated many-instrument-robust procedures. In an empirical example dealing with the effect of judicial eminent domain decisions on economic outcomes, the Lasso-based IV estimator outperforms an intuitive benchmark. In developing the IV results, we establish a series of new results for Lasso and Post-Lasso estimators of nonparametric conditional expectation functions which are of independent theoretical and practical interest. We construct a modification of Lasso designed to deal with non-Gaussian, heteroscedastic disturbances which uses a data-weighted $\ell_1$-penalty function. Using moderate deviation theory for self-normalized sums, we provide convergence rates for the resulting Lasso and Post-Lasso estimators that are as sharp as the corresponding rates in the homoscedastic Gaussian case under the condition that $\log p = o(n^{1/3})$.

Suggested Citation

  • Alexandre Belloni & Daniel Chen & Victor Chernozhukov & Christian Hansen, 2010. "Sparse Models and Methods for Optimal Instruments with an Application to Eminent Domain," Papers 1010.4345, arXiv.org, revised Apr 2015.
  • Handle: RePEc:arx:papers:1010.4345
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/1010.4345
    File Function: Latest version
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Kapetanios, George & Marcellino, Massimiliano, 2010. "Factor-GMM estimation with large sets of possibly weak instruments," Computational Statistics & Data Analysis, Elsevier, vol. 54(11), pages 2655-2675, November.
    2. Victor DeMiguel & Lorenzo Garlappi & Francisco J. Nogales & Raman Uppal, 2009. "A Generalized Approach to Portfolio Optimization: Improving Performance by Constraining Portfolio Norms," Management Science, INFORMS, vol. 55(5), pages 798-812, May.
    3. A. Belloni & D. Chen & V. Chernozhukov & C. Hansen, 2012. "Sparse Models and Methods for Optimal Instruments With an Application to Eminent Domain," Econometrica, Econometric Society, vol. 80(6), pages 2369-2429, November.
    4. Bai, Jushan & Ng, Serena, 2008. "Forecasting economic time series using targeted predictors," Journal of Econometrics, Elsevier, vol. 146(2), pages 304-317, October.
    5. Eric Gautier & Alexandre Tsybakov, 2011. "High-Dimensional Instrumental Variables Regression and Confidence Sets," Working Papers 2011-13, Center for Research in Economics and Statistics.
    6. Douglas Staiger & James H. Stock, 1997. "Instrumental Variables Regression with Weak Instruments," Econometrica, Econometric Society, vol. 65(3), pages 557-586, May.
    7. Frank Kleibergen, 2005. "Testing Parameters in GMM Without Assuming that They Are Identified," Econometrica, Econometric Society, vol. 73(4), pages 1103-1123, July.
    8. Frank Kleibergen, 2002. "Pivotal Statistics for Testing Structural Parameters in Instrumental Variables Regression," Econometrica, Econometric Society, vol. 70(5), pages 1781-1803, September.
    9. Carrasco, Marine & Tchuente, Guy, 2015. "Regularized LIML for many instruments," Journal of Econometrics, Elsevier, vol. 186(2), pages 427-442.
    10. Jerry A. Hausman & Whitney K. Newey & Tiemen Woutersen & John C. Chao & Norman R. Swanson, 2012. "Instrumental variable estimation with heteroskedasticity and many instruments," Quantitative Economics, Econometric Society, vol. 3(2), pages 211-255, July.
    11. Donald W.K. Andrews & James H. Stock, 2005. "Inference with Weak Instruments," Cowles Foundation Discussion Papers 1530, Cowles Foundation for Research in Economics, Yale University.
    12. Donald, Stephen G & Newey, Whitney K, 2001. "Choosing the Number of Instruments," Econometrica, Econometric Society, vol. 69(5), pages 1161-1191, September.
    13. Stock, James H & Wright, Jonathan H & Yogo, Motohiro, 2002. "A Survey of Weak Instruments and Weak Identification in Generalized Method of Moments," Journal of Business & Economic Statistics, American Statistical Association, vol. 20(4), pages 518-529, October.
    14. Donald W. K. Andrews & Marcelo J. Moreira & James H. Stock, 2006. "Optimal Two-Sided Invariant Similar Tests for Instrumental Variables Regression," Econometrica, Econometric Society, vol. 74(3), pages 715-752, May.
    15. Carrasco, Marine, 2012. "A regularization approach to the many instruments problem," Journal of Econometrics, Elsevier, vol. 170(2), pages 383-398.
    16. John C. Chao & Norman R. Swanson, 2005. "Consistent Estimation with a Large Number of Weak Instruments," Econometrica, Econometric Society, vol. 73(5), pages 1673-1692, September.
    17. Joshua D. Angrist & Alan B. Krueger, 1993. "Split Sample Instrumental Variables," Working Papers 699, Princeton University, Department of Economics, Industrial Relations Section..
    18. Chamberlain, Gary, 1987. "Asymptotic efficiency in estimation with conditional moment restrictions," Journal of Econometrics, Elsevier, vol. 34(3), pages 305-334, March.
    19. Giannone, Domenico & De Mol, Christine & Daubechies, Ingrid & Brodie, Joshua, 2007. "Sparse and Stable Markowitz Portfolios," CEPR Discussion Papers 6474, C.E.P.R. Discussion Papers.
    20. Newey, Whitney K, 1990. "Efficient Instrumental Variables Estimation of Nonlinear Models," Econometrica, Econometric Society, vol. 58(4), pages 809-837, July.
    21. Gary Chamberlain & Guido Imbens, 2004. "Random Effects Estimators with many Instrumental Variables," Econometrica, Econometric Society, vol. 72(1), pages 295-306, January.
    22. Jinyong Hahn & Jerry Hausman & Guido Kuersteiner, 2004. "Estimation with weak instruments: Accuracy of higher-order bias and MSE approximations," Econometrics Journal, Royal Economic Society, vol. 7(1), pages 272-306, June.
    23. Knight, Keith, 2008. "Shrinkage Estimation For Nearly Singular Designs," Econometric Theory, Cambridge University Press, vol. 24(2), pages 323-337, April.
    24. Hahn, Jinyong, 2002. "Optimal Inference With Many Instruments," Econometric Theory, Cambridge University Press, vol. 18(1), pages 140-168, February.
    25. Okui, Ryo, 2011. "Instrumental variable estimation in the presence of many moment conditions," Journal of Econometrics, Elsevier, vol. 165(1), pages 70-86.
    26. Andrews,Donald W. K. & Stock,James H. (ed.), 2005. "Identification and Inference for Econometric Models," Cambridge Books, Cambridge University Press, number 9780521844413, September.
    27. Hansen, Christian & Hausman, Jerry & Newey, Whitney, 2008. "Estimation With Many Instrumental Variables," Journal of Business & Economic Statistics, American Statistical Association, vol. 26, pages 398-422.
    28. Angrist, Joshua D & Krueger, Alan B, 1995. "Split-Sample Instrumental Variables Estimates of the Return to Schooling," Journal of Business & Economic Statistics, American Statistical Association, vol. 13(2), pages 225-235, April.
    29. Fuller, Wayne A, 1977. "Some Properties of a Modification of the Limited Information Estimator," Econometrica, Econometric Society, vol. 45(4), pages 939-953, May.
    30. Caner, Mehmet, 2009. "Lasso-Type Gmm Estimator," Econometric Theory, Cambridge University Press, vol. 25(1), pages 270-290, February.
    31. Marcelo J. Moreira, 2003. "A Conditional Likelihood Ratio Test for Structural Models," Econometrica, Econometric Society, vol. 71(4), pages 1027-1048, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Hansen, Christian & Kozbur, Damian, 2014. "Instrumental variables estimation with many weak instruments using regularized JIVE," Journal of Econometrics, Elsevier, vol. 182(2), pages 290-308.
    2. Dennis Lim & Wenjie Wang & Yichong Zhang, 2022. "A Conditional Linear Combination Test with Many Weak Instruments," Papers 2207.11137, arXiv.org, revised Apr 2023.
    3. Lim, Dennis & Wang, Wenjie & Zhang, Yichong, 2024. "A conditional linear combination test with many weak instruments," Journal of Econometrics, Elsevier, vol. 238(2).
    4. Carlos Velasco & Xuexin Wang, 2021. "Instrumental variable estimation via a continuum of instruments with an application to estimating the elasticity of intertemporal substitution in consumption," Working Papers 2024-09-06, Wang Yanan Institute for Studies in Economics (WISE), Xiamen University.
    5. Alastair R. Hall, 2015. "Econometricians Have Their Moments: GMM at 32," The Economic Record, The Economic Society of Australia, vol. 91(S1), pages 1-24, June.
    6. Carrasco, Marine & Tchuente, Guy, 2015. "Regularized LIML for many instruments," Journal of Econometrics, Elsevier, vol. 186(2), pages 427-442.
    7. Kolesár, Michal, 2018. "Minimum distance approach to inference with many instruments," Journal of Econometrics, Elsevier, vol. 204(1), pages 86-100.
    8. Thomas Wiemann, 2023. "Optimal Categorical Instrumental Variables," Papers 2311.17021, arXiv.org, revised May 2024.
    9. Anna Mikusheva & Liyang Sun, 2024. "Weak identification with many instruments," The Econometrics Journal, Royal Economic Society, vol. 27(2), pages -28.
    10. Marcellino, Massimiliano & Kapetanios, George & Khalaf, Lynda, 2015. "Factor based identification-robust inference in IV regressions," CEPR Discussion Papers 10390, C.E.P.R. Discussion Papers.
    11. Murray Michael P., 2017. "Linear Model IV Estimation When Instruments Are Many or Weak," Journal of Econometric Methods, De Gruyter, vol. 6(1), pages 1-22, January.
    12. repec:bla:ecorec:v:91:y:2015:i::p:1-24 is not listed on IDEAS
    13. Andrews, Donald W.K. & Moreira, Marcelo J. & Stock, James H., 2007. "Performance of conditional Wald tests in IV regression with weak instruments," Journal of Econometrics, Elsevier, vol. 139(1), pages 116-132, July.
    14. Matsushita, Yukitoshi & Otsu, Taisuke, 2024. "A jackknife Lagrange multiplier test with many weak instruments," LSE Research Online Documents on Economics 116392, London School of Economics and Political Science, LSE Library.
    15. Isaiah Andrews & Timothy B. Armstrong, 2017. "Unbiased instrumental variables estimation under known first‐stage sign," Quantitative Economics, Econometric Society, vol. 8(2), pages 479-503, July.
    16. Marine Carrasco & Guy Tchuente, 2016. "Efficient Estimation with Many Weak Instruments Using Regularization Techniques," Econometric Reviews, Taylor & Francis Journals, vol. 35(8-10), pages 1609-1637, December.
    17. Michal Kolesár & Raj Chetty & John Friedman & Edward Glaeser & Guido W. Imbens, 2015. "Identification and Inference With Many Invalid Instruments," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 33(4), pages 474-484, October.
    18. Guy Tchuente, 2019. "Weak Identification and Estimation of Social Interaction Models," Papers 1902.06143, arXiv.org.
    19. Wang, Wenjie & Doko Tchatoka, Firmin, 2018. "On Bootstrap inconsistency and Bonferroni-based size-correction for the subset Anderson–Rubin test under conditional homoskedasticity," Journal of Econometrics, Elsevier, vol. 207(1), pages 188-211.
    20. Xu Cheng & Zhipeng Liao, 2012. "Select the Valid and Relevant Moments: A One-Step Procedure for GMM with Many Moments," PIER Working Paper Archive 12-045, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania.
    21. Sølvsten, Mikkel, 2020. "Robust estimation with many instruments," Journal of Econometrics, Elsevier, vol. 214(2), pages 495-512.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:1010.4345. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.