IDEAS home Printed from https://ideas.repec.org/p/iza/izadps/dp16630.html
   My bibliography  Save this paper

(Frisch-Waugh-Lovell)' On the Estimation of Regression Models by Row

Author

Listed:
  • Clarke, Damian

    (University of Chile)

  • Torres, Nicolás Paris

    (University of Chile)

  • Villena-Roldan, Benjamin

    (Diego Portales University)

Abstract

We demonstrate that regression models can be estimated by working independently in a row-wise fashion. We document a simple procedure which allows for a wide class of econometric estimators to be implemented cumulatively, where, in the limit, estimators can be produced without ever storing more than a single line of data in a computer's memory. This result is useful in understanding the mechanics of many common regression models. These procedures can be used to speed up the computation of estimates computed via OLS, IV, Ridge regression, LASSO, Elastic Net, and Non-linear models including probit and logit, with all common modes of inference. This has implications for estimation and inference with 'big data', where memory constraints may imply that working with all data at once is particularly costly. We additionally show that even with moderately sized datasets, this method can reduce computation time compared with traditional estimation routines.

Suggested Citation

  • Clarke, Damian & Torres, Nicolás Paris & Villena-Roldan, Benjamin, 2023. "(Frisch-Waugh-Lovell)' On the Estimation of Regression Models by Row," IZA Discussion Papers 16630, Institute of Labor Economics (IZA).
  • Handle: RePEc:iza:izadps:dp16630
    as

    Download full text from publisher

    File URL: https://docs.iza.org/dp16630.pdf
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Jonah B. Gelbach, 2016. "When Do Covariates Matter? And Which Ones, and How Much?," Journal of Labor Economics, University of Chicago Press, vol. 34(2), pages 509-543.
    2. Paulo Guimarães & Pedro Portugal, 2010. "A simple feasible procedure to fit models with high-dimensional fixed effects," Stata Journal, StataCorp LP, vol. 10(4), pages 628-649, December.
    3. A. Colin Cameron & Jonah B. Gelbach & Douglas L. Miller, 2008. "Bootstrap-Based Improvements for Inference with Clustered Errors," The Review of Economics and Statistics, MIT Press, vol. 90(3), pages 414-427, August.
    4. Abadie, Alberto, 2003. "Semiparametric instrumental variable estimation of treatment response models," Journal of Econometrics, Elsevier, vol. 113(2), pages 231-263, April.
    5. Rosenzweig, Mark R & Wolpin, Kenneth I, 1980. "Testing the Quantity-Quality Fertility Model: The Use of Twins as a Natural Experiment," Econometrica, Econometric Society, vol. 48(1), pages 227-240, January.
    6. David Roodman & James G. MacKinnon & Morten Ørregaard Nielsen & Matthew D. Webb, 2019. "Fast and wild: Bootstrap inference in Stata using boottest," Stata Journal, StataCorp LP, vol. 19(1), pages 4-60, March.
    7. Gary Solon & Steven J. Haider & Jeffrey M. Wooldridge, 2015. "What Are We Weighting For?," Journal of Human Resources, University of Wisconsin Press, vol. 50(2), pages 301-316.
    8. MacKinnon, James G., 2023. "Using large samples in econometrics," Journal of Econometrics, Elsevier, vol. 235(2), pages 922-926.
    9. MacKinnon, James G., 2023. "Fast cluster bootstrap methods for linear regression models," Econometrics and Statistics, Elsevier, vol. 26(C), pages 52-71.
    10. Joshua D. Angrist, 1998. "Estimating the Labor Market Impact of Voluntary Military Service Using Social Security Data on Military Applicants," Econometrica, Econometric Society, vol. 66(2), pages 249-288, March.
    11. Hal R. Varian, 2014. "Big Data: New Tricks for Econometrics," Journal of Economic Perspectives, American Economic Association, vol. 28(2), pages 3-28, Spring.
    12. Alberto Abadie & Susan Athey & Guido W. Imbens & Jeffrey M. Wooldridge, 2020. "Sampling‐Based versus Design‐Based Uncertainty in Regression Analysis," Econometrica, Econometric Society, vol. 88(1), pages 265-296, January.
    13. Gaure, Simen, 2013. "OLS with multiple high dimensional category variables," Computational Statistics & Data Analysis, Elsevier, vol. 66(C), pages 8-18.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sloczynski, Tymon, 2018. "A General Weighted Average Representation of the Ordinary and Two-Stage Least Squares Estimands," IZA Discussion Papers 11866, Institute of Labor Economics (IZA).
    2. Conti, Gabriella & Poupakis, Stavros & Ekamper, Peter & Bijwaard, Govert E. & Lumey, L.H., 2024. "Severe prenatal shocks and adolescent health: Evidence from the Dutch Hunger Winter," Economics & Human Biology, Elsevier, vol. 53(C).
    3. Peter Hull & Michal Kolesár & Christopher Walters, 2022. "Labor by design: contributions of David Card, Joshua Angrist, and Guido Imbens," Scandinavian Journal of Economics, Wiley Blackwell, vol. 124(3), pages 603-645, July.
    4. David Roodman, 2024. "Julia as a universal platform for statistical software development," Papers 2404.09309, arXiv.org, revised Aug 2024.
    5. Wang, Wenjie & Zhang, Yichong, 2024. "Wild bootstrap inference for instrumental variables regressions with weak and few clusters," Journal of Econometrics, Elsevier, vol. 241(1).
    6. Tymon S{l}oczy'nski, 2018. "Interpreting OLS Estimands When Treatment Effects Are Heterogeneous: Smaller Groups Get Larger Weights," Papers 1810.01576, arXiv.org, revised May 2020.
    7. Bahar, Dany & Ibáñez, Ana María & Rozo, Sandra V., 2021. "Give me your tired and your poor: Impact of a large-scale amnesty program for undocumented refugees," Journal of Development Economics, Elsevier, vol. 151(C).
    8. James G. MacKinnon & Morten {O}rregaard Nielsen & Matthew D. Webb, 2024. "Cluster-robust jackknife and bootstrap inference for binary response models," Papers 2406.00650, arXiv.org.
    9. MacKinnon, James G. & Nielsen, Morten Ørregaard & Webb, Matthew D., 2023. "Cluster-robust inference: A guide to empirical practice," Journal of Econometrics, Elsevier, vol. 232(2), pages 272-299.
    10. Tymon Słoczyński, 2022. "Interpreting OLS Estimands When Treatment Effects Are Heterogeneous: Smaller Groups Get Larger Weights," The Review of Economics and Statistics, MIT Press, vol. 104(3), pages 501-509, May.
    11. Peter Hull, 2018. "Estimating Treatment Effects in Mover Designs," Papers 1804.06721, arXiv.org.
    12. Jeffrey D. Michler & Anna Josephson, 2022. "Recent developments in inference: practicalities for applied economics," Chapters, in: A Modern Guide to Food Economics, chapter 11, pages 235-268, Edward Elgar Publishing.
    13. Tyler Smith & Tom Coupé, 2020. "The Mental Health Cost of Terrorism: A Replication of Kim and Albert Kim (Health Economics, 2018)," Working Papers in Economics 20/21, University of Canterbury, Department of Economics and Finance.
    14. Yichen Shen & Rong Fu & Haruko Noguchi, 2021. "COVID‐19's Lockdown and Crime Victimization: The State of Emergency under the Abe Administration," Asian Economic Policy Review, Japan Center for Economic Research, vol. 16(2), pages 327-348, July.
    15. Matthias Sutter & Michael Weyland & Anna Untertrifaller & Manuel Froitzheim & Sebastian O. Schneider, 2020. "Financial literacy, risk and time preferences – Results from a randomized educational intervention," Discussion Paper Series of the Max Planck Institute for Research on Collective Goods 2023_03, Max Planck Institute for Research on Collective Goods.
    16. Matthias Sutter & Michael Weyland & Anna Untertrifaller & Manuel Froitzheim & Sebastian O. Schneider, 2023. "Financial Literacy, Experimental Preference Measures and Field Behavior – A Randomized Educational Intervention," ECONtribute Discussion Papers Series 229, University of Bonn and University of Cologne, Germany.
    17. Wissmann, Daniel, 2020. "Finally a Smoking Gun," Discussion Papers in Economics 73026, University of Munich, Department of Economics.
    18. Ellis, Jimmy R. & Gershenson, Seth, 2016. "LATE for the Meeting: Gender, Peer Advising, and College Success," IZA Discussion Papers 9956, Institute of Labor Economics (IZA).
    19. Gric, Zuzana & Ehrenbergerova, Dominika & Hodula, Martin, 2022. "The power of sentiment: Irrational beliefs of households and consumer loan dynamics," Journal of Financial Stability, Elsevier, vol. 59(C).
    20. James G. MacKinnon & Morten Ørregaard Nielsen & Matthew D. Webb, 2023. "Fast and reliable jackknife and bootstrap methods for cluster‐robust inference," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 38(5), pages 671-694, August.

    More about this item

    Keywords

    big data; estimation; regression; matrix inversion;
    All these keywords.

    JEL classification:

    • C55 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Large Data Sets: Modeling and Analysis
    • C61 - Mathematical and Quantitative Methods - - Mathematical Methods; Programming Models; Mathematical and Simulation Modeling - - - Optimization Techniques; Programming Models; Dynamic Analysis
    • C87 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs - - - Econometric Software

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:iza:izadps:dp16630. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Holger Hinte (email available below). General contact details of provider: https://edirc.repec.org/data/izaaade.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.