IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2203.11820.html
   My bibliography  Save this paper

Dealing with Logs and Zeros in Regression Models

Author

Listed:
  • Christophe Bell'ego
  • David Benatia
  • Louis Pape

Abstract

Log-linear models are prevalent in empirical research. Yet, how to handle zeros in the dependent variable remains an unsettled issue. This article clarifies it and addresses the log of zero by developing a new family of estimators called iterated Ordinary Least Squares (iOLS). This family nests standard approaches such as log-linear and Poisson regressions, offers several computational advantages, and corresponds to the correct way to perform the popular $\log(Y+1)$ transformation. We extend it to the endogenous regressor setting (i2SLS) and overcome other common issues with Poisson models, such as controlling for many fixed-effects. We also develop specification tests to help researchers select between alternative estimators. Finally, our methods are illustrated through numerical simulations and replications of landmark publications.

Suggested Citation

  • Christophe Bell'ego & David Benatia & Louis Pape, 2022. "Dealing with Logs and Zeros in Regression Models," Papers 2203.11820, arXiv.org.
  • Handle: RePEc:arx:papers:2203.11820
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2203.11820
    File Function: Latest version
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Silva João M. C. Santos & Tenreyro Silvana & Windmeijer Frank, 2015. "Testing Competing Models for Non-negative Data with Many Zeros," Journal of Econometric Methods, De Gruyter, vol. 4(1), pages 29-46, January.
    2. Fernández-Val, Iván & Weidner, Martin, 2016. "Individual and time effects in nonlinear panel models with large N, T," Journal of Econometrics, Elsevier, vol. 192(1), pages 291-312.
    3. Davidson, Russell & MacKinnon, James G, 1981. "Several Tests for Model Specification in the Presence of Alternative Hypotheses," Econometrica, Econometric Society, vol. 49(3), pages 781-793, May.
    4. Ravallion, Martin, 2017. "A concave log-like transformation allowing non-positive values," Economics Letters, Elsevier, vol. 161(C), pages 130-132.
    5. Eaton Jonathan & Tamura Akiko, 1994. "Bilateralism and Regionalism in Japanese and U.S. Trade and Direct Foreign Investment Patterns," Journal of the Japanese and International Economies, Elsevier, vol. 8(4), pages 478-510, December.
    6. John Mullahy, 1997. "Instrumental-Variable Estimation Of Count Data Models: Applications To Models Of Cigarette Smoking Behavior," The Review of Economics and Statistics, MIT Press, vol. 79(4), pages 586-593, November.
    7. Dominitz, Jeff & Sherman, Robert P., 2005. "Some Convergence Theory For Iterative Estimation Procedures With An Application To Semiparametric Estimation," Econometric Theory, Cambridge University Press, vol. 21(4), pages 838-863, August.
    8. Manning, Willard G. & Mullahy, John, 2001. "Estimating log models: to transform or not to transform?," Journal of Health Economics, Elsevier, vol. 20(4), pages 461-494, July.
    9. Rainer Winkelmann, 2008. "Econometric Analysis of Count Data," Springer Books, Springer, edition 0, number 978-3-540-78389-3, February.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Congiu, Raffaele & Sabatino, Lorien & Sapi, Geza, 2022. "The Impact of Privacy Regulation on Web Traffic: Evidence From the GDPR," Information Economics and Policy, Elsevier, vol. 61(C).
    2. Mengzhen Wang & Xingong Ding & Baekryul Choi, 2023. "FDI or International-Trade-Driven Green Growth of 24 Korean Manufacturing Industries? Evidence from Heterogeneous Panel Based on Non-Causality Test," Sustainability, MDPI, vol. 15(7), pages 1-20, March.
    3. Gustave Kenedi & Louis Sirugue, 2021. "The Anatomy of Intergenerational Income Mobility in France and its Spatial Variations," Working Papers halshs-03455282, HAL.
    4. Carlos Alberto Piscarreta Pinto Ferreira, 2023. "Drivers of Sovereign Bond Demand – The Case of Japans," Working Papers REM 2023/0264, ISEG - Lisbon School of Economics and Management, REM, Universidade de Lisboa.
    5. Glediana Zeneli (Foto) & Arsen Benga & Altin Hoti, 2024. "Analysis of Albania’s Trade Direction: Is the Open Balkan a New Center of Gravity?," Economies, MDPI, vol. 12(7), pages 1-30, July.
    6. Reda Cherif & Christoph Grimpe & Fuad Hasanov & Wolfgang Sofka, 2023. "Promoting Innovation: The Differential Impact of R&D Subsidies," Journal of Industry, Competition and Trade, Springer, vol. 23(3), pages 187-241, December.
    7. Gustave Kenedi & Louis Sirugue, 2021. "The Anatomy of Intergenerational Income Mobility in France and its Spatial Variations," PSE Working Papers halshs-03455282, HAL.
    8. Qiao Wen, 2022. "Estimating Education and Labor Market Consequences of China’s Higher Education Expansion," Sustainability, MDPI, vol. 14(13), pages 1-25, June.
    9. Cosimo Beverelli & Rohit Ticku, 2023. "Global Livestock Trade and Infectious Diseases," RSCAS Working Papers 2023/09, European University Institute.
    10. Wang, Kunlun & Zheng, Leven J. & Lin, Boqiang, 2024. "Demand-side incentives, competition, and firms’ innovative activities: Evidence from automobile industry in China," Energy Economics, Elsevier, vol. 132(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. J. M. C. Santos Silva & Silvana Tenreyro, 2022. "The Log of Gravity at 15," Portuguese Economic Journal, Springer;Instituto Superior de Economia e Gestao, vol. 21(3), pages 423-437, September.
    2. Elina Bryngemark & Patrik Söderholm, 2022. "Green industrial policies and domestic production of biofuels: an econometric analysis of OECD countries," Environmental Economics and Policy Studies, Springer;Society for Environmental Economics and Policy Studies - SEEPS, vol. 24(2), pages 225-261, April.
    3. Santos Silva, J.M.C. & Tenreyro, Silvana, 2011. "Further simulation evidence on the performance of the Poisson pseudo-maximum likelihood estimator," Economics Letters, Elsevier, vol. 112(2), pages 220-222, August.
    4. Theresa M. Greaney & Kozo Kiyota, 2020. "Japan’s Outward FDI Potential," Working Papers 202005, University of Hawaii at Manoa, Department of Economics.
    5. Greaney, Theresa M. & Kiyota, Kozo, 2020. "Japan's outward FDI potential," Journal of the Japanese and International Economies, Elsevier, vol. 57(C).
    6. Mariam Camarero & Laura Montolio & Cecilio Tamarit, 2022. "Explaining German outward FDI in the EU: a reassessment using Bayesian model averaging and GLM estimators," Empirical Economics, Springer, vol. 62(2), pages 487-511, February.
    7. Weidner, Martin & Zylkin, Thomas, 2021. "Bias and consistency in three-way gravity models," Journal of International Economics, Elsevier, vol. 132(C).
    8. Silva João M. C. Santos & Tenreyro Silvana & Windmeijer Frank, 2015. "Testing Competing Models for Non-negative Data with Many Zeros," Journal of Econometric Methods, De Gruyter, vol. 4(1), pages 29-46, January.
    9. Kalle Hirvonen & John Hoddinott, 2017. "Agricultural production and children's diets: evidence from rural Ethiopia," Agricultural Economics, International Association of Agricultural Economists, vol. 48(4), pages 469-480, July.
    10. Börner, Lars & Severgnini, Battista, 2011. "Epidemic trade," Discussion Papers 2011/12, Free University Berlin, School of Business & Economics.
    11. Kevin E. Staub, 2009. "Simple tests for exogeneity of a binary explanatory variable in count data regression models," SOI - Working Papers 0904, Socioeconomic Institute - University of Zurich.
    12. Santos Silva, Joao & Tenreyro, Silvana, 2009. "On the existence of the maximum likelihood estimates for Poisson regression," LSE Research Online Documents on Economics 25504, London School of Economics and Political Science, LSE Library.
    13. Head, Keith & Mayer, Thierry, 2014. "Gravity Equations: Workhorse,Toolkit, and Cookbook," Handbook of International Economics, in: Gopinath, G. & Helpman, . & Rogoff, K. (ed.), Handbook of International Economics, edition 1, volume 4, chapter 0, pages 131-195, Elsevier.
    14. Mariam Camarero & Sergi Moliner & Cecilio Tamarit, 2021. "Is there a euro effect in the drivers of US FDI? New evidence using Bayesian model averaging techniques," Review of World Economics (Weltwirtschaftliches Archiv), Springer;Institut für Weltwirtschaft (Kiel Institute for the World Economy), vol. 157(4), pages 881-926, November.
    15. Santos Silva, J.M.C. & Tenreyro, Silvana, 2010. "On the existence of the maximum likelihood estimates in Poisson regression," Economics Letters, Elsevier, vol. 107(2), pages 310-312, May.
    16. Hoshi, Takeo & Kiyota, Kozo, 2019. "Potential for inward foreign direct investment in Japan," Journal of the Japanese and International Economies, Elsevier, vol. 52(C), pages 32-52.
    17. Juliana D. Araujo & Povilas Lastauskas & Chris Papageorgiou, 2017. "Evolution of Bilateral Capital Flows to Developing Countries at Intensive and Extensive Margins," Journal of Money, Credit and Banking, Blackwell Publishing, vol. 49(7), pages 1517-1554, October.
    18. repec:hal:spmain:info:hdl:2441/75dbbb2hc596np6q8flqf6i79k is not listed on IDEAS
    19. J. M. C. Santos Silva & Silvana Tenreyro, 2011. "poisson: Some convergence issues," Stata Journal, StataCorp LP, vol. 11(2), pages 215-225, June.
    20. Hinz, Julian & Stammann, Amrei & Wanner, Joschka, 2019. "Persistent zeros: The extensive margin of trade," Kiel Working Papers 2139, Kiel Institute for the World Economy (IfW Kiel).
    21. Hiroaki Kaido & Kaspar Wüthrich, 2021. "Decentralization estimators for instrumental variable quantile regression models," Quantitative Economics, Econometric Society, vol. 12(2), pages 443-475, May.

    More about this item

    JEL classification:

    • C26 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Instrumental Variables (IV) Estimation
    • C52 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Model Evaluation, Validation, and Selection
    • C55 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Large Data Sets: Modeling and Analysis

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2203.11820. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.