IDEAS home Printed from https://ideas.repec.org/p/hal/journl/hal-00815561.html
   My bibliography  Save this paper

Regression with imputed covariates: A generalized missing-indicator approach

Author

Listed:
  • Valentino Dardanoni

    (Università degli studi di Palermo - University of Palermo)

  • Salvatore Modica

    (Università degli studi di Palermo - University of Palermo)

  • Franco Peracchi

    (Università degli Studi di Roma Tor Vergata [Roma, Italia] = University of Rome Tor Vergata [Rome, Italy] = Université de Rome Tor Vergata [Rome, Italie])

Abstract

A common problem in applied regression analysis is that covariate values may be missing for some observations but imputed values may be available. This situation generates a trade-off between bias and precision: the complete cases are often disarmingly few, but replacing the missing observations with the imputed values to gain precision may lead to bias. In this paper we formalize this trade-off by showing that one can augment the regression model with a set of auxiliary variables so as to obtain, under weak assumptions about the imputations, the same unbiased estimator of the parameters of interest as complete-case analysis. Given this augmented model, the bias-precision trade-off may then be tackled by either model reduction procedures or model averaging methods. We illustrate our approach by considering the problem of estimating the relation between income and the body mass index (BMI) using survey data affected by item non-response, where the missing values on the main covariates are filled in by imputations.

Suggested Citation

  • Valentino Dardanoni & Salvatore Modica & Franco Peracchi, 2011. "Regression with imputed covariates: A generalized missing-indicator approach," Post-Print hal-00815561, HAL.
  • Handle: RePEc:hal:journl:hal-00815561
    DOI: 10.1016/j.jeconom.2011.02.005
    Note: View the original document on HAL open archive server: https://hal.science/hal-00815561
    as

    Download full text from publisher

    File URL: https://hal.science/hal-00815561/document
    Download Restriction: no

    File URL: https://libkey.io/10.1016/j.jeconom.2011.02.005?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Horton, Nicholas J. & Kleinman, Ken P., 2007. "Much Ado About Nothing: A Comparison of Missing Data Methods and Software to Fit Incomplete Data Regression Models," The American Statistician, American Statistical Association, vol. 61, pages 79-90, February.
    2. García Villar, Jaume & Quintana-Domeque, Climent, 2009. "Income and body mass index in Europe," Economics & Human Biology, Elsevier, vol. 7(1), pages 73-83, March.
    3. Magnus, J.R. & Powell, O.R. & Prüfer, P., 2008. "A Comparison of Two Averaging Techniques with an Application to Growth Empirics," Other publications TiSEM 0392dffa-51e0-4bc9-9644-f, Tilburg University, School of Economics and Management.
    4. David M. Cutler & Edward L. Glaeser & Jesse M. Shapiro, 2003. "Why Have Americans Become More Obese?," Journal of Economic Perspectives, American Economic Association, vol. 17(3), pages 93-118, Summer.
    5. John Cawley & John Moran & Kosali Simon, 2010. "The impact of income on the weight of elderly Americans," Health Economics, John Wiley & Sons, Ltd., vol. 19(8), pages 979-993, August.
    6. Anna Sanz De Galdeano, 2005. "The Obesity Epidemic in Europe," CSEF Working Papers 143, Centre for Studies in Economics and Finance (CSEF), University of Naples, Italy.
    7. Magnus, Jan R. & Powell, Owen & Prüfer, Patricia, 2010. "A comparison of two model averaging techniques with an application to growth empirics," Journal of Econometrics, Elsevier, vol. 154(2), pages 139-153, February.
    8. Tomas Philipson & Richard Posner, 2008. "Is the Obesity Epidemic a Public Health Problem? A Decade of Research on the Economics of Obesity," NBER Working Papers 14010, National Bureau of Economic Research, Inc.
    9. Jan R. Magnus & J. Durbin, 1999. "Estimation of Regression Coefficients of Interest When Other Regression Coefficients Are of No Interest," Econometrica, Econometric Society, vol. 67(3), pages 639-644, May.
    10. Xavier Sala-I-Martin & Gernot Doppelhofer & Ronald I. Miller, 2004. "Determinants of Long-Term Growth: A Bayesian Averaging of Classical Estimates (BACE) Approach," American Economic Review, American Economic Association, vol. 94(4), pages 813-835, September.
    11. Danilov, Dmitry & Magnus, J.R.Jan R., 2004. "On the harm that ignoring pretesting can cause," Journal of Econometrics, Elsevier, vol. 122(1), pages 27-46, September.
    12. Julia Campos & Neil R. Ericsson & David F. Hendry, 2005. "General-to-specific modeling: an overview and selected bibliography," International Finance Discussion Papers 838, Board of Governors of the Federal Reserve System (U.S.).
    Full references (including those not matched with items on IDEAS)

    Citations

    Blog mentions

    As found by EconAcademics.org, the blog aggregator for Economics research:
    1. Gli esperti di valutazione all’italiana
      by Alberto Baccini in ROARS - Return on Academic Research on 2011-12-16 21:45:50

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Sophia Rabe-Hesketh & Anders Skrondal, 2023. "Ignoring Non-ignorable Missingness," Psychometrika, Springer;The Psychometric Society, vol. 88(1), pages 31-50, March.
    2. Chris Muris, 2020. "Efficient GMM Estimation with Incomplete Data," The Review of Economics and Statistics, MIT Press, vol. 102(3), pages 518-530, July.
    3. Giuseppe Luca & Jan R. Magnus & Franco Peracchi, 2023. "Weighted-Average Least Squares (WALS): Confidence and Prediction Intervals," Computational Economics, Springer;Society for Computational Economics, vol. 61(4), pages 1637-1664, April.
    4. Djavad Salehi-Isfahani & Nadia Hassine & Ragui Assaad, 2014. "Equality of opportunity in educational achievement in the Middle East and North Africa," The Journal of Economic Inequality, Springer;Society for the Study of Economic Inequality, vol. 12(4), pages 489-515, December.
    5. McDonough, Ian K. & Millimet, Daniel L., 2017. "Missing data, imputation, and endogeneity," Journal of Econometrics, Elsevier, vol. 199(2), pages 141-155.
    6. Aedın Doris & Donal O’Neill & Olive Sweetman, 2011. "GMM estimation of the covariance structure of longitudinal data on earnings," Stata Journal, StataCorp LP, vol. 11(3), pages 439-459, September.
    7. Giuseppe De Luca & Jan R. Magnus, 2011. "Bayesian model averaging and weighted-average least squares: Equivariance, stability, and numerical issues," Stata Journal, StataCorp LP, vol. 11(4), pages 518-544, December.
    8. Giuseppe De Luca & Jan Magnus & Franco Peracchi, 2022. "Asymptotic properties of the weighted average least squares (WALS) estimator," Tinbergen Institute Discussion Papers 22-022/III, Tinbergen Institute.
    9. Valentino Dardanoni & Giuseppe De Luca & Salvatore Modica & Franco Peracchi, 2012. "A generalized missing-indicator approach to regression with imputed covariates," Stata Journal, StataCorp LP, vol. 12(4), pages 575-604, December.
    10. Valentino Dardanoni & Giuseppe De Luca & Salvatore Modica & Franco Peracchi, 2013. "Bayesian Model Averaging for Generalized Linear Models with Missing Covariates," EIEF Working Papers Series 1311, Einaudi Institute for Economics and Finance (EIEF), revised May 2013.
    11. Laszlo Goerke & Sabrina Jeworrek & Markus Pannenberg, 2015. "Trade union membership and paid vacation in Germany," IZA Journal of Labor Economics, Springer;Forschungsinstitut zur Zukunft der Arbeit GmbH (IZA), vol. 4(1), pages 1-26, December.
    12. Zhang, Xinyu, 2013. "Model averaging with covariates that are missing completely at random," Economics Letters, Elsevier, vol. 121(3), pages 360-363.
    13. Yuan, Chaoxia & Fang, Fang & Ni, Lyu, 2022. "Mallows model averaging with effective model size in fragmentary data prediction," Computational Statistics & Data Analysis, Elsevier, vol. 173(C).
    14. World Bank, 2015. "Tanzania Poverty Assessment," World Bank Publications - Reports 21871, The World Bank Group.
    15. Dardanoni, Valentino & De Luca, Giuseppe & Modica, Salvatore & Peracchi, Franco, 2015. "Model averaging estimation of generalized linear models with imputed covariates," Journal of Econometrics, Elsevier, vol. 184(2), pages 452-463.
    16. Francesco Bartolucci & Fulvia Pennoni & Giorgio Vittadini, 2023. "A Causal Latent Transition Model With Multivariate Outcomes and Unobserved Heterogeneity: Application to Human Capital Development," Journal of Educational and Behavioral Statistics, , vol. 48(4), pages 387-419, August.
    17. Jiti Gao & Bin Peng & Zhao Ren & Xiaohui Zhang, 2015. "Variable Selection for a Categorical Varying-Coefficient Model with Identifications for Determinants of Body Mass Index," Monash Econometrics and Business Statistics Working Papers 21/15, Monash University, Department of Econometrics and Business Statistics.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. repec:hal:journl:peer-00815561 is not listed on IDEAS
    2. Judith Anne Clarke, 2017. "Model Averaging OLS and 2SLS: An Application of the WALS Procedure," Econometrics Working Papers 1701, Department of Economics, University of Victoria.
    3. García Villar, Jaume & Quintana-Domeque, Climent, 2009. "Income and body mass index in Europe," Economics & Human Biology, Elsevier, vol. 7(1), pages 73-83, March.
    4. De Luca, Giuseppe & Magnus, Jan R. & Peracchi, Franco, 2018. "Weighted-average least squares estimation of generalized linear models," Journal of Econometrics, Elsevier, vol. 204(1), pages 1-17.
    5. Aiyar, Shekhar & Duval, Romain & Puy, Damien & Wu, Yiqun & Zhang, Longmei, 2018. "Growth slowdowns and the middle-income trap," Japan and the World Economy, Elsevier, vol. 48(C), pages 22-37.
    6. Salmasi, Luca & Celidoni, Martina, 2017. "Investigating the poverty-obesity paradox in Europe," Economics & Human Biology, Elsevier, vol. 26(C), pages 70-85.
    7. Dardanoni, Valentino & De Luca, Giuseppe & Modica, Salvatore & Peracchi, Franco, 2015. "Model averaging estimation of generalized linear models with imputed covariates," Journal of Econometrics, Elsevier, vol. 184(2), pages 452-463.
    8. Tumala, Mohammed M & Olubusoye, Olusanya E & Yaaba, Baba N & Yaya, OlaOluwa S & Akanbi, Olawale B, 2017. "Investigating Predictors of Inflation in Nigeria: BMA and WALS Techniques," MPRA Paper 88773, University Library of Munich, Germany, revised Feb 2018.
    9. Poghosyan, K., 2012. "Structural and reduced-form modeling and forecasting with application to Armenia," Other publications TiSEM ad1a24c3-15e6-4f04-b338-3, Tilburg University, School of Economics and Management.
    10. Jan R. Magnus & Wendun Wang & Xinyu Zhang, 2016. "Weighted-Average Least Squares Prediction," Econometric Reviews, Taylor & Francis Journals, vol. 35(6), pages 1040-1074, June.
    11. Giuseppe De Luca & Jan R. Magnus, 2011. "Bayesian model averaging and weighted-average least squares: Equivariance, stability, and numerical issues," Stata Journal, StataCorp LP, vol. 11(4), pages 518-544, December.
    12. Valentino Dardanoni & Giuseppe De Luca & Salvatore Modica & Franco Peracchi, 2013. "Bayesian Model Averaging for Generalized Linear Models with Missing Covariates," EIEF Working Papers Series 1311, Einaudi Institute for Economics and Finance (EIEF), revised May 2013.
    13. Aedın Doris & Donal O’Neill & Olive Sweetman, 2011. "GMM estimation of the covariance structure of longitudinal data on earnings," Stata Journal, StataCorp LP, vol. 11(3), pages 439-459, September.
    14. Becker William & Paruolo Paolo & Saltelli Andrea, 2021. "Variable Selection in Regression Models Using Global Sensitivity Analysis," Journal of Time Series Econometrics, De Gruyter, vol. 13(2), pages 187-233, July.
    15. Mark F. J. Steel, 2020. "Model Averaging and Its Use in Economics," Journal of Economic Literature, American Economic Association, vol. 58(3), pages 644-719, September.
    16. Tafreschi, Darjusch, 2015. "The income body weight gradients in the developing economy of China," Economics & Human Biology, Elsevier, vol. 16(C), pages 115-134.
    17. Valentino Dardanoni & Giuseppe De Luca & Salvatore Modica & Franco Peracchi, 2012. "A generalized missing-indicator approach to regression with imputed covariates," Stata Journal, StataCorp LP, vol. 12(4), pages 575-604, December.
    18. Karen Poghosyan & Jan R. Magnus, 2012. "WALS Estimation and Forecasting in Factor-based Dynamic Models with an Application to Armenia," International Econometric Review (IER), Econometric Research Association, vol. 4(1), pages 40-58, April.
    19. Enrique Moral‐Benito, 2016. "Growth Empirics in Panel Data Under Model Uncertainty and Weak Exogeneity," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 31(3), pages 584-602, April.
    20. Giuseppe Luca & Jan R. Magnus & Franco Peracchi, 2023. "Weighted-Average Least Squares (WALS): Confidence and Prediction Intervals," Computational Economics, Springer;Society for Computational Economics, vol. 61(4), pages 1637-1664, April.
    21. Liu, Chu-An, 2015. "Distribution theory of the least squares averaging estimator," Journal of Econometrics, Elsevier, vol. 186(1), pages 142-159.

    More about this item

    Keywords

    C12; C13; C19; Missing covariates; Imputations; Bias-precision trade-off; Model reduction; Model averaging; BMI and income;
    All these keywords.

    JEL classification:

    • C12 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Hypothesis Testing: General
    • C13 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Estimation: General
    • C19 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Other

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hal:journl:hal-00815561. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: CCSD (email available below). General contact details of provider: https://hal.archives-ouvertes.fr/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.