IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2204.05175.html
   My bibliography  Save this paper

Partially Linear Models under Data Combination

Author

Listed:
  • Xavier D'Haultf{oe}uille
  • Christophe Gaillac
  • Arnaud Maurel

Abstract

We study partially linear models when the outcome of interest and some of the covariates are observed in two different datasets that cannot be linked. This type of data combination problem arises very frequently in empirical microeconomics. Using recent tools from optimal transport theory, we derive a constructive characterization of the sharp identified set. We then build on this result and develop a novel inference method that exploits the specific geometric properties of the identified set. Our method exhibits good performances in finite samples, while remaining very tractable. We apply our approach to study intergenerational income mobility over the period 1850-1930 in the United States. Our method allows us to relax the exclusion restrictions used in earlier work, while delivering confidence regions that are informative.

Suggested Citation

  • Xavier D'Haultf{oe}uille & Christophe Gaillac & Arnaud Maurel, 2022. "Partially Linear Models under Data Combination," Papers 2204.05175, arXiv.org, revised Aug 2023.
  • Handle: RePEc:arx:papers:2204.05175
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2204.05175
    File Function: Latest version
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Alfred Galichon, 2016. "Optimal Transport Methods in Economics," Economics Books, Princeton University Press, edition 1, number 10870.
    2. Victor Chernozhukov & Sokbae Lee & Adam M. Rosen, 2013. "Intersection Bounds: Estimation and Inference," Econometrica, Econometric Society, vol. 81(2), pages 667-737, March.
    3. Joseph G. Altonji & Todd E. Elder & Christopher R. Taber, 2005. "Selection on Observed and Unobserved Variables: Assessing the Effectiveness of Catholic Schools," Journal of Political Economy, University of Chicago Press, vol. 113(1), pages 151-184, February.
    4. Davillas, Apostolos & Pudney, Stephen, 2020. "Using biomarkers to predict healthcare costs: Evidence from a UK household panel," Journal of Health Economics, Elsevier, vol. 73(C).
    5. Robinson, Peter M, 1988. "Root- N-Consistent Semiparametric Regression," Econometrica, Econometric Society, vol. 56(4), pages 931-954, July.
    6. Susan Athey & Raj Chetty & Guido Imbens, 2020. "Combining Experimental and Observational Data to Estimate Treatment Effects on Long Term Outcomes," Papers 2006.09676, arXiv.org.
    7. Yanqin Fan & Robert Sherman & Matthew Shum, 2014. "Identifying Treatment Effects Under Data Combination," Econometrica, Econometric Society, vol. 82(2), pages 811-822, March.
    8. Christian Bontemps & Thierry Magnac, 2017. "Set Identification, Moment Restrictions, and Inference," Annual Review of Economics, Annual Reviews, vol. 9(1), pages 103-129, September.
    9. Ridder, Geert & Moffitt, Robert, 2007. "The Econometrics of Data Combination," Handbook of Econometrics, in: J.J. Heckman & E.E. Leamer (ed.), Handbook of Econometrics, edition 1, volume 6, chapter 75, Elsevier.
    10. Andrews, Donald W.K. & Shi, Xiaoxia, 2017. "Inference based on many conditional moment inequalities," Journal of Econometrics, Elsevier, vol. 196(2), pages 275-287.
    11. Sundaram,Rangarajan K., 1996. "A First Course in Optimization Theory," Cambridge Books, Cambridge University Press, number 9780521497190, September.
    12. Bhattacharya, Debopam, 2013. "Evaluating treatment protocols using data combination," Journal of Econometrics, Elsevier, vol. 173(2), pages 160-174.
    13. David Pacini, 2019. "Two-sample least squares projection," Econometric Reviews, Taylor & Francis Journals, vol. 38(1), pages 95-123, January.
    14. Xavier D'Haultfoeuille & Christophe Gaillac & Arnaud Maurel, 2021. "Rationalizing rational expectations: Characterizations and tests," Quantitative Economics, Econometric Society, vol. 12(3), pages 817-842, July.
    15. repec:hal:spmain:info:hdl:2441/5rkqqmvrn4tl22s9mc4ao8ocg is not listed on IDEAS
    16. Rémi Piatek & Pia Pinger, 2016. "Maintaining (Locus of) Control? Data Combination for the Identification and Inference of Factor Structure Models," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 31(4), pages 734-755, June.
    17. Claudia Olivetti & M. Daniele Paserman, 2015. "In the Name of the Son (and the Daughter): Intergenerational Mobility in the United States, 1850-1940," American Economic Review, American Economic Association, vol. 105(8), pages 2695-2724, August.
    18. Alfred Galichon, 2016. "Optimal transport methods in economics," Post-Print hal-03256830, HAL.
    19. Fan Yanqin & Sherman Robert & Shum Matthew, 2016. "Estimation and Inference in an Ecological Inference Model," Journal of Econometric Methods, De Gruyter, vol. 5(1), pages 17-48, January.
    20. Alfred Galichon & Marc Henry, 2011. "Set Identification in Models with Multiple Equilibria," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 78(4), pages 1264-1298.
    21. Francesca Molinari, 2020. "Microeconometrics with Partial Identification," Papers 2004.11751, arXiv.org.
    22. Buchinsky, Moshe & Li, Fanghua & Liao, Zhipeng, 2022. "Estimation and inference of semiparametric models using data from several sources," Journal of Econometrics, Elsevier, vol. 226(1), pages 80-103.
    23. Christian Bontemps & Thierry Magnac, 2017. "Set Identification, Moment Restrictions, and Inference," Post-Print hal-02137347, HAL.
    24. Hong, Han & Li, Jessie, 2018. "The numerical delta method," Journal of Econometrics, Elsevier, vol. 206(2), pages 379-394.
    25. Sundaram,Rangarajan K., 1996. "A First Course in Optimization Theory," Cambridge Books, Cambridge University Press, number 9780521497701, September.
    26. Vaart,A. W. van der, 2000. "Asymptotic Statistics," Cambridge Books, Cambridge University Press, number 9780521784504, September.
    27. Zheng Fang & Andres Santos, 2019. "Inference on Directionally Differentiable Functions," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 86(1), pages 377-412.
    28. repec:hal:wpspec:info:hdl:2441/5rkqqmvrn4tl22s9mc4ao8ocg is not listed on IDEAS
    29. Pollard, David, 1991. "Asymptotics for Least Absolute Deviation Regression Estimators," Econometric Theory, Cambridge University Press, vol. 7(2), pages 186-199, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xavier D'Haultfoeuille & Christophe Gaillac & Arnaud Maurel, 2021. "Rationalizing rational expectations: Characterizations and tests," Quantitative Economics, Econometric Society, vol. 12(3), pages 817-842, July.
    2. Francesca Molinari, 2020. "Microeconometrics with Partial Identi?cation," CeMMAP working papers CWP15/20, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    3. Firpo, Sergio & Galvao, Antonio F. & Parker, Thomas, 2023. "Uniform inference for value functions," Journal of Econometrics, Elsevier, vol. 235(2), pages 1680-1699.
    4. Francesca Molinari, 2019. "Econometrics with Partial Identification," CeMMAP working papers CWP25/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
    5. Xavier D'Haultfoeuille & Christophe Gaillac & Arnaud Maurel, 2018. "Rationalizing Rational Expectations? Tests and Deviations," NBER Working Papers 25274, National Bureau of Economic Research, Inc.
    6. Sarah Moon, 2024. "Partial Identification of Individual-Level Parameters Using Aggregate Data in a Nonparametric Model," Papers 2403.07236, arXiv.org, revised May 2024.
    7. Matthew A. Masten & Alexandre Poirier, 2021. "Salvaging Falsified Instrumental Variable Models," Econometrica, Econometric Society, vol. 89(3), pages 1449-1469, May.
    8. Santiago Acerenza & Otávio Bartalotti & Désiré Kédagni, 2023. "Testing identifying assumptions in bivariate probit models," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 38(3), pages 407-422, April.
    9. Pablo Lavado & Gonzalo Rivera, 2015. "Identifying treatment effects and counterfactual distributions using data combination with unobserved heterogeneity," Working Papers 15-14, Centro de Investigación, Universidad del Pacífico.
    10. Matthew A. Masten & Alexandre Poirier, 2020. "Inference on breakdown frontiers," Quantitative Economics, Econometric Society, vol. 11(1), pages 41-111, January.
    11. Bontemps, Christian & Kumar, Rohit, 2020. "A geometric approach to inference in set-identified entry games," Journal of Econometrics, Elsevier, vol. 218(2), pages 373-389.
    12. Semenova, Vira, 2023. "Debiased machine learning of set-identified linear models," Journal of Econometrics, Elsevier, vol. 235(2), pages 1725-1746.
    13. Shuowen Chen & Hiroaki Kaido, 2022. "Robust Tests of Model Incompleteness in the Presence of Nuisance Parameters," Papers 2208.11281, arXiv.org, revised Sep 2023.
    14. Callaway, Brantly, 2021. "Bounds on distributional treatment effect parameters using panel data with an application on job displacement," Journal of Econometrics, Elsevier, vol. 222(2), pages 861-881.
    15. Magnac, Thierry, 2013. "Identification partielle : méthodes et conséquences pour les applications empiriques," L'Actualité Economique, Société Canadienne de Science Economique, vol. 89(4), pages 233-258, Décembre.
    16. Manuel Arellano & Stéphane Bonhomme, 2023. "Recovering Latent Variables by Matching," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 118(541), pages 693-706, January.
    17. Gu, Jiaying & Russell, Thomas M., 2023. "Partial identification in nonseparable binary response models with endogenous regressors," Journal of Econometrics, Elsevier, vol. 235(2), pages 528-562.
    18. Lixiong Li & Désiré Kédagni & Ismaël Mourifié, 2024. "Discordant relaxations of misspecified models," Quantitative Economics, Econometric Society, vol. 15(2), pages 331-379, May.
    19. Nathan Kallus, 2022. "What's the Harm? Sharp Bounds on the Fraction Negatively Affected by Treatment," Papers 2205.10327, arXiv.org, revised Nov 2022.
    20. Ryo Okui, 2021. "A moment inequality approach to statistical inference for rankings," The Japanese Economic Review, Springer, vol. 72(2), pages 169-184, April.

    More about this item

    JEL classification:

    • C14 - Mathematical and Quantitative Methods - - Econometric and Statistical Methods and Methodology: General - - - Semiparametric and Nonparametric Methods: General
    • C21 - Mathematical and Quantitative Methods - - Single Equation Models; Single Variables - - - Cross-Sectional Models; Spatial Models; Treatment Effect Models
    • J62 - Labor and Demographic Economics - - Mobility, Unemployment, Vacancies, and Immigrant Workers - - - Job, Occupational and Intergenerational Mobility; Promotion

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2204.05175. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.