IDEAS home Printed from https://ideas.repec.org/a/oup/emjrnl/v28y2025i1p41-82..html
   My bibliography  Save this article

Causal inference and data fusion in econometrics

Author

Listed:
  • Paul Hünermund
  • Elias Bareinboim

Abstract

SummaryLearning about cause and effect is arguably the main goal in applied econometrics. In practice, the validity of these causal inferences is contingent on a number of critical assumptions regarding the type of data that has been collected, and the substantive knowledge that is available about the phenomenon under investigation. For instance, unobserved confounding factors threaten the internal validity of estimates; data availability is often limited to nonrandom, selection-biased samples; causal effects need to be learned from surrogate experiments with imperfect compliance; and causal knowledge has to be extrapolated across structurally heterogeneous populations. A powerful and flexible causal inference framework is required in order to tackle all of these challenges, which plague essentially any data analysis to varying degrees. Building on the structural perspective on causality introduced by Haavelmo (1943) and the graph-theoretic approach proposed by Pearl (1995), the artificial intelligence (AI) literature has developed a wide array of techniques for causal inference that allow us to leverage information from various imperfect, heterogeneous, and biased data sources (Bareinboim and Pearl, 2016). In this paper, we review recent advances made in this literature that have the potential to contribute to econometric methodology along three broad dimensions. First, they provide a unified and comprehensive framework for causal learning, in which the above-mentioned problems can be addressed in generality. Second, due to their origin in AI, they come together with sound, efficient, and complete (to be formally defined) algorithmic criteria for automation of the corresponding identification task. And third, because of the nonparametric description of structural models that graph-theoretic approaches build on, they combine the analytical rigor of structural econometrics with the flexibility of the potential outcomes framework, and thus offer a valuable complement to these two literature streams.

Suggested Citation

  • Paul Hünermund & Elias Bareinboim, 2025. "Causal inference and data fusion in econometrics," The Econometrics Journal, Royal Economic Society, vol. 28(1), pages 41-82.
  • Handle: RePEc:oup:emjrnl:v:28:y:2025:i:1:p:41-82.
    as

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1093/ectj/utad008
    Download Restriction: Access to full text is restricted to subscribers.
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:oup:emjrnl:v:28:y:2025:i:1:p:41-82.. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Oxford University Press (email available below). General contact details of provider: https://edirc.repec.org/data/resssea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.