IDEAS home Printed from https://ideas.repec.org/a/bpj/causin/v6y2018i1p27n2.html
   My bibliography  Save this article

Detecting Confounding in Multivariate Linear Models via Spectral Analysis

Author

Listed:
  • Janzing Dominik

    (Deaprtment ‘Empirical Inference’,Max Planck Institute for Intelligent Systems,Spemannstr. 36, 70569Tübingen,Germany)

  • Schölkopf Bernhard

    (Deaprtment ‘Empirical Inference’,Max Planck Institute for Intelligent Systems,Tübingen,Germany)

Abstract

We study a model where one target variable Y$Y$ is correlated with a vector X:=(X1,…,Xd)$\textbf{X}:=(X_1,\dots,X_d)$ of predictor variables being potential causes of Y$Y$. We describe a method that infers to what extent the statistical dependences between X$\textbf{X}$ and Y$Y$ are due to the influence of X$\textbf{X}$ on Y$Y$ and to what extent due to a hidden common cause (confounder) of X$\textbf{X}$ and Y$Y$. The method relies on concentration of measure results for large dimensions d$d$ and an independence assumption stating that, in the absence of confounding, the vector of regression coefficients describing the influence of each X$\textbf{X}$ on Y$Y$ typically has ‘generic orientation’ relative to the eigenspaces of the covariance matrix of X$\textbf{X}$. For the special case of a scalar confounder we show that confounding typically spoils this generic orientation in a characteristic way that can be used to quantitatively estimate the amount of confounding (subject to our idealized model assumptions).

Suggested Citation

  • Janzing Dominik & Schölkopf Bernhard, 2018. "Detecting Confounding in Multivariate Linear Models via Spectral Analysis," Journal of Causal Inference, De Gruyter, vol. 6(1), pages 1-27, March.
  • Handle: RePEc:bpj:causin:v:6:y:2018:i:1:p:27:n:2
    DOI: 10.1515/jci-2017-0013
    as

    Download full text from publisher

    File URL: https://doi.org/10.1515/jci-2017-0013
    Download Restriction: no

    File URL: https://libkey.io/10.1515/jci-2017-0013?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Karlin, Samuel & Rinott, Yosef, 1980. "Classes of orderings of measures and related correlation inequalities. I. Multivariate totally positive distributions," Journal of Multivariate Analysis, Elsevier, vol. 10(4), pages 467-498, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Nowak, Piotr Bolesław, 2016. "The MLE of the mean of the exponential distribution based on grouped data is stochastically increasing," Statistics & Probability Letters, Elsevier, vol. 111(C), pages 49-54.
    2. Khaledi, Baha-Eldin & Shaked, Moshe, 2010. "Stochastic comparisons of multivariate mixtures," Journal of Multivariate Analysis, Elsevier, vol. 101(10), pages 2486-2498, November.
    3. Colangelo Antonio, 2005. "Multivariate hazard orderings of discrete random vectors," Economics and Quantitative Methods qf05010, Department of Economics, University of Insubria.
    4. Chi, Chang Koo & Murto, Pauli & Valimaki, Juuso, 2017. "All-Pay Auctions with Affiliated Values," MPRA Paper 80799, University Library of Munich, Germany.
    5. Arnaud Costinot & Jonathan Vogel, 2010. "Matching and Inequality in the World Economy," Journal of Political Economy, University of Chicago Press, vol. 118(4), pages 747-786, August.
    6. Rinott, Yosef & Scarsini, Marco, 2006. "Total positivity order and the normal distribution," Journal of Multivariate Analysis, Elsevier, vol. 97(5), pages 1251-1261, May.
    7. Gustavo Rodríguez, 1997. "First price auctions: Monotonicity and uniqueness," Economics Working Papers 208, Department of Economics and Business, Universitat Pompeu Fabra.
    8. repec:dau:papers:123456789/698 is not listed on IDEAS
    9. Vikram Krishnamurthy & Udit Pareek, 2015. "Myopic Bounds for Optimal Policy of POMDPs: An Extension of Lovejoy’s Structural Results," Operations Research, INFORMS, vol. 63(2), pages 428-434, April.
    10. Ilse Lindenlaub & Fabien Postel-Vinay, 2023. "Multidimensional Sorting under Random Search," Journal of Political Economy, University of Chicago Press, vol. 131(12), pages 3497-3539.
    11. Badía, F.G. & Sangüesa, C. & Cha, J.H., 2014. "Stochastic comparison of multivariate conditionally dependent mixtures," Journal of Multivariate Analysis, Elsevier, vol. 129(C), pages 82-94.
    12. Prokopovych, Pavlo & Yannelis, Nicholas C., 2019. "On monotone approximate and exact equilibria of an asymmetric first-price auction with affiliated private information," Journal of Economic Theory, Elsevier, vol. 184(C).
    13. Miravete, Eugenio J., 2011. "Convolution and composition of totally positive random variables in economics," Journal of Mathematical Economics, Elsevier, vol. 47(4-5), pages 479-490.
    14. Patricio S. Dalton & Sayantan Ghosal & Anandi Mani, 2016. "Poverty and Aspirations Failure," Economic Journal, Royal Economic Society, vol. 126(590), pages 165-188, February.
    15. Bhattacharya, Bhaskar, 2012. "Covariance selection and multivariate dependence," Journal of Multivariate Analysis, Elsevier, vol. 106(C), pages 212-228.
    16. Mizuno, Toshihide, 2003. "On the existence of a unique price equilibrium for models of product differentiation," International Journal of Industrial Organization, Elsevier, vol. 21(6), pages 761-793, June.
    17. Baha-Eldin Khaledi & Subhash Kochar, 2001. "Dependence Properties of Multivariate Mixture Distributions and Their Applications," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 53(3), pages 620-630, September.
    18. Moshe Shaked & J. Shanthikumar, 1990. "Parametric stochastic convexity and concavity of stochastic processes," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 42(3), pages 509-531, September.
    19. , & , & ,, 2008. "Credible ratings," Theoretical Economics, Econometric Society, vol. 3(3), September.
    20. Castaño-Martínez, A. & Pigueiras, G. & Sordo, M.A., 2019. "On a family of risk measures based on largest claims," Insurance: Mathematics and Economics, Elsevier, vol. 86(C), pages 92-97.
    21. Elina Robeva & Bernd Sturmfels & Ngoc Tran & Caroline Uhler, 2021. "Maximum likelihood estimation for totally positive log‐concave densities," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 48(3), pages 817-844, September.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bpj:causin:v:6:y:2018:i:1:p:27:n:2. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.degruyter.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.