IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v12y2024i5p696-d1347243.html
   My bibliography  Save this article

Invariant Feature Learning Based on Causal Inference from Heterogeneous Environments

Author

Listed:
  • Hang Su

    (School of Mathematics, Renmin University of China, Beijing 100872, China)

  • Wei Wang

    (School of Mathematics, Renmin University of China, Beijing 100872, China)

Abstract

Causality has become a powerful tool for addressing the out-of-distribution (OOD) generalization problem, with the idea of invariant causal features across domains of interest. Most existing methods for learning invariant features are based on optimization, which typically fails to converge to the optimal solution. Therefore, obtaining the variables that cause the target outcome through a causal inference method is a more direct and effective method. This paper presents a new approach for invariant feature learning based on causal inference (IFCI). IFCI detects causal variables unaffected by the environment through the causal inference method. IFCI focuses on partial causal relationships to work efficiently even in the face of high-dimensional data. Our proposed causal inference method can accurately infer causal effects even when the treatment variable has more complex values. Our method can be viewed as a pretreatment of data to filter out variables whose distributions change between different environments, and it can then be combined with any learning method for classification and regression. The result of empirical studies shows that IFCI can detect and filter out environmental variables affected by the environment. After filtering out environmental variables, even a model with a simple structure and common loss function can have strong OOD generalization capability. Furthermore, we provide evidence to show that classifiers utilizing IFCI achieve higher accuracy in classification compared to existing OOD generalization algorithms.

Suggested Citation

  • Hang Su & Wei Wang, 2024. "Invariant Feature Learning Based on Causal Inference from Heterogeneous Environments," Mathematics, MDPI, vol. 12(5), pages 1-23, February.
  • Handle: RePEc:gam:jmathe:v:12:y:2024:i:5:p:696-:d:1347243
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/12/5/696/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/12/5/696/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Jonas Peters & Peter Bühlmann & Nicolai Meinshausen, 2016. "Causal inference by using invariant prediction: identification and confidence intervals," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 78(5), pages 947-1012, November.
    2. Donald B. Rubin, 2005. "Causal Inference Using Potential Outcomes: Design, Modeling, Decisions," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 322-331, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Dominik Rothenhäusler & Nicolai Meinshausen & Peter Bühlmann & Jonas Peters, 2021. "Anchor regression: Heterogeneous data meet causality," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 83(2), pages 215-246, April.
    2. Noémi Kreif & Richard Grieve & Iván Díaz & David Harrison, 2015. "Evaluation of the Effect of a Continuous Treatment: A Machine Learning Approach with an Application to Treatment for Traumatic Brain Injury," Health Economics, John Wiley & Sons, Ltd., vol. 24(9), pages 1213-1228, September.
    3. Martin Ravallion, 2022. "On the Gains from Tradable Benefits‐in‐kind: Evidence for Workfare in India," Economica, London School of Economics and Political Science, vol. 89(355), pages 770-787, July.
    4. Peter Abell & Ofer Engel, 2021. "Subjective Causality and Counterfactuals in the Social Sciences: Toward an Ethnographic Causality?," Sociological Methods & Research, , vol. 50(4), pages 1842-1862, November.
    5. Shonosuke Sugasawa & Hisashi Noma, 2021. "Efficient screening of predictive biomarkers for individual treatment selection," Biometrics, The International Biometric Society, vol. 77(1), pages 249-257, March.
    6. Ruoxuan Xiong & Allison Koenecke & Michael Powell & Zhu Shen & Joshua T. Vogelstein & Susan Athey, 2021. "Federated Causal Inference in Heterogeneous Observational Data," Papers 2107.11732, arXiv.org, revised Apr 2023.
    7. Salvatore Bimonte & Antonella D’Agostino, 2021. "Tourism development and residents’ well-being: Comparing two seaside destinations in Italy," Tourism Economics, , vol. 27(7), pages 1508-1525, November.
    8. Mealli Fabrizia & Mattei Alessandra, 2012. "A Refreshing Account of Principal Stratification," The International Journal of Biostatistics, De Gruyter, vol. 8(1), pages 1-19, April.
    9. Antonio R. Linero, 2022. "Simulation‐based estimators of analytically intractable causal effects," Biometrics, The International Biometric Society, vol. 78(3), pages 1001-1017, September.
    10. Berger, Marius & Hottenrott, Hanna, 2021. "Start-up subsidies and the sources of venture capital," Journal of Business Venturing Insights, Elsevier, vol. 16(C).
    11. Sahar Saeed & Erica E. M. Moodie & Erin C. Strumpf & Marina B. Klein, 2018. "Segmented generalized mixed effect models to evaluate health outcomes," International Journal of Public Health, Springer;Swiss School of Public Health (SSPH+), vol. 63(4), pages 547-551, May.
    12. Jinglong Zhao, 2024. "Experimental Design For Causal Inference Through An Optimization Lens," Papers 2408.09607, arXiv.org, revised Aug 2024.
    13. Hodula, Martin & Melecký, Martin & Pfeifer, Lukáš & Szabo, Milan, 2023. "Cooling the mortgage loan market: The effect of borrower-based limits on new mortgage lending," Journal of International Money and Finance, Elsevier, vol. 132(C).
    14. Fangting Zhou & Kejun He & Yang Ni, 2023. "Individualized causal discovery with latent trajectory embedded Bayesian networks," Biometrics, The International Biometric Society, vol. 79(4), pages 3191-3202, December.
    15. Manuel S. González Canché, 2017. "Financial Benefits of Rapid Student Loan Repayment: An Analytic Framework Employing Two Decades of Data," The ANNALS of the American Academy of Political and Social Science, , vol. 671(1), pages 154-182, May.
    16. Damian Clarke & Daniel Paila~nir & Susan Athey & Guido Imbens, 2023. "Synthetic Difference In Differences Estimation," Papers 2301.11859, arXiv.org, revised Feb 2023.
    17. Almer, Christian & Winkler, Ralph, 2017. "Analyzing the effectiveness of international environmental policies: The case of the Kyoto Protocol," Journal of Environmental Economics and Management, Elsevier, vol. 82(C), pages 125-151.
    18. Sanford C. Gordon & Hannah K. Simpson, 2020. "Causes, theories, and the past in political science," Public Choice, Springer, vol. 185(3), pages 315-333, December.
    19. Lechner, Michael, 2008. "A note on endogenous control variables in causal studies," Statistics & Probability Letters, Elsevier, vol. 78(2), pages 190-195, February.
    20. Angelov, Nikolay & Eliason, Marcus, 2014. "The effects of targeted labour market programs for job seekers with occupational disabilities," Working Paper Series 2014:27, IFAU - Institute for Evaluation of Labour Market and Education Policy.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:12:y:2024:i:5:p:696-:d:1347243. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.