IDEAS home Printed from https://ideas.repec.org/a/bpj/ijbist/v6y2010i2n2.html
   My bibliography  Save this article

Targeted Maximum Likelihood Based Causal Inference: Part I

Author

Listed:
  • van der Laan Mark J.

    (University of California - Berkeley)

Abstract

Given causal graph assumptions, intervention-specific counterfactual distributions of the data can be defined by the so called G-computation formula, which is obtained by carrying out these interventions on the likelihood of the data factorized according to the causal graph. The obtained G-computation formula represents the counterfactual distribution the data would have had if this intervention would have been enforced on the system generating the data. A causal effect of interest can now be defined as some difference between these counterfactual distributions indexed by different interventions. For example, the interventions can represent static treatment regimens or individualized treatment rules that assign treatment in response to time-dependent covariates, and the causal effects could be defined in terms of features of the mean of the treatment-regimen specific counterfactual outcome of interest as a function of the corresponding treatment regimens. Such features could be defined nonparametrically in terms of so called (nonparametric) marginal structural models for static or individualized treatment rules, whose parameters can be thought of as (smooth) summary measures of differences between the treatment regimen specific counterfactual distributions.In this article, we develop a particular targeted maximum likelihood estimator of causal effects of multiple time point interventions. This involves the use of loss-based super-learning to obtain an initial estimate of the unknown factors of the G-computation formula, and subsequently, applying a target-parameter specific optimal fluctuation function (least favorable parametric submodel) to each estimated factor, estimating the fluctuation parameter(s) with maximum likelihood estimation, and iterating this updating step of the initial factor till convergence. This iterative targeted maximum likelihood updating step makes the resulting estimator of the causal effect double robust in the sense that it is consistent if either the initial estimator is consistent, or the estimator of the optimal fluctuation function is consistent. The optimal fluctuation function is correctly specified if the conditional distributions of the nodes in the causal graph one intervenes upon are correctly specified. The latter conditional distributions often comprise the so called treatment and censoring mechanism. Selection among different targeted maximum likelihood estimators (e.g., indexed by different initial estimators) can be based on loss-based cross-validation such as likelihood based cross-validation or cross-validation based on another appropriate loss function for the distribution of the data. Some specific loss functions are mentioned in this article.Subsequently, a variety of interesting observations about this targeted maximum likelihood estimation procedure are made. This article provides the basis for the subsequent companion Part II-article in which concrete demonstrations for the implementation of the targeted MLE in complex causal effect estimation problems are provided.

Suggested Citation

  • van der Laan Mark J., 2010. "Targeted Maximum Likelihood Based Causal Inference: Part I," The International Journal of Biostatistics, De Gruyter, vol. 6(2), pages 1-45, February.
  • Handle: RePEc:bpj:ijbist:v:6:y:2010:i:2:n:2
    DOI: 10.2202/1557-4679.1211
    as

    Download full text from publisher

    File URL: https://doi.org/10.2202/1557-4679.1211
    Download Restriction: For access to full text, subscription to the journal or payment for the individual article is required.

    File URL: https://libkey.io/10.2202/1557-4679.1211?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Rose Sherri & van der Laan Mark J., 2008. "Simple Optimal Weighting of Cases and Controls in Case-Control Studies," The International Journal of Biostatistics, De Gruyter, vol. 4(1), pages 1-26, September.
    2. P. W. Lavori & R. Dawson, 2000. "A design for testing clinical strategies: biased adaptive within‐subject randomization," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 163(1), pages 29-38.
    3. van der Laan Mark J. & Petersen Maya L, 2007. "Causal Effect Models for Realistic Individualized Treatment and Intention to Treat Rules," The International Journal of Biostatistics, De Gruyter, vol. 3(1), pages 1-55, March.
    4. van der Laan Mark J. & Dudoit Sandrine & Keles Sunduz, 2004. "Asymptotic Optimality of Likelihood-Based Cross-Validation," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 3(1), pages 1-25, March.
    5. van der Laan Mark J. & Polley Eric C & Hubbard Alan E., 2007. "Super Learner," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 6(1), pages 1-23, September.
    6. S. A. Murphy, 2003. "Optimal dynamic treatment regimes," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 65(2), pages 331-355, May.
    7. Rubin Daniel B & van der Laan Mark J., 2008. "Empirical Efficiency Maximization: Improved Locally Efficient Covariate Adjustment in Randomized Experiments and Survival Analysis," The International Journal of Biostatistics, De Gruyter, vol. 4(1), pages 1-42, May.
    8. van der Laan Mark J. & Rubin Daniel, 2006. "Targeted Maximum Likelihood Learning," The International Journal of Biostatistics, De Gruyter, vol. 2(1), pages 1-40, December.
    9. Vaart Aad W. van der & Dudoit Sandrine & Laan Mark J. van der, 2006. "Oracle inequalities for multi-fold cross validation," Statistics & Risk Modeling, De Gruyter, vol. 24(3), pages 351-371, December.
    10. Rose Sherri & van der Laan Mark J., 2009. "Why Match? Investigating Matched Case-Control Study Designs with Causal Effect Estimation," The International Journal of Biostatistics, De Gruyter, vol. 5(1), pages 1-26, January.
    11. van der Laan Mark J., 2008. "Estimation Based on Case-Control Designs with Known Prevalence Probability," The International Journal of Biostatistics, De Gruyter, vol. 4(1), pages 1-59, September.
    12. van der Laan Mark J. & Petersen Maya L & Joffe Marshall M, 2005. "History-Adjusted Marginal Structural Models and Statically-Optimal Dynamic Treatment Regimens," The International Journal of Biostatistics, De Gruyter, vol. 1(1), pages 1-41, November.
    13. Laan Mark J. van der & Dudoit Sandrine & Vaart Aad W. van der, 2006. "The cross-validated adaptive epsilon-net estimator," Statistics & Risk Modeling, De Gruyter, vol. 24(3), pages 373-395, December.
    14. Alberto Abadie & Guido W. Imbens, 2006. "Large Sample Properties of Matching Estimators for Average Treatment Effects," Econometrica, Econometric Society, vol. 74(1), pages 235-267, January.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Mireille E. Schnitzer & Erica E.M. Moodie & Mark J. van der Laan & Robert W. Platt & Marina B. Klein, 2014. "Modeling the impact of hepatitis C viral clearance on end-stage liver disease in an HIV co-infected cohort with targeted maximum likelihood estimation," Biometrics, The International Biometric Society, vol. 70(1), pages 144-152, March.
    2. Rachael V. Phillips & Mark J. van der Laan, 2022. "Rachael V. Phillips and Mark J. van der Laan’s contribution to the Discussion of ‘Assumption‐lean inference for generalised linear model parameters’ by Vansteelandt and Dukes," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(3), pages 717-718, July.
    3. Jason Roy & Kirsten J. Lum & Bret Zeldow & Jordan D. Dworkin & Vincent Lo Re & Michael J. Daniels, 2018. "Bayesian nonparametric generative models for causal inference with missing at random covariates," Biometrics, The International Biometric Society, vol. 74(4), pages 1193-1202, December.
    4. Brathwaite, Timothy & Walker, Joan L., 2018. "Causal inference in travel demand modeling (and the lack thereof)," Journal of choice modelling, Elsevier, vol. 26(C), pages 1-18.
    5. repec:jss:jstsof:43:i13 is not listed on IDEAS
    6. Jacob, Daniel, 2021. "CATE meets ML: Conditional average treatment effect and machine learning," IRTG 1792 Discussion Papers 2021-005, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    7. Daniel Jacob, 2021. "CATE meets ML," Digital Finance, Springer, vol. 3(2), pages 99-148, June.
    8. Guo, Xu & Fang, Yun & Zhu, Xuehu & Xu, Wangli & Zhu, Lixing, 2018. "Semiparametric double robust and efficient estimation for mean functionals with response missing at random," Computational Statistics & Data Analysis, Elsevier, vol. 128(C), pages 325-339.
    9. Sara E Moore & Anna Decker & Alan Hubbard & Rachael A Callcut & Erin E Fox & Deborah J del Junco & John B Holcomb & Mohammad H Rahbar & Charles E Wade & Martin A Schreiber & Louis H Alarcon & Karen J , 2015. "Statistical Machines for Trauma Hospital Outcomes Research: Application to the PRospective, Observational, Multi-Center Major Trauma Transfusion (PROMMTT) Study," PLOS ONE, Public Library of Science, vol. 10(8), pages 1-16, August.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. van der Laan Mark J. & Gruber Susan, 2010. "Collaborative Double Robust Targeted Maximum Likelihood Estimation," The International Journal of Biostatistics, De Gruyter, vol. 6(1), pages 1-71, May.
    2. Rose Sherri & van der Laan Mark J., 2011. "A Targeted Maximum Likelihood Estimator for Two-Stage Designs," The International Journal of Biostatistics, De Gruyter, vol. 7(1), pages 1-21, March.
    3. van der Laan Mark J. & Petersen Maya & Zheng Wenjing, 2013. "Estimating the Effect of a Community-Based Intervention with Two Communities," Journal of Causal Inference, De Gruyter, vol. 1(1), pages 83-106, June.
    4. Kristin A. Linn & Eric B. Laber & Leonard A. Stefanski, 2017. "Interactive -Learning for Quantiles," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(518), pages 638-649, April.
    5. van der Laan Mark J. & Gruber Susan, 2012. "Targeted Minimum Loss Based Estimation of Causal Effects of Multiple Time Point Interventions," The International Journal of Biostatistics, De Gruyter, vol. 8(1), pages 1-41, May.
    6. Gruber Susan & van der Laan Mark J., 2010. "An Application of Collaborative Targeted Maximum Likelihood Estimation in Causal Inference and Genomics," The International Journal of Biostatistics, De Gruyter, vol. 6(1), pages 1-31, May.
    7. Biernot Peter & Moodie Erica E. M., 2010. "A Comparison of Variable Selection Approaches for Dynamic Treatment Regimes," The International Journal of Biostatistics, De Gruyter, vol. 6(1), pages 1-20, January.
    8. Brooks Jordan & van der Laan Mark J. & Go Alan S., 2012. "Targeted Maximum Likelihood Estimation for Prediction Calibration," The International Journal of Biostatistics, De Gruyter, vol. 8(1), pages 1-35, October.
    9. Díaz Muñoz Iván & van der Laan Mark J., 2011. "Super Learner Based Conditional Density Estimation with Application to Marginal Structural Models," The International Journal of Biostatistics, De Gruyter, vol. 7(1), pages 1-20, October.
    10. Zhen Li & Jie Chen & Eric Laber & Fang Liu & Richard Baumgartner, 2023. "Optimal Treatment Regimes: A Review and Empirical Comparison," International Statistical Review, International Statistical Institute, vol. 91(3), pages 427-463, December.
    11. Q. Clairon & R. Henderson & N. J. Young & E. D. Wilson & C. J. Taylor, 2021. "Adaptive treatment and robust control," Biometrics, The International Biometric Society, vol. 77(1), pages 223-236, March.
    12. Jin Wang & Donglin Zeng & D. Y. Lin, 2022. "Semiparametric single-index models for optimal treatment regimens with censored outcomes," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 28(4), pages 744-763, October.
    13. Susan Athey & Guido W. Imbens & Stefan Wager, 2018. "Approximate residual balancing: debiased inference of average treatment effects in high dimensions," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 80(4), pages 597-623, September.
    14. Jelena Bradic & Weijie Ji & Yuqian Zhang, 2021. "High-dimensional Inference for Dynamic Treatment Effects," Papers 2110.04924, arXiv.org, revised May 2023.
    15. Stitelman Ori M & van der Laan Mark J., 2010. "Collaborative Targeted Maximum Likelihood for Time to Event Data," The International Journal of Biostatistics, De Gruyter, vol. 6(1), pages 1-46, June.
    16. Gruber Susan & van der Laan Mark J., 2010. "A Targeted Maximum Likelihood Estimator of a Causal Effect on a Bounded Continuous Outcome," The International Journal of Biostatistics, De Gruyter, vol. 6(1), pages 1-18, August.
    17. Frölich, Markus & Huber, Martin & Wiesenfarth, Manuel, 2017. "The finite sample performance of semi- and non-parametric estimators for treatment effects and policy evaluation," Computational Statistics & Data Analysis, Elsevier, vol. 115(C), pages 91-102.
    18. Rose Sherri & van der Laan Mark J., 2008. "Simple Optimal Weighting of Cases and Controls in Case-Control Studies," The International Journal of Biostatistics, De Gruyter, vol. 4(1), pages 1-26, September.
    19. Zhiwei Zhang & Zhen Chen & James F. Troendle & Jun Zhang, 2012. "Causal Inference on Quantiles with an Obstetric Application," Biometrics, The International Biometric Society, vol. 68(3), pages 697-706, September.
    20. Rich Benjamin & Moodie Erica E. M. & A. Stephens David, 2016. "Influence Re-weighted G-Estimation," The International Journal of Biostatistics, De Gruyter, vol. 12(1), pages 157-177, May.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bpj:ijbist:v:6:y:2010:i:2:n:2. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Peter Golla (email available below). General contact details of provider: https://www.degruyter.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.