IDEAS home Printed from https://ideas.repec.org/a/spr/lifeda/v28y2022i3d10.1007_s10985-022-09552-w.html
   My bibliography  Save this article

Mixture survival trees for cancer risk classification

Author

Listed:
  • Beilin Jia

    (University of North Carolina at Chapel Hill)

  • Donglin Zeng

    (University of North Carolina at Chapel Hill)

  • Jason J. Z. Liao

    (Incyte Corporation)

  • Guanghan F. Liu

    (Merck & Co., Inc)

  • Xianming Tan

    (University of North Carolina at Chapel Hill)

  • Guoqing Diao

    (The George Washington University)

  • Joseph G. Ibrahim

    (University of North Carolina at Chapel Hill)

Abstract

In oncology studies, it is important to understand and characterize disease heterogeneity among patients so that patients can be classified into different risk groups and one can identify high-risk patients at the right time. This information can then be used to identify a more homogeneous patient population for developing precision medicine. In this paper, we propose a mixture survival tree approach for direct risk classification. We assume that the patients can be classified into a pre-specified number of risk groups, where each group has distinct survival profile. Our proposed tree-based methods are devised to estimate latent group membership using an EM algorithm. The observed data log-likelihood function is used as the splitting criterion in recursive partitioning. The finite sample performance is evaluated by extensive simulation studies and the proposed method is illustrated by a case study in breast cancer.

Suggested Citation

  • Beilin Jia & Donglin Zeng & Jason J. Z. Liao & Guanghan F. Liu & Xianming Tan & Guoqing Diao & Joseph G. Ibrahim, 2022. "Mixture survival trees for cancer risk classification," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 28(3), pages 356-379, July.
  • Handle: RePEc:spr:lifeda:v:28:y:2022:i:3:d:10.1007_s10985-022-09552-w
    DOI: 10.1007/s10985-022-09552-w
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10985-022-09552-w
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10985-022-09552-w?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Hoora Moradian & Denis Larocque & François Bellavance, 2017. "$$L_1$$ L 1 splitting rules in survival forests," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 23(4), pages 671-691, October.
    2. Donglin Zeng & Lu Mao & D. Y. Lin, 2016. "Maximum likelihood estimation for semiparametric transformation models with interval-censored data," Biometrika, Biometrika Trust, vol. 103(2), pages 253-271.
    3. Yan Zhou & John McArdle, 2015. "Rationale and Applications of Survival Tree and Survival Ensemble Methods," Psychometrika, Springer;The Psychometric Society, vol. 80(3), pages 811-833, September.
    4. Juan Shen & Xuming He, 2015. "Inference for Subgroup Analysis With a Structured Logistic-Normal Mixture Model," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(509), pages 303-312, March.
    5. Yueh-Yun Chi & Joseph G. Ibrahim, 2006. "Joint Models for Multivariate Longitudinal and Multivariate Survival Data," Biometrics, The International Biometric Society, vol. 62(2), pages 432-445, June.
    6. Martin G. Larson & Gregg E. Dinse, 1985. "A Mixture Model for the Regression Analysis of Competing Risks Data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 34(3), pages 201-211, November.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Alina Schenk & Moritz Berger & Matthias Schmid, 2024. "Pseudo-value regression trees," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 30(2), pages 439-471, April.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jingxiang Chen & Yufeng Liu & Donglin Zeng & Rui Song & Yingqi Zhao & Michael R. Kosorok, 2016. "Comment," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(515), pages 942-947, July.
    2. Zhang, Danjie & Chen, Ming-Hui & Ibrahim, Joseph G. & Boye, Mark E. & Shen, Wei, 2016. "JMFit: A SAS Macro for Joint Models of Longitudinal and Survival Data," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 71(i03).
    3. Qingning Zhou & Jianwen Cai & Haibo Zhou, 2018. "Outcome†dependent sampling with interval†censored failure time data," Biometrics, The International Biometric Society, vol. 74(1), pages 58-67, March.
    4. Cai, Tingting & Li, Jianbo & Zhou, Qin & Yin, Songlou & Zhang, Riquan, 2024. "Subgroup detection based on partially linear additive individualized model with missing data in response," Computational Statistics & Data Analysis, Elsevier, vol. 192(C).
    5. Wentian Guo & Yuan Ji & Daniel V. T. Catenacci, 2017. "A subgroup cluster-based Bayesian adaptive design for precision medicine," Biometrics, The International Biometric Society, vol. 73(2), pages 367-377, June.
    6. Oi, Katsuya, 2020. "Disuse as time away from a cognitively demanding job; how does it temporally or developmentally impact late-life cognition?," Intelligence, Elsevier, vol. 82(C).
    7. Tang, Nian-Sheng & Tang, An-Min & Pan, Dong-Dong, 2014. "Semiparametric Bayesian joint models of multivariate longitudinal and survival data," Computational Statistics & Data Analysis, Elsevier, vol. 77(C), pages 113-129.
    8. Choi, Taehwa & Kim, Arlene K.H. & Choi, Sangbum, 2021. "Semiparametric least-squares regression with doubly-censored data," Computational Statistics & Data Analysis, Elsevier, vol. 164(C).
    9. Huicong Yu & Jiaqi Wu & Weiping Zhang, 2024. "Simultaneous subgroup identification and variable selection for high dimensional data," Computational Statistics, Springer, vol. 39(6), pages 3181-3205, September.
    10. Sankaran, P.G. & Anisha, P., 2012. "Additive hazards models for gap time data with multiple causes," Statistics & Probability Letters, Elsevier, vol. 82(7), pages 1454-1462.
    11. Du, Mingyue & Li, Huiqiong & Sun, Jianguo, 2021. "Regression analysis of censored data with nonignorable missing covariates and application to Alzheimer Disease," Computational Statistics & Data Analysis, Elsevier, vol. 157(C).
    12. Yanqing Sun & Qingning Zhou & Peter B. Gilbert, 2023. "Analysis of the Cox Model with Longitudinal Covariates with Measurement Errors and Partly Interval Censored Failure Times, with Application to an AIDS Clinical Trial," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 15(2), pages 430-454, July.
    13. Minggen Lu & Christopher S. McMahan, 2018. "A partially linear proportional hazards model for current status data," Biometrics, The International Biometric Society, vol. 74(4), pages 1240-1249, December.
    14. Yangxin Huang & Xiaosun Lu & Jiaqing Chen & Juan Liang & Miriam Zangmeister, 2018. "Joint model-based clustering of nonlinear longitudinal trajectories and associated time-to-event data analysis, linked by latent class membership: with application to AIDS clinical studies," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 24(4), pages 699-718, October.
    15. Zhang, Xiaochen & Zhang, Qingzhao & Ma, Shuangge & Fang, Kuangnan, 2022. "Subgroup analysis for high-dimensional functional regression," Journal of Multivariate Analysis, Elsevier, vol. 192(C).
    16. Okumu Argan Wekesa & Mwalili Samuel & Mwita Peter, 2012. "Modelling Credit Risk for Personal Loans Using Product-Limit Estimator," International Journal of Financial Research, International Journal of Financial Research, Sciedu Press, vol. 3(1), pages 22-32, January.
    17. Mo Leo S. F. & Yau Kelvin K. W., 2010. "Survival Mixture Model for Credit Risk Analysis," Asia-Pacific Journal of Risk and Insurance, De Gruyter, vol. 4(2), pages 1-20, July.
    18. Wang, Wuyi & Su, Liangjun, 2021. "Identifying latent group structures in nonlinear panels," Journal of Econometrics, Elsevier, vol. 220(2), pages 272-295.
    19. S. R. Haile & J.-H. Jeong & X. Chen & Y. Cheng, 2016. "A 3-parameter Gompertz distribution for survival data with competing risks, with an application to breast cancer data," Journal of Applied Statistics, Taylor & Francis Journals, vol. 43(12), pages 2239-2253, September.
    20. Donglin Zeng & Fei Gao & D. Y. Lin, 2017. "Maximum likelihood estimation for semiparametric regression models with multivariate interval-censored data," Biometrika, Biometrika Trust, vol. 104(3), pages 505-525.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:lifeda:v:28:y:2022:i:3:d:10.1007_s10985-022-09552-w. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.