IDEAS home Printed from https://ideas.repec.org/a/bla/biomet/v79y2023i3p2705-2718.html
   My bibliography  Save this article

Associating somatic mutation with clinical outcomes through kernel regression and optimal transport

Author

Listed:
  • Paul Little
  • Li Hsu
  • Wei Sun

Abstract

Somatic mutations in cancer patients are inherently sparse and potentially high dimensional. Cancer patients may share the same set of deregulated biological processes perturbed by different sets of somatically mutated genes. Therefore, when assessing the associations between somatic mutations and clinical outcomes, gene‐by‐gene analysis is often under‐powered because it does not capture the complex disease mechanisms shared across cancer patients. Rather than testing genes one by one, an intuitive approach is to aggregate somatic mutation data of multiple genes to assess their joint association with clinical outcomes. The challenge is how to aggregate such information. Building on the optimal transport method, we propose a principled approach to estimate the similarity of somatic mutation profiles of multiple genes between tumor samples, while accounting for gene–gene similarities defined by gene annotations or empirical mutational patterns. Using such similarities, we can assess the associations between somatic mutations and clinical outcomes by kernel regression. We have applied our method to analyze somatic mutation data of 17 cancer types and identified at least five cancer types, where somatic mutations are associated with overall survival, progression‐free interval, or cytolytic activity.

Suggested Citation

  • Paul Little & Li Hsu & Wei Sun, 2023. "Associating somatic mutation with clinical outcomes through kernel regression and optimal transport," Biometrics, The International Biometric Society, vol. 79(3), pages 2705-2718, September.
  • Handle: RePEc:bla:biomet:v:79:y:2023:i:3:p:2705-2718
    DOI: 10.1111/biom.13769
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/biom.13769
    Download Restriction: no

    File URL: https://libkey.io/10.1111/biom.13769?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Dawei Liu & Xihong Lin & Debashis Ghosh, 2007. "Semiparametric Regression of Multidimensional Genetic Pathway Data: Least-Squares Kernel Machines and Linear Mixed Models," Biometrics, The International Biometric Society, vol. 63(4), pages 1079-1088, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zaili Fang & Inyoung Kim & Jeesun Jung, 2018. "Semiparametric Kernel-Based Regression for Evaluating Interaction Between Pathway Effect and Covariate," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 23(1), pages 129-152, March.
    2. Arnab Maity & Xihong Lin, 2011. "Powerful Tests for Detecting a Gene Effect in the Presence of Possible Gene–Gene Interactions Using Garrote Kernel Machines," Biometrics, The International Biometric Society, vol. 67(4), pages 1271-1284, December.
    3. Long Qu & Tobias Guennel & Scott L. Marshall, 2013. "Linear Score Tests for Variance Components in Linear Mixed Models and Applications to Genetic Association Studies," Biometrics, The International Biometric Society, vol. 69(4), pages 883-892, December.
    4. Teran Hidalgo, Sebastian J. & Wu, Michael C. & Engel, Stephanie M. & Kosorok, Michael R., 2018. "Goodness-of-fit test for nonparametric regression models: Smoothing spline ANOVA models as example," Computational Statistics & Data Analysis, Elsevier, vol. 122(C), pages 135-155.
    5. Lin Zhang & Inyoung Kim, 2021. "Finite mixtures of semiparametric Bayesian survival kernel machine regressions: Application to breast cancer gene pathway subgroup analysis," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 70(2), pages 251-269, March.
    6. Chakraborty, Sounak, 2009. "Bayesian binary kernel probit model for microarray based cancer classification and gene selection," Computational Statistics & Data Analysis, Elsevier, vol. 53(12), pages 4198-4209, October.
    7. Yunxuan Jiang & Karen N. Conneely & Michael P. Epstein, 2018. "Robust Rare-Variant Association Tests for Quantitative Traits in General Pedigrees," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 10(3), pages 491-505, December.
    8. Glen McGee & Ander Wilson & Thomas F. Webster & Brent A. Coull, 2023. "Bayesian multiple index models for environmental mixtures," Biometrics, The International Biometric Society, vol. 79(1), pages 462-474, March.
    9. Pluta, Dustin & Yu, Zhaoxia & Shen, Tong & Chen, Chuansheng & Xue, Gui & Ombao, Hernando, 2018. "Statistical methods and challenges in connectome genetics," Statistics & Probability Letters, Elsevier, vol. 136(C), pages 83-86.
    10. Clemontina A. Davenport & Arnab Maity & Patrick F. Sullivan & Jung-Ying Tzeng, 2018. "A Powerful Test for SNP Effects on Multivariate Binary Outcomes Using Kernel Machine Regression," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 10(1), pages 117-138, April.
    11. Zhang Hongmei & Gan Jianjun, 2012. "A Reproducing Kernel-Based Spatial Model in Poisson Regressions," The International Journal of Biostatistics, De Gruyter, vol. 8(1), pages 1-26, October.
    12. Wen‐Yu Hua & Debashis Ghosh, 2015. "Equivalence of kernel machine regression and kernel distance covariance for multidimensional phenotype association studies," Biometrics, The International Biometric Society, vol. 71(3), pages 812-820, September.
    13. Ghosh, Debashis, 2014. "An asymptotically minimax kernel machine," Statistics & Probability Letters, Elsevier, vol. 95(C), pages 33-38.
    14. Xia Zheng & Yaohua Rong & Ling Liu & Weihu Cheng, 2021. "A More Accurate Estimation of Semiparametric Logistic Regression," Mathematics, MDPI, vol. 9(19), pages 1-12, September.
    15. Jennifer A. Sinnott & Tianxi Cai, 2013. "Omnibus Risk Assessment via Accelerated Failure Time Kernel Machine Modeling," Biometrics, The International Biometric Society, vol. 69(4), pages 861-873, December.
    16. Wei Dai & Ming Yang & Chaolong Wang & Tianxi Cai, 2017. "Sequence robust association test for familial data," Biometrics, The International Biometric Society, vol. 73(3), pages 876-884, September.
    17. Xiang Zhan & Anna Plantinga & Ni Zhao & Michael C. Wu, 2017. "A fast small‐sample kernel independence test for microbiome community‐level association analysis," Biometrics, The International Biometric Society, vol. 73(4), pages 1453-1463, December.
    18. Lulu Cheng & Inyoung Kim & Herbert Pang, 2016. "Bayesian Semiparametric Model for Pathway-Based Analysis with Zero-Inflated Clinical Outcomes," Journal of Agricultural, Biological and Environmental Statistics, Springer;The International Biometric Society;American Statistical Association, vol. 21(4), pages 641-662, December.
    19. Luts, Jan & Molenberghs, Geert & Verbeke, Geert & Van Huffel, Sabine & Suykens, Johan A.K., 2012. "A mixed effects least squares support vector machine model for classification of longitudinal data," Computational Statistics & Data Analysis, Elsevier, vol. 56(3), pages 611-628.
    20. Weiming Zhang & Michael P. Epstein & Tasha E. Fingerlin & Debashis Ghosh, 2017. "Links Between the Sequence Kernel Association and the Kernel-Based Adaptive Cluster Tests," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 9(1), pages 246-258, June.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:biomet:v:79:y:2023:i:3:p:2705-2718. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0006-341X .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.