IDEAS home Printed from https://ideas.repec.org/a/bla/biomet/v73y2017i4p1254-1265.html
   My bibliography  Save this article

Analysis of multiple diverse phenotypes via semiparametric canonical correlation analysis

Author

Listed:
  • Denis Agniel
  • Tianxi Cai

Abstract

Studying multiple outcomes simultaneously allows researchers to begin to identify underlying factors that affect all of a set of diseases (i.e., shared etiology) and what may give rise to differences in disorders between patients (i.e., disease subtypes). In this work, our goal is to build risk scores that are predictive of multiple phenotypes simultaneously and identify subpopulations at high risk of multiple phenotypes. Such analyses could yield insight into etiology or point to treatment and prevention strategies. The standard canonical correlation analysis (CCA) can be used to relate multiple continuous outcomes to multiple predictors. However, in order to capture the full complexity of a disorder, phenotypes may include a diverse range of data types, including binary, continuous, ordinal, and censored variables. When phenotypes are diverse in this way, standard CCA is not possible and no methods currently exist to model them jointly. In the presence of such complications, we propose a semi‐parametric CCA method to develop risk scores that are predictive of multiple phenotypes. To guard against potential model mis‐specification, we also propose a nonparametric calibration method to identify subgroups that are at high risk of multiple disorders. A resampling procedure is also developed to account for the variability in these estimates. Our method opens the door to synthesizing a wide array of data sources for the purposes of joint prediction.

Suggested Citation

  • Denis Agniel & Tianxi Cai, 2017. "Analysis of multiple diverse phenotypes via semiparametric canonical correlation analysis," Biometrics, The International Biometric Society, vol. 73(4), pages 1254-1265, December.
  • Handle: RePEc:bla:biomet:v:73:y:2017:i:4:p:1254-1265
    DOI: 10.1111/biom.12690
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/biom.12690
    Download Restriction: no

    File URL: https://libkey.io/10.1111/biom.12690?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Ogasawara, Haruhiko, 2007. "Asymptotic expansions of the distributions of estimators in canonical correlation analysis under nonnormality," Journal of Multivariate Analysis, Elsevier, vol. 98(9), pages 1726-1750, October.
    2. T. Cai & L. Tian & Hajime Uno & Scott D. Solomon & L. J. Wei, 2010. "Calibrating parametric subject-specific risk estimation," Biometrika, Biometrika Trust, vol. 97(2), pages 389-404.
    3. Lu Tian & David Zucker & L.J. Wei, 2005. "On the Cox Model With Time-Varying Regression Coefficients," Journal of the American Statistical Association, American Statistical Association, vol. 100, pages 172-183, March.
    4. Ling Zhou & Huazhen Lin & Xinyuan Song & Yi Li, 2014. "Selection of Latent Variables for Multiple Mixed-outcome Models," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 41(4), pages 1064-1082, December.
    5. Denis Agniel & Katherine P. Liao & Tianxi Cai, 2016. "Estimation and testing for multiple regulation of multivariate mixed outcomes," Biometrics, The International Biometric Society, vol. 72(4), pages 1194-1205, December.
    6. Joe, Harry, 2005. "Asymptotic efficiency of the two-stage estimation method for copula-based models," Journal of Multivariate Analysis, Elsevier, vol. 94(2), pages 401-419, June.
    7. Yingcun Xia, 2008. "A semiparametric approach to canonical analysis," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 70(3), pages 519-543, July.
    8. Tianxi Cai & Lu Tian & L. J. Wei, 2005. "Semiparametric Box–Cox power transformation models for censored survival observations," Biometrika, Biometrika Trust, vol. 92(3), pages 619-632, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Layla Parast & Beth Ann Griffin, 2017. "Landmark estimation of survival and treatment effects in observational studies," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 23(2), pages 161-182, April.
    2. Yi Li & Lu Tian & Lee-Jen Wei, 2011. "Estimating Subject-Specific Dependent Competing Risk Profile with Censored Event Time Observations," Biometrics, The International Biometric Society, vol. 67(2), pages 427-435, June.
    3. Yingye Zheng & Tianxi Cai & Janet L. Stanford & Ziding Feng, 2010. "Semiparametric Models of Time-Dependent Predictive Values of Prognostic Biomarkers," Biometrics, The International Biometric Society, vol. 66(1), pages 50-60, March.
    4. Hongyuan Cao & Mathew M. Churpek & Donglin Zeng & Jason P. Fine, 2015. "Analysis of the Proportional Hazards Model With Sparse Longitudinal Covariates," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(511), pages 1187-1196, September.
    5. Jun Yan & Jian Huang, 2012. "Model Selection for Cox Models with Time-Varying Coefficients," Biometrics, The International Biometric Society, vol. 68(2), pages 419-428, June.
    6. Bouteska, Ahmed & Sharif, Taimur & Abedin, Mohammad Zoynul, 2023. "COVID-19 and stock returns: Evidence from the Markov switching dependence approach," Research in International Business and Finance, Elsevier, vol. 64(C).
    7. Torben Martinussen & Christian Bressen Pipper, 2014. "Estimation of Causal Odds of Concordance using the Aalen Additive Model," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 41(1), pages 141-151, March.
    8. Hamza, Taher & Ben Haj Hamida, Hayet & Mili, Mehdi & Sami, Mina, 2024. "High inflation during Russia–Ukraine war and financial market interaction: Evidence from C-Vine Copula and SETAR models," Research in International Business and Finance, Elsevier, vol. 70(PB).
    9. Michael S. Smith & Shaun P. Vahey, 2016. "Asymmetric Forecast Densities for U.S. Macroeconomic Variables from a Gaussian Copula Model of Cross-Sectional and Serial Dependence," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 34(3), pages 416-434, July.
    10. Atkinson, Anthony C. & Riani, Marco & Corbellini, Aldo, 2021. "The box-cox transformation: review and extensions," LSE Research Online Documents on Economics 103537, London School of Economics and Political Science, LSE Library.
    11. Wang, Mengjiao & Liu, Jianxu & Yang, Bing, 2024. "Does the strength of the US dollar affect the interdependence among currency exchange rates of RCEP and CPTPP countries?," Finance Research Letters, Elsevier, vol. 62(PA).
    12. Naseri, Masoud & Baraldi, Piero & Compare, Michele & Zio, Enrico, 2016. "Availability assessment of oil and gas processing plants operating under dynamic Arctic weather conditions," Reliability Engineering and System Safety, Elsevier, vol. 152(C), pages 66-82.
    13. Li, Feng & Kang, Yanfei, 2018. "Improving forecasting performance using covariate-dependent copula models," International Journal of Forecasting, Elsevier, vol. 34(3), pages 456-476.
    14. Kojadinovic, Ivan & Yan, Jun, 2010. "Comparison of three semiparametric methods for estimating dependence parameters in copula models," Insurance: Mathematics and Economics, Elsevier, vol. 47(1), pages 52-63, August.
    15. Petropoulos, Fotios & Apiletti, Daniele & Assimakopoulos, Vassilios & Babai, Mohamed Zied & Barrow, Devon K. & Ben Taieb, Souhaib & Bergmeir, Christoph & Bessa, Ricardo J. & Bijak, Jakub & Boylan, Joh, 2022. "Forecasting: theory and practice," International Journal of Forecasting, Elsevier, vol. 38(3), pages 705-871.
      • Fotios Petropoulos & Daniele Apiletti & Vassilios Assimakopoulos & Mohamed Zied Babai & Devon K. Barrow & Souhaib Ben Taieb & Christoph Bergmeir & Ricardo J. Bessa & Jakub Bijak & John E. Boylan & Jet, 2020. "Forecasting: theory and practice," Papers 2012.03854, arXiv.org, revised Jan 2022.
    16. Ojea-Ferreiro, Javier & Reboredo, Juan C., 2022. "Exchange rates and the global transmission of equity market shocks," Economic Modelling, Elsevier, vol. 114(C).
    17. Corduas, Marcella, 2015. "A statistical model for consumer preferences: the case of Italian extra virgin olive oil," 143rd Joint EAAE/AAEA Seminar, March 25-27, 2015, Naples, Italy 202701, European Association of Agricultural Economists.
    18. Roland A. Matsouaka & Junlong Li & Tianxi Cai, 2014. "Evaluating marker-guided treatment selection strategies," Biometrics, The International Biometric Society, vol. 70(3), pages 489-499, September.
    19. Guillermo Martínez-Flórez & Artur J. Lemonte & Germán Moreno-Arenas & Roger Tovar-Falón, 2022. "The Bivariate Unit-Sinh-Normal Distribution and Its Related Regression Model," Mathematics, MDPI, vol. 10(17), pages 1-26, August.
    20. Rémillard, Bruno & Scaillet, Olivier, 2009. "Testing for equality between two copulas," Journal of Multivariate Analysis, Elsevier, vol. 100(3), pages 377-386, March.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:biomet:v:73:y:2017:i:4:p:1254-1265. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0006-341X .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.