IDEAS home Printed from https://ideas.repec.org/a/jss/jstsof/v023i12.html
   My bibliography  Save this article

CCA: An R Package to Extend Canonical Correlation Analysis

Author

Listed:
  • González, Ignacio
  • Déjean, Sébastien
  • Martin, Pascal G. P.
  • Baccini, Alain

Abstract

Canonical correlations analysis (CCA) is an exploratory statistical method to highlight correlations between two data sets acquired on the same experimental units. The cancor() function in R (R Development Core Team 2007) performs the core of computations but further work was required to provide the user with additional tools to facilitate the interpretation of the results. We implemented an R package, CCA, freely available from the Comprehensive R Archive Network (CRAN, http://CRAN.R-project.org/), to develop numerical and graphical outputs and to enable the user to handle missing values. The CCA package also includes a regularized version of CCA to deal with data sets with more variables than units. Illustrations are given through the analysis of a data set coming from a nutrigenomic study in the mouse.

Suggested Citation

  • González, Ignacio & Déjean, Sébastien & Martin, Pascal G. P. & Baccini, Alain, 2008. "CCA: An R Package to Extend Canonical Correlation Analysis," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 23(i12).
  • Handle: RePEc:jss:jstsof:v:023:i12
    DOI: http://hdl.handle.net/10.18637/jss.v023.i12
    as

    Download full text from publisher

    File URL: https://www.jstatsoft.org/index.php/jss/article/view/v023i12/v23i12.pdf
    Download Restriction: no

    File URL: https://www.jstatsoft.org/index.php/jss/article/downloadSuppFile/v023i12/CCA_1.1.tar.gz
    Download Restriction: no

    File URL: https://www.jstatsoft.org/index.php/jss/article/downloadSuppFile/v023i12/v23i12.R
    Download Restriction: no

    File URL: https://libkey.io/http://hdl.handle.net/10.18637/jss.v023.i12?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Peter Bickel & Bo Li & Alexandre Tsybakov & Sara Geer & Bin Yu & Teófilo Valdés & Carlos Rivero & Jianqing Fan & Aad Vaart, 2006. "Regularization in statistics," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 15(2), pages 271-344, September.
    2. Mevik, Björn-Helge & Wehrens, Ron, 2007. "The pls Package: Principal Component and Partial Least Squares Regression in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 18(i02).
    3. Vinod, H. D., 1976. "Canonical ridge and econometrics of joint production," Journal of Econometrics, Elsevier, vol. 4(2), pages 147-166, May.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Florian Rohart & Benoît Gautier & Amrit Singh & Kim-Anh Lê Cao, 2017. "mixOmics: An R package for ‘omics feature selection and multiple data integration," PLOS Computational Biology, Public Library of Science, vol. 13(11), pages 1-19, November.
    2. Langworthy, Benjamin W. & Stephens, Rebecca L. & Gilmore, John H. & Fine, Jason P., 2021. "Canonical correlation analysis for elliptical copulas," Journal of Multivariate Analysis, Elsevier, vol. 183(C).
    3. Tenenhaus, Arthur & Philippe, Cathy & Frouin, Vincent, 2015. "Kernel Generalized Canonical Correlation Analysis," Computational Statistics & Data Analysis, Elsevier, vol. 90(C), pages 114-131.
    4. Wang, Wenjia & Zhou, Yi-Hui, 2021. "Eigenvector-based sparse canonical correlation analysis: Fast computation for estimation of multiple canonical vectors," Journal of Multivariate Analysis, Elsevier, vol. 185(C).
    5. Corona Francisco & Horrillo Juan de Dios Tena & Wiper Michael Peter, 2017. "On the importance of the probabilistic model in identifying the most decisive games in a tournament," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 13(1), pages 11-23, March.
    6. Jimmy Martin-Delgado & Aurora Mula & Rafael Manzanera & Jose Joaquin Mira, 2022. "Measuring the Impact of Future Outbreaks? A Secondary Analysis of Routinely Available Data in Spain," IJERPH, MDPI, vol. 19(21), pages 1-14, October.
    7. Dmitry Kobak & Yves Bernaerts & Marissa A. Weis & Federico Scala & Andreas S. Tolias & Philipp Berens, 2021. "Sparse reduced‐rank regression for exploratory visualisation of paired multivariate data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 70(4), pages 980-1000, August.
    8. Hongxing Li & Alasdair Cohen & Zheng Li & Mengjie Zhang, 2018. "The Impacts of Socioeconomic Development on Rural Drinking Water Safety in China: A Provincial-Level Comparative Analysis," Sustainability, MDPI, vol. 11(1), pages 1-12, December.
    9. Cruz-Cano, Raul & Lee, Mei-Ling Ting, 2014. "Fast regularized canonical correlation analysis," Computational Statistics & Data Analysis, Elsevier, vol. 70(C), pages 88-100.
    10. Alicja Grzeskowiak, 2016. "Satisfaction with chosen aspects of life in Poland - evaluation by canonical correlation methods," International Journal of Social Sciences, International Institute of Social and Economic Sciences, vol. 5(1), pages 60-71, February.
    11. Lykou, Anastasia & Whittaker, Joe, 2010. "Sparse CCA using a Lasso with positivity constraints," Computational Statistics & Data Analysis, Elsevier, vol. 54(12), pages 3144-3157, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. repec:jss:jstsof:23:i12 is not listed on IDEAS
    2. Fan, Jianqing & Jiang, Bai & Sun, Qiang, 2022. "Bayesian factor-adjusted sparse regression," Journal of Econometrics, Elsevier, vol. 230(1), pages 3-19.
    3. Takane, Yoshio & Yanai, Haruo & Hwang, Heungsun, 2006. "An improved method for generalized constrained canonical correlation analysis," Computational Statistics & Data Analysis, Elsevier, vol. 50(1), pages 221-241, January.
    4. Andrés García-Medina & Graciela González Farías, 2020. "Transfer entropy as a variable selection methodology of cryptocurrencies in the framework of a high dimensional predictive model," PLOS ONE, Public Library of Science, vol. 15(1), pages 1-31, January.
    5. Politis, Dimitris N, 2010. "Model-free Model-fitting and Predictive Distributions," University of California at San Diego, Economics Working Paper Series qt67j6s174, Department of Economics, UC San Diego.
    6. Elton Mammadov & Michael Denk & Frank Riedel & Cezary Kaźmierowski & Karolina Lewinska & Remigiusz Łukowiak & Witold Grzebisz & Amrakh I. Mamedov & Cornelia Glaesser, 2022. "Determination of Mehlich 3 Extractable Elements with Visible and Near Infrared Spectroscopy in a Mountainous Agricultural Land, the Caucasus Mountains," Land, MDPI, vol. 11(3), pages 1-24, March.
    7. Giacomo Crucil & Fabio Castaldi & Emilien Aldana-Jague & Bas van Wesemael & Andy Macdonald & Kristof Van Oost, 2019. "Assessing the Performance of UAS-Compatible Multispectral and Hyperspectral Sensors for Soil Organic Carbon Prediction," Sustainability, MDPI, vol. 11(7), pages 1-18, March.
    8. Michael Jansson & Demian Pouzo, 2017. "Towards a General Large Sample Theory for Regularized Estimators," Papers 1712.07248, arXiv.org, revised Jul 2020.
    9. Demian Pouzo, 2015. "On the Non-Asymptotic Properties of Regularized M-estimators," Papers 1512.06290, arXiv.org, revised Oct 2016.
    10. Firoozye, Nikan & Tan, Vincent & Zohren, Stefan, 2023. "Canonical portfolios: Optimal asset and signal combination," Journal of Banking & Finance, Elsevier, vol. 154(C).
    11. Hiroyuki Kawakatsu, 2022. "Modeling Realized Variance with Realized Quarticity," Stats, MDPI, vol. 5(3), pages 1-25, September.
    12. Bennett, Donyetta & Mekelburg, Erik & Strauss, Jack & Williams, T.H., 2024. "Unlocking the black box of sentiment and cryptocurrency: What, which, why, when and how?," Global Finance Journal, Elsevier, vol. 60(C).
    13. Alessandro Barbarino & Efstathia Bura, 2015. "Forecasting with Sufficient Dimension Reductions," Finance and Economics Discussion Series 2015-74, Board of Governors of the Federal Reserve System (U.S.).
    14. Nandana Sengupta & Fallaw Sowell, 2020. "On the Asymptotic Distribution of Ridge Regression Estimators Using Training and Test Samples," Econometrics, MDPI, vol. 8(4), pages 1-25, October.
    15. Ruggiero, John, 1998. "A new approach for technical efficiency estimation in multiple output production," European Journal of Operational Research, Elsevier, vol. 111(2), pages 369-380, December.
    16. Tenenhaus, Arthur & Philippe, Cathy & Frouin, Vincent, 2015. "Kernel Generalized Canonical Correlation Analysis," Computational Statistics & Data Analysis, Elsevier, vol. 90(C), pages 114-131.
    17. Lukáš Malec & Antonín Pavlícek & Jaroslav Poživil, 2014. "Studying Covariance and Variance Components in the Czech Regions Arrival Tourism Data," Acta Universitatis Danubius. OEconomica, Danubius University of Galati, issue 2(2), pages 109-128, April.
    18. Samuel Trachsel & Thanda Dhliwayo & Lorena Gonzalez Perez & Jose Alberto Mendoza Lugo & Mathias Trachsel, 2019. "Estimation of physiological genomic estimated breeding values (PGEBV) combining full hyperspectral and marker data across environments for grain yield under combined heat and drought stress in tropica," PLOS ONE, Public Library of Science, vol. 14(3), pages 1-15, March.
    19. Tomasz Rymarczyk & Krzysztof Król & Edward Kozłowski & Tomasz Wołowiec & Marta Cholewa-Wiktor & Piotr Bednarczuk, 2021. "Application of Electrical Tomography Imaging Using Machine Learning Methods for the Monitoring of Flood Embankments Leaks," Energies, MDPI, vol. 14(23), pages 1-35, December.
    20. Yoshio Takane & Heungsun Hwang & Hervé Abdi, 2008. "Regularized Multiple-Set Canonical Correlation Analysis," Psychometrika, Springer;The Psychometric Society, vol. 73(4), pages 753-775, December.
    21. Natallia Pashkevich & Darek Haftor & Mikael Karlsson & Soumitra Chowdhury, 2019. "Sustainability through the Digitalization of Industrial Machines: Complementary Factors of Fuel Consumption and Productivity for Forklifts with Sensors," Sustainability, MDPI, vol. 11(23), pages 1-21, November.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:jss:jstsof:v:023:i12. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Christopher F. Baum (email available below). General contact details of provider: http://www.jstatsoft.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.