IDEAS home Printed from https://ideas.repec.org/a/bla/biomet/v79y2023i4p2933-2946.html
   My bibliography  Save this article

Hierarchical nuclear norm penalization for multi‐view data integration

Author

Listed:
  • Sangyoon Yi
  • Raymond Ka Wai Wong
  • Irina Gaynanova

Abstract

The prevalence of data collected on the same set of samples from multiple sources (i.e., multi‐view data) has prompted significant development of data integration methods based on low‐rank matrix factorizations. These methods decompose signal matrices from each view into the sum of shared and individual structures, which are further used for dimension reduction, exploratory analyses, and quantifying associations across views. However, existing methods have limitations in modeling partially‐shared structures due to either too restrictive models, or restrictive identifiability conditions. To address these challenges, we propose a new formulation for signal structures that include partially‐shared signals based on grouping the views into so‐called hierarchical levels with identifiable guarantees under suitable conditions. The proposed hierarchy leads us to introduce a new penalty, hierarchical nuclear norm (HNN), for signal estimation. In contrast to existing methods, HNN penalization avoids scores and loadings factorization of the signals and leads to a convex optimization problem, which we solve using a dual forward–backward algorithm. We propose a simple refitting procedure to adjust the penalization bias and develop an adapted version of bi‐cross‐validation for selecting tuning parameters. Extensive simulation studies and analysis of the genotype‐tissue expression data demonstrate the advantages of our method over existing alternatives.

Suggested Citation

  • Sangyoon Yi & Raymond Ka Wai Wong & Irina Gaynanova, 2023. "Hierarchical nuclear norm penalization for multi‐view data integration," Biometrics, The International Biometric Society, vol. 79(4), pages 2933-2946, December.
  • Handle: RePEc:bla:biomet:v:79:y:2023:i:4:p:2933-2946
    DOI: 10.1111/biom.13893
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/biom.13893
    Download Restriction: no

    File URL: https://libkey.io/10.1111/biom.13893?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Gen Li & Sungkyu Jung, 2017. "Incorporating covariates into integrated factor analysis of multi‐view data," Biometrics, The International Biometric Society, vol. 73(4), pages 1433-1442, December.
    2. Irina Gaynanova & Gen Li, 2019. "Structural learning and integrative decomposition of multi‐view data," Biometrics, The International Biometric Society, vol. 75(4), pages 1121-1132, December.
    3. Jun Young Park & Eric F. Lock, 2020. "Integrative factorization of bidimensionally linked matrices," Biometrics, The International Biometric Society, vol. 76(1), pages 61-74, March.
    4. Kun Chen & Hongbo Dong & Kung-Sik Chan, 2013. "Reduced rank regression via adaptive nuclear norm penalization," Biometrika, Biometrika Trust, vol. 100(4), pages 901-920.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xing Gao & Sungwon Lee & Gen Li & Sungkyu Jung, 2021. "Covariate‐driven factorization by thresholding for multiblock data," Biometrics, The International Biometric Society, vol. 77(3), pages 1011-1023, September.
    2. Palzer, Elise F. & Wendt, Christine H. & Bowler, Russell P. & Hersh, Craig P. & Safo, Sandra E. & Lock, Eric F., 2022. "sJIVE: Supervised joint and individual variation explained," Computational Statistics & Data Analysis, Elsevier, vol. 175(C).
    3. Chen, Canyi & Xu, Wangli & Zhu, Liping, 2022. "Distributed estimation in heterogeneous reduced rank regression: With application to order determination in sufficient dimension reduction," Journal of Multivariate Analysis, Elsevier, vol. 190(C).
    4. S. Yaser Samadi & Wiranthe B. Herath, 2023. "Reduced-rank Envelope Vector Autoregressive Models," Papers 2309.12902, arXiv.org.
    5. Zhao, Weihua & Jiang, Xuejun & Lian, Heng, 2018. "A principal varying-coefficient model for quantile regression: Joint variable selection and dimension reduction," Computational Statistics & Data Analysis, Elsevier, vol. 127(C), pages 269-280.
    6. Guo, Wenxing & Balakrishnan, Narayanaswamy & He, Mu, 2023. "Envelope-based sparse reduced-rank regression for multivariate linear model," Journal of Multivariate Analysis, Elsevier, vol. 195(C).
    7. Luo, Chongliang & Liang, Jian & Li, Gen & Wang, Fei & Zhang, Changshui & Dey, Dipak K. & Chen, Kun, 2018. "Leveraging mixed and incomplete outcomes via reduced-rank modeling," Journal of Multivariate Analysis, Elsevier, vol. 167(C), pages 378-394.
    8. Feng, Sanying & Lian, Heng & Zhu, Fukang, 2016. "Reduced rank regression with possibly non-smooth criterion functions: An empirical likelihood approach," Computational Statistics & Data Analysis, Elsevier, vol. 103(C), pages 139-150.
    9. Ahelegbey, Daniel Felix, 2015. "The Econometrics of Bayesian Graphical Models: A Review With Financial Application," MPRA Paper 92634, University Library of Munich, Germany, revised 25 Apr 2016.
    10. Mishra, Aditya & Dey, Dipak K. & Chen, Yong & Chen, Kun, 2021. "Generalized co-sparse factor regression," Computational Statistics & Data Analysis, Elsevier, vol. 157(C).
    11. Samorodnitsky, Sarah & Wendt, Chris H. & Lock, Eric F., 2024. "Bayesian simultaneous factorization and prediction using multi-omic data," Computational Statistics & Data Analysis, Elsevier, vol. 197(C).
    12. Jiang, Zhenzhen & Guo, Hongping & Wang, Jinjuan, 2023. "Feature screening for multiple responses," Journal of Multivariate Analysis, Elsevier, vol. 198(C).
    13. Yang, Yaohong & Zhao, Weihua & Wang, Lei, 2023. "Online regularized matrix regression with streaming data," Computational Statistics & Data Analysis, Elsevier, vol. 187(C).
    14. Goh, Gyuhyeong & Dey, Dipak K. & Chen, Kun, 2017. "Bayesian sparse reduced rank multivariate regression," Journal of Multivariate Analysis, Elsevier, vol. 157(C), pages 14-28.
    15. Wei Hu & Tianyu Pan & Dehan Kong & Weining Shen, 2021. "Nonparametric matrix response regression with application to brain imaging data analysis," Biometrics, The International Biometric Society, vol. 77(4), pages 1227-1240, December.
    16. Kohei Yoshikawa & Shuichi Kawano, 2023. "Sparse reduced-rank regression for simultaneous rank and variable selection via manifold optimization," Computational Statistics, Springer, vol. 38(1), pages 53-75, March.
    17. Yang, Yuehan & Xia, Siwei & Yang, Hu, 2023. "Multivariate sparse Laplacian shrinkage for joint estimation of two graphical structures," Computational Statistics & Data Analysis, Elsevier, vol. 178(C).
    18. Kun Chen & Kung-Sik Chan & Nils Chr. Stenseth, 2014. "Source-Sink Reconstruction Through Regularized Multicomponent Regression Analysis-With Application to Assessing Whether North Sea Cod Larvae Contributed to Local Fjord Cod in Skagerrak," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(506), pages 560-573, June.
    19. Fan, Jianqing & Gong, Wenyan & Zhu, Ziwei, 2019. "Generalized high-dimensional trace regression via nuclear norm regularization," Journal of Econometrics, Elsevier, vol. 212(1), pages 177-202.
    20. Lian, Heng & Kim, Yongdai, 2016. "Nonconvex penalized reduced rank regression and its oracle properties in high dimensions," Journal of Multivariate Analysis, Elsevier, vol. 143(C), pages 383-393.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:biomet:v:79:y:2023:i:4:p:2933-2946. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0006-341X .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.