IDEAS home Printed from https://ideas.repec.org/a/taf/japsta/v52y2025i2p356-380.html

Accurate identification of single-cell types via correntropy-based Sparse PCA combining hypergraph and fusion similarity

Author

Listed:
  • Juan Wang
  • Tai-Ge Wang
  • Shasha Yuan
  • Feng Li

Abstract

The advent of single-cell RNA sequencing (scRNA-seq) technology enables researchers to gain deep insights into cellular heterogeneity. However, the high dimensionality and noise of scRNA-seq data pose significant challenges to clustering. To address these challenges, we propose a new single-cell type identification method called CHLSPCA. In this model, we combine correntropy with PCA to handle the noise and outliers inherent in scRNA-seq data. We also integrate a hypergraph into the model to extract more valuable information from the local structure of the original data. To capture crucial similarity information not considered by the PCA model, we use the Gaussian kernel function and the Euclidean metric to mine the similarity between cells, and incorporate it into the model as a similarity constraint. Furthermore, because the principal components (PCs) of PCA are very dense, a new sparse constraint is introduced into the model to obtain sparse PCs. Finally, based on the principal direction matrix learned by CHLSPCA, we conduct extensive downstream analyses on real scRNA-seq datasets. The experimental results show that CHLSPCA performs better than many popular clustering methods and is expected to promote the understanding of cellular heterogeneity in scRNA-seq data analysis and support biomedical research.
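
As a rough illustration of three ingredients the abstract names, the Python sketch below shows a Gaussian-kernel similarity computed from Euclidean distances between cells, a correntropy-style weight that down-weights large residuals, and a sparsity-inducing shrinkage step. It is not the CHLSPCA objective or algorithm, which this listing does not give. The function names, the bandwidth `sigma`, the threshold `lam`, and the toy data are all assumptions made for illustration, and soft thresholding stands in for the paper's unspecified sparse constraint.

```python
import math

def euclidean(u, v):
    """Euclidean distance between two equal-length expression vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

def gaussian_similarity(cells, sigma=1.0):
    """Pairwise Gaussian-kernel similarity built on Euclidean distances."""
    n = len(cells)
    return [[math.exp(-euclidean(cells[i], cells[j]) ** 2 / (2 * sigma ** 2))
             for j in range(n)] for i in range(n)]

def correntropy_weight(residual, sigma=1.0):
    """Gaussian-kernel weight: 1 for a zero residual, near 0 for outliers."""
    return math.exp(-residual ** 2 / (2 * sigma ** 2))

def soft_threshold(x, lam):
    """Shrink x toward zero by lam (a generic route to sparse loadings;
    assumed here, not taken from the paper)."""
    return math.copysign(max(abs(x) - lam, 0.0), x)

# Toy data (made up): three "cells", two "genes" each.
cells = [[0.0, 0.0], [0.0, 1.0], [3.0, 4.0]]
S = gaussian_similarity(cells, sigma=1.0)

# Nearby cells get similarity close to 1; distant cells get close to 0.
print(S[0][1], S[0][2])
# A large reconstruction residual receives a much smaller weight.
print(correntropy_weight(0.5), correntropy_weight(5.0))
# Small loadings are zeroed out, large ones are shrunk.
print([soft_threshold(x, 0.5) for x in (0.3, -2.0, 1.2)])
```

The Gaussian kernel appears twice by design: once as a cell-to-cell similarity measure, and once as the bounded weighting that makes correntropy-based losses less sensitive to outliers than squared error.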

Suggested Citation

  • Juan Wang & Tai-Ge Wang & Shasha Yuan & Feng Li, 2025. "Accurate identification of single-cell types via correntropy-based Sparse PCA combining hypergraph and fusion similarity," Journal of Applied Statistics, Taylor & Francis Journals, vol. 52(2), pages 356-380, January.
  • Handle: RePEc:taf:japsta:v:52:y:2025:i:2:p:356-380
    DOI: 10.1080/02664763.2024.2369955

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1080/02664763.2024.2369955
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1080/02664763.2024.2369955?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item

