Printed from https://ideas.repec.org/a/plo/pone00/0104314.html

Learning a Weighted Meta-Sample Based Parameter Free Sparse Representation Classification for Microarray Data

Author

Listed:
  • Bo Liao
  • Yan Jiang
  • Guanqun Yuan
  • Wen Zhu
  • Lijun Cai
  • Zhi Cao

Abstract

Sparse representation classification (SRC) is one of the most promising classification methods for supervised learning. This method can effectively exploit discriminating information by introducing a regularization term to the data. With the desirable property of sparsity, SRC is robust to both noise and outliers. In this study, we propose a weighted meta-sample based non-parametric sparse representation classification method for the accurate identification of tumor subtypes. The proposed method includes three steps. First, we extract the weighted meta-samples for each subclass from raw data, and the rationality of the weighting strategy is proven mathematically. Second, sparse representation coefficients are obtained by regularization of underdetermined linear equations, so that data-dependent sparsity can be tuned adaptively. Third, a simple characteristic function is used to perform classification. An asymptotic time complexity analysis of the method is provided. Compared with some state-of-the-art classifiers, the proposed method has lower time complexity and more flexibility. Experiments on eight samples of publicly available gene expression profile data show the effectiveness of the proposed method.
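
The three steps named in the abstract can be illustrated with a minimal, generic sketch of meta-sample based SRC. It is not the authors' algorithm: the abstract does not give the weighting strategy or the parameter-free regularization, so several stand-ins are assumed here. Each class contributes a single meta-sample computed as a weighted mean (the inverse-distance weights are a hypothetical choice), the coefficients come from plain ISTA with a fixed penalty `lam` (so this sketch is not parameter free), and the class is chosen by smallest reconstruction residual. All function names are illustrative.

```python
import math


def soft_threshold(v, t):
    """Shrink v toward zero by t (proximal operator of the l1 norm)."""
    return math.copysign(max(abs(v) - t, 0.0), v)


def class_meta_sample(samples):
    """Step 1 (assumed form): weighted mean of one class's samples.

    Weights are inverse distance to the class centroid, a hypothetical
    stand-in for the weighting strategy described in the paper.
    """
    n, d = len(samples), len(samples[0])
    centroid = [sum(s[j] for s in samples) / n for j in range(d)]
    weights = [1.0 / (1.0 + math.dist(s, centroid)) for s in samples]
    total = sum(weights)
    return [sum(w * s[j] for w, s in zip(weights, samples)) / total
            for j in range(d)]


def ista(atoms, y, lam=0.01, iters=500):
    """Step 2 (assumed solver): minimize 0.5*||y - Dx||^2 + lam*||x||_1.

    `atoms` is a list of dictionary columns. The step size uses the squared
    Frobenius norm of D, an upper bound on the Lipschitz constant.
    Assumes at least one atom is non-zero.
    """
    k, d = len(atoms), len(y)
    step = 1.0 / sum(v * v for a in atoms for v in a)
    x = [0.0] * k
    for _ in range(iters):
        recon = [sum(x[i] * atoms[i][j] for i in range(k)) for j in range(d)]
        resid = [y[j] - recon[j] for j in range(d)]
        grad = [-sum(atoms[i][j] * resid[j] for j in range(d))
                for i in range(k)]
        x = [soft_threshold(x[i] - step * grad[i], step * lam)
             for i in range(k)]
    return x


def classify(train, y, lam=0.01):
    """Step 3: assign y to the class whose atom reconstructs it best."""
    labels = sorted(train)
    atoms = [class_meta_sample(train[c]) for c in labels]
    x = ista(atoms, y, lam)
    residuals = {
        c: math.dist(y, [x[i] * v for v in atoms[i]])
        for i, c in enumerate(labels)
    }
    return min(residuals, key=residuals.get)


# Toy data (made up for illustration, not gene expression measurements).
train = {
    "A": [[1.0, 0.1, 0.0], [0.9, 0.0, 0.1]],
    "B": [[0.0, 0.1, 1.0], [0.1, 0.0, 0.9]],
}
print(classify(train, [1.0, 0.05, 0.05]))
```

In practice a meta-sample dictionary usually holds several atoms per class, extracted for example by a matrix factorization; a single weighted mean per class is used here only to keep the sketch short.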

Suggested Citation

  • Bo Liao & Yan Jiang & Guanqun Yuan & Wen Zhu & Lijun Cai & Zhi Cao, 2014. "Learning a Weighted Meta-Sample Based Parameter Free Sparse Representation Classification for Microarray Data," PLOS ONE, Public Library of Science, vol. 9(8), pages 1-12, August.
  • Handle: RePEc:plo:pone00:0104314
    DOI: 10.1371/journal.pone.0104314

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0104314
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0104314&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0104314?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item

    References listed on IDEAS

    1. Ash A. Alizadeh & Michael B. Eisen & R. Eric Davis & Chi Ma & Izidore S. Lossos & Andreas Rosenwald & Jennifer C. Boldrick & Hajeer Sabet & Truc Tran & Xin Yu & John I. Powell & Liming Yang & Gerald E, 2000. "Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling," Nature, Nature, vol. 403(6769), pages 503-511, February.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sewell, Daniel K., 2018. "Visualizing data through curvilinear representations of matrices," Computational Statistics & Data Analysis, Elsevier, vol. 128(C), pages 255-270.
    2. M. Moghadam & K. Aminian & M. Asghari & M. Parnianpour, 2013. "How well do the muscular synergies extracted via non-negative matrix factorisation explain the variation of torque at shoulder joint?," Computer Methods in Biomechanics and Biomedical Engineering, Taylor & Francis Journals, vol. 16(3), pages 291-301.
    3. Prendergast, Luke A. & Li Wai Suen, Connie, 2011. "A new and practical influence measure for subsets of covariance matrix sample principal components with applications to high dimensional datasets," Computational Statistics & Data Analysis, Elsevier, vol. 55(1), pages 752-764, January.
    4. Apostolos Zaravinos & George I Lambrou & Ioannis Boulalas & Dimitris Delakas & Demetrios A Spandidos, 2011. "Identification of Common Differentially Expressed Genes in Urinary Bladder Cancer," PLOS ONE, Public Library of Science, vol. 6(4), pages 1-28, April.
    5. Roy Navon & Hui Wang & Israel Steinfeld & Anya Tsalenko & Amir Ben-Dor & Zohar Yakhini, 2009. "Novel Rank-Based Statistical Methods Reveal MicroRNAs with Differential Expression in Multiple Cancer Types," PLOS ONE, Public Library of Science, vol. 4(11), pages 1-10, November.
    6. Frantisek Honti & Stephen Meader & Caleb Webber, 2014. "Unbiased Functional Clustering of Gene Variants with a Phenotypic-Linkage Network," PLOS Computational Biology, Public Library of Science, vol. 10(8), pages 1-7, August.
    7. Sophia S Wang & Mark P Purdue & James R Cerhan & Tongzhang Zheng & Idan Menashe & Bruce K Armstrong & Qing Lan & Patricia Hartge & Anne Kricker & Yawei Zhang & Lindsay M Morton & Claire M Vajdic & The, 2009. "Common Gene Variants in the Tumor Necrosis Factor (TNF) and TNF Receptor Superfamilies and NF-kB Transcription Factors and Non-Hodgkin Lymphoma Risk," PLOS ONE, Public Library of Science, vol. 4(4), pages 1-9, April.
    8. Salas-Gonzalez, Diego & Kuruoglu, Ercan E. & Ruiz, Diego P., 2009. "A heavy-tailed empirical Bayes method for replicated microarray data," Computational Statistics & Data Analysis, Elsevier, vol. 53(5), pages 1535-1546, March.
    9. Ai, Chunrong & You, Jinhong & Zhou, Yong, 2011. "Statistical inference using a weighted difference-based series approach for partially linear regression models," Journal of Multivariate Analysis, Elsevier, vol. 102(3), pages 601-618, March.
    10. Nilsen Gro & Borgan Ørnulf & LiestØl Knut & Lingjærde Ole Christian, 2013. "Identifying clusters in genomics data by recursive partitioning," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 12(5), pages 637-652, October.
    11. You, Jinhong & Zhou, Haibo, 2008. "A two-stage approach to semilinear in-slide models," Journal of Multivariate Analysis, Elsevier, vol. 99(8), pages 1610-1634, September.
    12. Juan C. Laria & M. Carmen Aguilera-Morillo & Rosa E. Lillo, 2023. "Group linear algorithm with sparse principal decomposition: a variable selection and clustering method for generalized linear models," Statistical Papers, Springer, vol. 64(1), pages 227-253, February.
    13. Wei-Chung Cheng & Wun-Yi Shu & Chia-Yang Li & Min-Lung Tsai & Cheng-Wei Chang & Chaang-Ray Chen & Hung-Tsu Cheng & Tzu-Hao Wang & Ian C Hsu, 2012. "Intra- and Inter-Individual Variance of Gene Expression in Clinical Studies," PLOS ONE, Public Library of Science, vol. 7(6), pages 1-8, June.
    14. Stella Amanda & Tze King Tan & Jolynn Zu Lin Ong & Madelaine Skolastika Theardy & Regina Wan Ju Wong & Xiao Zi Huang & Muhammad Zulfaqar Ali & Yan Li & Zhiyuan Gong & Hiroshi Inagaki & Ee Yong Foo & B, 2022. "IRF4 drives clonal evolution and lineage choice in a zebrafish model of T-cell lymphoma," Nature Communications, Nature, vol. 13(1), pages 1-16, December.
    15. Maureen Stone & Xiaofeng Liu & Hegang Chen & Jerry L. Prince, 2010. "A preliminary application of principal components and cluster analysis to internal tongue deformation patterns," Computer Methods in Biomechanics and Biomedical Engineering, Taylor & Francis Journals, vol. 13(4), pages 493-503.
    16. van Wieringen, Wessel N. & Kun, David & Hampel, Regina & Boulesteix, Anne-Laure, 2009. "Survival prediction using gene expression data: A review and comparison," Computational Statistics & Data Analysis, Elsevier, vol. 53(5), pages 1590-1603, March.
    17. Min Liu & Giorgio Bertolazzi & Shruti Sridhar & Rui Xue Lee & Patrick Jaynes & Kevin Mulder & Nicholas Syn & Michal Marek Hoppe & Shuangyi Fan & Yanfen Peng & Jocelyn Thng & Reiya Chua & Jayalakshmi &, 2024. "Spatially-resolved transcriptomics reveal macrophage heterogeneity and prognostic significance in diffuse large B-cell lymphoma," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    18. Dettling, Marcel & Bühlmann, Peter, 2004. "Finding predictive gene groups from microarray data," Journal of Multivariate Analysis, Elsevier, vol. 90(1), pages 106-131, July.
    19. Laura Anderlucci & Francesca Fortunato & Angela Montanari, 2022. "High-Dimensional Clustering via Random Projections," Journal of Classification, Springer;The Classification Society, vol. 39(1), pages 191-216, March.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.