IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0038650.html
   My bibliography  Save this article

Intra- and Inter-Individual Variance of Gene Expression in Clinical Studies

Author

Listed:
  • Wei-Chung Cheng
  • Wun-Yi Shu
  • Chia-Yang Li
  • Min-Lung Tsai
  • Cheng-Wei Chang
  • Chaang-Ray Chen
  • Hung-Tsu Cheng
  • Tzu-Hao Wang
  • Ian C Hsu

Abstract

Background: Variance in microarray studies has been widely discussed as a critical topic on the identification of differentially expressed genes; however, few studies have addressed the influence of estimating variance. Methodology/Principal Findings: To break intra- and inter-individual variance in clinical studies down to three levels–technical, anatomic, and individual–we designed experiments and algorithms to investigate three forms of variances. As a case study, a group of “inter-individual variable genes” were identified to exemplify the influence of underestimated variance on the statistical and biological aspects in identification of differentially expressed genes. Our results showed that inadequate estimation of variance inevitably led to the inclusion of non-statistically significant genes into those listed as significant, thereby interfering with the correct prediction of biological functions. Applying a higher cutoff value of fold changes in the selection of significant genes reduces/eliminates the effects of underestimated variance. Conclusions/Significance: Our data demonstrated that correct variance evaluation is critical in selecting significant genes. If the degree of variance is underestimated, “noisy” genes are falsely identified as differentially expressed genes. These genes are the noise associated with biological interpretation, reducing the biological significance of the gene set. Our results also indicate that applying a higher number of fold change as the selection criteria reduces/eliminates the differences between distinct estimations of variance.

Suggested Citation

  • Wei-Chung Cheng & Wun-Yi Shu & Chia-Yang Li & Min-Lung Tsai & Cheng-Wei Chang & Chaang-Ray Chen & Hung-Tsu Cheng & Tzu-Hao Wang & Ian C Hsu, 2012. "Intra- and Inter-Individual Variance of Gene Expression in Clinical Studies," PLOS ONE, Public Library of Science, vol. 7(6), pages 1-8, June.
  • Handle: RePEc:plo:pone00:0038650
    DOI: 10.1371/journal.pone.0038650
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0038650
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0038650&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0038650?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Eric E. Schadt & Stephanie A. Monks & Thomas A. Drake & Aldons J. Lusis & Nam Che & Veronica Colinayo & Thomas G. Ruff & Stephen B. Milligan & John R. Lamb & Guy Cavet & Peter S. Linsley & Mao Mao & R, 2003. "Genetics of gene expression surveyed in maize, mouse and man," Nature, Nature, vol. 422(6929), pages 297-302, March.
    2. Ash A. Alizadeh & Michael B. Eisen & R. Eric Davis & Chi Ma & Izidore S. Lossos & Andreas Rosenwald & Jennifer C. Boldrick & Hajeer Sabet & Truc Tran & Xin Yu & John I. Powell & Liming Yang & Gerald E, 2000. "Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling," Nature, Nature, vol. 403(6769), pages 503-511, February.
    3. M. Kathleen Kerr, 2003. "Design Considerations for Efficient and Effective Microarray Studies," Biometrics, The International Biometric Society, vol. 59(4), pages 822-828, December.
    4. Laura J. van 't Veer & Hongyue Dai & Marc J. van de Vijver & Yudong D. He & Augustinus A. M. Hart & Mao Mao & Hans L. Peterse & Karin van der Kooy & Matthew J. Marton & Anke T. Witteveen & George J. S, 2002. "Gene expression profiling predicts clinical outcome of breast cancer," Nature, Nature, vol. 415(6871), pages 530-536, January.
    5. Marine Jeanmougin & Aurelien de Reynies & Laetitia Marisa & Caroline Paccard & Gregory Nuel & Mickael Guedj, 2010. "Should We Abandon the t-Test in the Analysis of Gene Expression Microarray Data: A Comparison of Variance Modeling Strategies," PLOS ONE, Public Library of Science, vol. 5(9), pages 1-9, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. van Wieringen, Wessel N. & Kun, David & Hampel, Regina & Boulesteix, Anne-Laure, 2009. "Survival prediction using gene expression data: A review and comparison," Computational Statistics & Data Analysis, Elsevier, vol. 53(5), pages 1590-1603, March.
    2. Dettling, Marcel & Bühlmann, Peter, 2004. "Finding predictive gene groups from microarray data," Journal of Multivariate Analysis, Elsevier, vol. 90(1), pages 106-131, July.
    3. Sewell, Daniel K., 2018. "Visualizing data through curvilinear representations of matrices," Computational Statistics & Data Analysis, Elsevier, vol. 128(C), pages 255-270.
    4. Tibshirani Robert J., 2009. "Univariate Shrinkage in the Cox Model for High Dimensional Data," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 8(1), pages 1-20, April.
    5. Jing Zhang & Qihua Wang & Xuan Wang, 2022. "Surrogate-variable-based model-free feature screening for survival data under the general censoring mechanism," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 74(2), pages 379-397, April.
    6. M. Moghadam & K. Aminian & M. Asghari & M. Parnianpour, 2013. "How well do the muscular synergies extracted via non-negative matrix factorisation explain the variation of torque at shoulder joint?," Computer Methods in Biomechanics and Biomedical Engineering, Taylor & Francis Journals, vol. 16(3), pages 291-301.
    7. Gaorong Li & Liugen Xue & Heng Lian, 2012. "SCAD-penalised generalised additive models with non-polynomial dimensionality," Journal of Nonparametric Statistics, Taylor & Francis Journals, vol. 24(3), pages 681-697.
    8. Zemin Zheng & Jie Zhang & Yang Li, 2022. "L 0 -Regularized Learning for High-Dimensional Additive Hazards Regression," INFORMS Journal on Computing, INFORMS, vol. 34(5), pages 2762-2775, September.
    9. Prendergast, Luke A. & Li Wai Suen, Connie, 2011. "A new and practical influence measure for subsets of covariance matrix sample principal components with applications to high dimensional datasets," Computational Statistics & Data Analysis, Elsevier, vol. 55(1), pages 752-764, January.
    10. Sijia Huang & Cameron Yee & Travers Ching & Herbert Yu & Lana X Garmire, 2014. "A Novel Model to Combine Clinical and Pathway-Based Transcriptomic Information for the Prognosis Prediction of Breast Cancer," PLOS Computational Biology, Public Library of Science, vol. 10(9), pages 1-15, September.
    11. Kerr Kathleen F., 2012. "Optimality Criteria for the Design of 2-Color Microarray Studies," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(1), pages 1-9, January.
    12. Lian, Heng & Du, Pang & Li, YuanZhang & Liang, Hua, 2014. "Partially linear structure identification in generalized additive models with NP-dimensionality," Computational Statistics & Data Analysis, Elsevier, vol. 80(C), pages 197-208.
    13. Jan, Budczies & Kosztyla, Daniel & von Törne, Christian & Stenzinger, Albrecht & Darb-Esfahani, Silvia & Dietel, Manfred & Denkert, Carsten, 2014. "cancerclass: An R Package for Development and Validation of Diagnostic Tests from High-Dimensional Molecular Data," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 59(i01).
    14. Apostolos Zaravinos & George I Lambrou & Ioannis Boulalas & Dimitris Delakas & Demetrios A Spandidos, 2011. "Identification of Common Differentially Expressed Genes in Urinary Bladder Cancer," PLOS ONE, Public Library of Science, vol. 6(4), pages 1-28, April.
    15. Zhaoliang Wang & Liugen Xue & Gaorong Li & Fei Lu, 2019. "Spline estimator for ultra-high dimensional partially linear varying coefficient models," Annals of the Institute of Statistical Mathematics, Springer;The Institute of Statistical Mathematics, vol. 71(3), pages 657-677, June.
    16. Jun Yao & Qi Zhao & Ying Yuan & Li Zhang & Xiaoming Liu & W K Alfred Yung & John N Weinstein, 2012. "Identification of Common Prognostic Gene Expression Signatures with Biological Meanings from Microarray Gene Expression Datasets," PLOS ONE, Public Library of Science, vol. 7(9), pages 1-11, September.
    17. Roy Navon & Hui Wang & Israel Steinfeld & Anya Tsalenko & Amir Ben-Dor & Zohar Yakhini, 2009. "Novel Rank-Based Statistical Methods Reveal MicroRNAs with Differential Expression in Multiple Cancer Types," PLOS ONE, Public Library of Science, vol. 4(11), pages 1-10, November.
    18. Bo Jiang & Jun S. Liu, 2015. "Bayesian Partition Models for Identifying Expression Quantitative Trait Loci," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(512), pages 1350-1361, December.
    19. Lian, I.B. & Chang, C.J. & Liang, Y.J. & Yang, M.J. & Fann, C.S.J., 2007. "Identifying differentially expressed genes in dye-swapped microarray experiments of small sample size," Computational Statistics & Data Analysis, Elsevier, vol. 51(5), pages 2602-2620, February.
    20. Landgrebe, Jobst & Bretz, Frank & Brunner, Edgar, 2006. "Efficient design and analysis of two colour factorial microarray experiments," Computational Statistics & Data Analysis, Elsevier, vol. 50(2), pages 499-517, January.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0038650. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.