
Establishing Informative Prior for Gene Expression Variance from Public Databases

Author

Listed:
  • Nan Li

    (Brown University)

  • Matthew N. McCall

    (University of Rochester)

  • Zhijin Wu

    (Brown University)

Abstract

Identifying differentially expressed genes across various conditions or genotypes is the most typical approach to studying the regulation of gene expression. Most differential expression (DE) detection methods need an estimate of gene-specific variance to assess statistical significance, including linear models (e.g., for transformed and normalized microarray data) and generalized linear models (e.g., for count data in RNA-seq). Because sample sizes are commonly limited, the variance estimate is often unstable in small experiments. Shrinkage estimates using empirical Bayes methods have proven useful in improving the variance estimate, and hence the detection of DE. The most widely used empirical Bayes methods borrow information across genes within the same experiment. In these methods, genes are considered exchangeable, or exchangeable conditional on expression level. We propose that, with the increasing accumulation of expression data, borrowing information from historical data on the same gene can provide a better estimate of gene-specific variance and thus further improve DE detection. Specifically, we show that the variation of gene expression is truly gene-specific and reproducible between different experiments. We present a new method to establish an informative gene-specific prior on the variance of expression using existing public data, and illustrate how to shrink the variance estimate and detect DE. We demonstrate improvement in DE detection under our strategy compared to leading DE detection methods.
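
The abstract does not give the estimator itself, so the following is only a minimal sketch of the general idea of shrinking a sample variance toward a gene-specific prior. It assumes the standard conjugate form used in limma-style empirical Bayes (a degrees-of-freedom weighted average of the prior variance and the sample variance), which may differ from the authors' actual method. The function names `shrink_variance` and `moderated_t`, and all numeric values, are hypothetical and chosen for illustration.

```python
from math import sqrt
from statistics import mean, variance


def shrink_variance(sample_var, df, prior_var, prior_df):
    """Shrink a sample variance toward a prior variance.

    Assumed form (standard conjugate shrinkage, not taken from the paper):
    a weighted average with weights given by the degrees of freedom.
    """
    return (prior_df * prior_var + df * sample_var) / (prior_df + df)


def moderated_t(group_a, group_b, prior_var, prior_df):
    """Two-group t-statistic using the shrunken variance.

    prior_var and prior_df stand in for a gene-specific prior that would,
    under the paper's proposal, be derived from historical public data.
    Returns the statistic and its total degrees of freedom.
    """
    n_a, n_b = len(group_a), len(group_b)
    df = n_a + n_b - 2
    # Pooled within-group sample variance for this gene.
    pooled = ((n_a - 1) * variance(group_a) + (n_b - 1) * variance(group_b)) / df
    s2 = shrink_variance(pooled, df, prior_var, prior_df)
    t = (mean(group_a) - mean(group_b)) / sqrt(s2 * (1 / n_a + 1 / n_b))
    return t, df + prior_df


# Hypothetical log-expression values for one gene in two conditions.
t_stat, total_df = moderated_t([1.0, 2.0, 3.0], [4.0, 5.0, 6.0],
                               prior_var=1.0, prior_df=4)
```

With only a few replicates the sample variance carries few degrees of freedom, so in this sketch a prior with larger `prior_df` dominates the shrunken estimate; how much weight historical data should receive is exactly the kind of choice the paper addresses.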

Suggested Citation

  • Nan Li & Matthew N. McCall & Zhijin Wu, 2017. "Establishing Informative Prior for Gene Expression Variance from Public Databases," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 9(1), pages 160-177, June.
  • Handle: RePEc:spr:stabio:v:9:y:2017:i:1:d:10.1007_s12561-016-9172-x
    DOI: 10.1007/s12561-016-9172-x

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s12561-016-9172-x
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.


    References listed on IDEAS

    1. Smyth Gordon K, 2004. "Linear Models and Empirical Bayes Methods for Assessing Differential Expression in Microarray Experiments," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 3(1), pages 1-28, February.
    2. Zhijin Wu & Rafael Irizarry & Robert Gentleman & Francisco Martinez Murillo & Forrest Spencer, 2004. "A Model Based Background Adjustment for Oligonucleotide Expression Arrays," Johns Hopkins University Dept. of Biostatistics Working Paper Series 1001, Berkeley Electronic Press.
    3. Zhijin Wu & Rafael A. Irizarry & Robert Gentleman & Francisco Martinez-Murillo & Forrest Spencer, 2004. "A Model-Based Background Adjustment for Oligonucleotide Expression Arrays," Journal of the American Statistical Association, American Statistical Association, vol. 99, pages 909-917, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Marot Guillemette & Mayer Claus-Dieter, 2009. "Sequential Analysis for Microarray Data Based on Sensitivity and Meta-Analysis," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 8(1), pages 1-35, January.
    2. Rinku Sharma & Garima Singh & Sudeepto Bhattacharya & Ashutosh Singh, 2018. "Comparative transcriptome meta-analysis of Arabidopsis thaliana under drought and cold stress," PLOS ONE, Public Library of Science, vol. 13(9), pages 1-18, September.
    3. Jin-Xing Liu & Yong Xu & Chun-Hou Zheng & Yi Wang & Jing-Yu Yang, 2012. "Characteristic Gene Selection via Weighting Principal Components by Singular Values," PLOS ONE, Public Library of Science, vol. 7(7), pages 1-10, July.
    4. Sigrun Helga Lund & Daniel Fannar Gudbjartsson & Thorunn Rafnar & Asgeir Sigurdsson & Sigurjon Axel Gudjonsson & Julius Gudmundsson & Kari Stefansson & Gunnar Stefansson, 2014. "A Method for Detecting Long Non-Coding RNAs with Tiled RNA Expression Microarrays," PLOS ONE, Public Library of Science, vol. 9(6), pages 1-9, June.
    5. Krishanpal Anamika & Àkos Gyenis & Laetitia Poidevin & Olivier Poch & Làszlò Tora, 2012. "RNA Polymerase II Pausing Downstream of Core Histone Genes Is Different from Genes Producing Polyadenylated Transcripts," PLOS ONE, Public Library of Science, vol. 7(6), pages 1-14, June.
    6. Lei Zhang & Linlin Wang & Pu Tian & Suyan Tian, 2016. "Identification of Genes Discriminating Multiple Sclerosis Patients from Controls by Adapting a Pathway Analysis Method," PLOS ONE, Public Library of Science, vol. 11(11), pages 1-13, November.
    7. Upton Graham J. G. & Harrison Andrew P, 2010. "The Detection of Blur in Affymetrix GeneChips," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 9(1), pages 1-19, October.
    8. Ryan Abo & Gregory D Jenkins & Liewei Wang & Brooke L Fridley, 2012. "Identifying the Genetic Variation of Gene Expression Using Gene Sets: Application of Novel Gene Set eQTL Approach to PharmGKB and KEGG," PLOS ONE, Public Library of Science, vol. 7(8), pages 1-11, August.
    9. Jeremiah J Faith & Boris Hayete & Joshua T Thaden & Ilaria Mogno & Jamey Wierzbowski & Guillaume Cottarel & Simon Kasif & James J Collins & Timothy S Gardner, 2007. "Large-Scale Mapping and Validation of Escherichia coli Transcriptional Regulation from a Compendium of Expression Profiles," PLOS Biology, Public Library of Science, vol. 5(1), pages 1-13, January.
    10. Chalise, Prabhakar & Fridley, Brooke L., 2012. "Comparison of penalty functions for sparse canonical correlation analysis," Computational Statistics & Data Analysis, Elsevier, vol. 56(2), pages 245-254.
    11. Wei-Chung Cheng & Cheng-Wei Chang & Chaang-Ray Chen & Min-Lung Tsai & Wun-Yi Shu & Chia-Yang Li & Ian C Hsu, 2011. "Identification of Reference Genes across Physiological States for qRT-PCR through Microarray Meta-Analysis," PLOS ONE, Public Library of Science, vol. 6(2), pages 1-8, February.
    12. Parker Hilary S. & Leek Jeffrey T., 2012. "The practical effect of batch on genomic prediction," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(3), pages 1-22, April.
    13. Suyan Tian & James G Krueger & Katherine Li & Ali Jabbari & Carrie Brodmerkel & Michelle A Lowes & Mayte Suárez-Fariñas, 2012. "Meta-Analysis Derived (MAD) Transcriptome of Psoriasis Defines the “Core” Pathogenesis of Disease," PLOS ONE, Public Library of Science, vol. 7(9), pages 1-15, September.
    14. Akul Singhania & Hitasha Rupani & Nivenka Jayasekera & Simon Lumb & Paul Hales & Neil Gozzard & Donna E Davies & Christopher H Woelk & Peter H Howarth, 2017. "Altered Epithelial Gene Expression in Peripheral Airways of Severe Asthma," PLOS ONE, Public Library of Science, vol. 12(1), pages 1-16, January.
    15. Russell D J Huby & Philip Glaves & Richard Jackson, 2014. "The Incidence of Sexually Dimorphic Gene Expression Varies Greatly between Tissues in the Rat," PLOS ONE, Public Library of Science, vol. 9(12), pages 1-19, December.
    16. Erick da Conceição Amorim & Vinícius Diniz Mayrink, 2020. "Clustering non-linear interactions in factor analysis," METRON, Springer;Sapienza Università di Roma, vol. 78(3), pages 329-352, December.
    17. Aaron C Ericsson & J Wade Davis & William Spollen & Nathan Bivens & Scott Givan & Catherine E Hagan & Mark McIntosh & Craig L Franklin, 2015. "Effects of Vendor and Genetic Background on the Composition of the Fecal Microbiota of Inbred Mice," PLOS ONE, Public Library of Science, vol. 10(2), pages 1-19, February.
    18. Hossain, Ahmed & Beyene, Joseph & Willan, Andrew R. & Hu, Pingzhao, 2009. "A flexible approximate likelihood ratio test for detecting differential expression in microarray data," Computational Statistics & Data Analysis, Elsevier, vol. 53(10), pages 3685-3695, August.
    19. Xiaohong Li & Guy N Brock & Eric C Rouchka & Nigel G F Cooper & Dongfeng Wu & Timothy E O’Toole & Ryan S Gill & Abdallah M Eteleeb & Liz O’Brien & Shesh N Rai, 2017. "A comparison of per sample global scaling and per gene normalization methods for differential expression analysis of RNA-seq data," PLOS ONE, Public Library of Science, vol. 12(5), pages 1-22, May.
    20. Kerr Kathleen F., 2012. "Optimality Criteria for the Design of 2-Color Microarray Studies," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(1), pages 1-9, January.

