IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1000849.html
   My bibliography  Save this article

Computational Analysis of Whole-Genome Differential Allelic Expression Data in Human

Author

Listed:
  • James R Wagner
  • Bing Ge
  • Dmitry Pokholok
  • Kevin L Gunderson
  • Tomi Pastinen
  • Mathieu Blanchette

Abstract

Allelic imbalance (AI) is a phenomenon where the two alleles of a given gene are expressed at different levels in a given cell, either because of epigenetic inactivation of one of the two alleles, or because of genetic variation in regulatory regions. Recently, Bing et al. have described the use of genotyping arrays to assay AI at a high resolution (∼750,000 SNPs across the autosomes). In this paper, we investigate computational approaches to analyze this data and identify genomic regions with AI in an unbiased and robust statistical manner. We propose two families of approaches: (i) a statistical approach based on z-score computations, and (ii) a family of machine learning approaches based on Hidden Markov Models. Each method is evaluated using previously published experimental data sets as well as with permutation testing. When applied to whole genome data from 53 HapMap samples, our approaches reveal that allelic imbalance is widespread (most expressed genes show evidence of AI in at least one of our 53 samples) and that most AI regions in a given individual are also found in at least a few other individuals. While many AI regions identified in the genome correspond to known protein-coding transcripts, others overlap with recently discovered long non-coding RNAs. We also observe that genomic regions with AI not only include complete transcripts with consistent differential expression levels, but also more complex patterns of allelic expression such as alternative promoters and alternative 3′ end. The approaches developed not only shed light on the incidence and mechanisms of allelic expression, but will also help towards mapping the genetic causes of allelic expression and identify cases where this variation may be linked to diseases.Author Summary: Measures of gene expression, and the search for regulatory regions in the genome responsible for differences in levels of gene expression, is one of the key paths of research used to identify disease causing genes, as well as explain differences between healthy individuals. Typically, experiments have measured and compared gene expression in multiple individuals, and used this information to attempt to map regulatory regions responsible. Differences in environment between individuals can, however, cause differences in gene expression unrelated to the underlying regulatory sequence. New genotyping technologies enable the measurement of expression of both copies of a particular gene, at loci that are heterozygous within a particular individual. This will therefore act as an internal control, as environmental factors will continue to affect the expression of both copies of a gene at presumably equal levels, and differences in expression are more likely to be explicable by differences in regulatory regions specific to the two copies of the gene itself. Differences between regulatory regions are expected to lead to differences in expression of the two copies (or the two alleles) of a particular gene, also known as allelic imbalance. We describe a set of signal processing methods for the reliable detection of allelic expression within the genome.

Suggested Citation

  • James R Wagner & Bing Ge & Dmitry Pokholok & Kevin L Gunderson & Tomi Pastinen & Mathieu Blanchette, 2010. "Computational Analysis of Whole-Genome Differential Allelic Expression Data in Human," PLOS Computational Biology, Public Library of Science, vol. 6(7), pages 1-12, July.
  • Handle: RePEc:plo:pcbi00:1000849
    DOI: 10.1371/journal.pcbi.1000849
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1000849
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1000849&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1000849?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Laura Carrel & Huntington F. Willard, 2005. "X-inactivation profile reveals extensive variability in X-linked gene expression in females," Nature, Nature, vol. 434(7031), pages 400-404, March.
    2. Oscar M Rueda & Ramón Díaz-Uriarte, 2007. "Flexible and Accurate Detection of Genomic Copy-Number Changes from aCGH," PLOS Computational Biology, Public Library of Science, vol. 3(6), pages 1-8, June.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Jean Francois Lefebvre & Emilio Vello & Bing Ge & Stephen B Montgomery & Emmanouil T Dermitzakis & Tomi Pastinen & Damian Labuda, 2012. "Genotype-Based Test in Mapping Cis-Regulatory Variants from Allele-Specific Expression Data," PLOS ONE, Public Library of Science, vol. 7(6), pages 1-15, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mitsuko Nakayama & Osamu Kondo & Paula Pesonen & Lassi Alvesalo & Raija Lähdesmäki, 2018. "Influence of long and short arms of X chromosome on maxillary molar crown morphology," PLOS ONE, Public Library of Science, vol. 13(11), pages 1-17, November.
    2. Boisvert, Danielle & Wells, Jessica & Armstrong, Todd A. & Lewis, Richard H., 2018. "Serotonin and self-control: A genetically moderated stress sensitization effect," Journal of Criminal Justice, Elsevier, vol. 56(C), pages 98-106.
    3. Stogner, John M. & Gibson, Chris L., 2013. "Stressful life events and adolescent drug use: Moderating influences of the MAOA gene," Journal of Criminal Justice, Elsevier, vol. 41(5), pages 357-363.
    4. Ao Yuan & Guanjie Chen & Juan Xiong & Wenqing He & Wen Jin & Charles Rotimi, 2011. "Bayesian--frequentist hybrid model with application to the analysis of gene copy number changes," Journal of Applied Statistics, Taylor & Francis Journals, vol. 38(5), pages 987-1005, February.
    5. Taru Tukiainen & Matti Pirinen & Antti-Pekka Sarin & Claes Ladenvall & Johannes Kettunen & Terho Lehtimäki & Marja-Liisa Lokki & Markus Perola & Juha Sinisalo & Efthymia Vlachopoulou & Johan G Eriksso, 2014. "Chromosome X-Wide Association Study Identifies Loci for Fasting Insulin and Height and Evidence for Incomplete Dosage Compensation," PLOS Genetics, Public Library of Science, vol. 10(2), pages 1-12, February.
    6. Michael Seifert & André Gohr & Marc Strickert & Ivo Grosse, 2012. "Parsimonious Higher-Order Hidden Markov Models for Improved Array-CGH Analysis with Applications to Arabidopsis thaliana," PLOS Computational Biology, Public Library of Science, vol. 8(1), pages 1-15, January.
    7. Antonio Lentini & Huaitao Cheng & J. C. Noble & Natali Papanicolaou & Christos Coucoravas & Nathanael Andrews & Qiaolin Deng & Martin Enge & Björn Reinius, 2022. "Elastic dosage compensation by X-chromosome upregulation," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    8. Svitlana Tyekucheva & Francesca Chiaromonte, 2008. "Augmenting the bootstrap to analyze high dimensional genomic data," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 17(1), pages 1-18, May.
    9. Diana Chang & Feng Gao & Andrea Slavney & Li Ma & Yedael Y Waldman & Aaron J Sams & Paul Billing-Ross & Aviv Madar & Richard Spritz & Alon Keinan, 2014. "Accounting for eXentricities: Analysis of the X Chromosome in GWAS Reveals X-Linked Genes Implicated in Autoimmune Diseases," PLOS ONE, Public Library of Science, vol. 9(12), pages 1-31, December.
    10. Yemiao Gao & Yuke Xiong & Xia Liu & Hui Wang, 2021. "The Effects of Childhood Maltreatment on Non-Suicidal Self-Injury in Male Adolescents: The Moderating Roles of the Monoamine Oxidase A (MAOA) Gene and the Catechol-O-Methyltransferase (COMT) Gene," IJERPH, MDPI, vol. 18(5), pages 1-13, March.
    11. Hao, Meiling & Zhao, Xingqiu & Xu, Wei, 2020. "Competing risk modeling and testing for X-chromosome genetic association," Computational Statistics & Data Analysis, Elsevier, vol. 151(C).
    12. Norman L. Lehman & Nathalie Spassky & Müge Sak & Amy Webb & Cory T. Zumbar & Aisulu Usubalieva & Khaled J. Alkhateeb & Joseph P. McElroy & Kirsteen H. Maclean & Paolo Fadda & Tom Liu & Vineela Gangala, 2022. "Astroblastomas exhibit radial glia stem cell lineages and differential expression of imprinted and X-inactivation escape genes," Nature Communications, Nature, vol. 13(1), pages 1-20, December.
    13. Asta Laiho & Laura L Elo, 2014. "A Note on an Exon-Based Strategy to Identify Differentially Expressed Genes in RNA-Seq Experiments," PLOS ONE, Public Library of Science, vol. 9(12), pages 1-12, December.
    14. Masato Tsuji & Takanori Kawasaki & Takeru Matsuda & Tomio Arai & Satoshi Gojo & Jun K Takeuchi, 2017. "Sexual dimorphisms of mRNA and miRNA in human/murine heart disease," PLOS ONE, Public Library of Science, vol. 12(7), pages 1-21, July.
    15. Frances M. Nilsen & Jessica Frank & Nicolle S. Tulve, 2020. "A Systematic Review and Meta-Analysis Investigating the Relationship between Exposures to Chemical and Non-Chemical Stressors during Prenatal Development and Childhood Externalizing Behaviors," IJERPH, MDPI, vol. 17(7), pages 1-32, March.
    16. Erick da Conceição Amorim & Vinícius Diniz Mayrink, 2020. "Clustering non-linear interactions in factor analysis," METRON, Springer;Sapienza Università di Roma, vol. 78(3), pages 329-352, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1000849. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.