Printed from https://ideas.repec.org/a/plo/pcbi00/1006937.html

Exon level machine learning analyses elucidate novel candidate miRNA targets in an avian model of fetal alcohol spectrum disorder

Author

Listed:
  • Abrar E Al-Shaer
  • George R Flentke
  • Mark E Berres
  • Ana Garic
  • Susan M Smith

Abstract

Gestational alcohol exposure causes fetal alcohol spectrum disorder (FASD) and is a prominent cause of neurodevelopmental disability. Whole transcriptome sequencing (RNA-Seq) offers insights into mechanisms underlying FASD, but gene-level analysis provides limited information regarding complex transcriptional processes such as alternative splicing and non-coding RNAs. Moreover, traditional analytical approaches that use multiple hypothesis testing with a false discovery rate adjustment prioritize genes based on an adjusted p-value, which is not always biologically relevant. We addressed these limitations by implementing an unsupervised machine learning model, which we applied to an exon-level analysis to reduce data complexity to the exons most likely to be functionally relevant, without loss of novel information. This was performed on an RNA-Seq paired-end dataset derived from alcohol-exposed neural fold-stage chick crania, wherein alcohol causes facial deficits recapitulating those of FASD. A principal component analysis along with k-means clustering was used to extract exons that deviated from baseline expression. This identified 6857 differentially expressed exons representing 1251 geneIDs; 391 of these genes were identified in a prior gene-level analysis of this dataset. It also identified exons encoding 23 microRNAs (miRNAs) having significantly differential expression profiles in response to alcohol. We developed an RDAVID pipeline to identify KEGG pathways represented by these exons, and separately identified predicted KEGG pathways targeted by these miRNAs. Several of these (ribosome biogenesis, oxidative phosphorylation) were identified in our prior gene-level analysis. Other pathways are crucial to facial morphogenesis and represent both novel (focal adhesion, FoxO signaling, insulin signaling) and known (Wnt signaling) alcohol targets. 
Importantly, there was substantial overlap between the exomes themselves and the predicted miRNA targets, suggesting these miRNAs contribute to the gene-level expression changes. Our novel application of unsupervised machine learning in conjunction with statistical analyses facilitated the discovery of signaling pathways and miRNAs that inform mechanisms underlying FASD.

Author summary: Genomic research often yields an overwhelming amount of information. Accurate models for predicting and validating multivariate big data in genomics distill complex relationships and interactions. A prime example is fetal alcohol spectrum disorders, the largest known cause of neurodevelopmental disability affecting nearly 5% of children in the United States. Alcohol exposure during pregnancy leads to complex epigenetic and transcriptomic modifications, subsequently impairing signaling pathways in neural and morphologic development. Identifying transcriptomic mechanisms regulating alcohol’s teratogenicity during embryonic development is crucial for understanding variable phenotypic outcomes. This allows for the advancement of future therapeutic interventions that may mediate alcohol’s effects. Most genomic studies do not incorporate various levels of transcriptomic analysis, spanning gene, exon, and splicing variants, because it is difficult to meaningfully consolidate all those analyses. Therefore, enhancing machine learning approaches that corroborate traditional statistical methods can yield novel relationships, and is important for robust functional experiments that proceed from such genomic studies.
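The abstract describes using a principal component analysis together with k-means clustering to pull out exons that deviate from baseline expression. The following is a minimal sketch of that general idea, not the authors' pipeline: the toy log fold-change values, the exon names, the helper functions, the use of only the first principal component, and the choice of three clusters (down, baseline, up) are all assumptions made for illustration. It uses only the Python standard library, with power iteration standing in for a full PCA routine.

```python
def center(rows):
    """Subtract the per-column mean from every row."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    return [[r[j] - means[j] for j in range(d)] for r in rows]


def first_pc(rows, iters=100):
    """Leading principal axis via power iteration on the sample covariance."""
    n, d = len(rows), len(rows[0])
    cov = [[sum(r[a] * r[b] for r in rows) / (n - 1) for b in range(d)]
           for a in range(d)]
    v = [1.0] * d
    for _ in range(iters):
        w = [sum(cov[a][b] * v[b] for b in range(d)) for a in range(d)]
        norm = sum(x * x for x in w) ** 0.5
        v = [x / norm for x in w]
    return v


def kmeans_1d(values, iters=50):
    """k-means with k=3 on scalar scores; centers start at min, median, max."""
    s = sorted(values)
    centers = [s[0], s[len(s) // 2], s[-1]]
    labels = [0] * len(values)
    for _ in range(iters):
        labels = [min(range(3), key=lambda k: abs(x - centers[k]))
                  for x in values]
        for k in range(3):
            members = [x for x, lab in zip(values, labels) if lab == k]
            if members:  # keep the old center if a cluster empties
                centers[k] = sum(members) / len(members)
    return labels


def deviating_exons(log_fc):
    """Return exons whose PC1 score falls outside the largest (baseline) cluster."""
    names = list(log_fc)
    rows = center([log_fc[name] for name in names])
    pc = first_pc(rows)
    scores = [sum(a * b for a, b in zip(r, pc)) for r in rows]
    labels = kmeans_1d(scores)
    baseline = max(set(labels), key=labels.count)
    return [name for name, lab in zip(names, labels) if lab != baseline]


# Hypothetical per-exon log2 fold changes (alcohol vs. control), three replicates.
toy_log_fc = {
    "exon_a": [0.1, -0.1, 0.0],
    "exon_b": [-0.2, 0.1, 0.1],
    "exon_c": [0.0, 0.2, -0.1],
    "exon_d": [0.1, 0.0, 0.1],
    "exon_e": [-0.1, -0.1, 0.0],
    "exon_f": [0.0, 0.1, -0.2],
    "exon_up1": [2.1, 1.9, 2.0],
    "exon_up2": [1.8, 2.2, 2.1],
    "exon_dn1": [-2.0, -2.1, -1.9],
    "exon_dn2": [-1.9, -1.8, -2.2],
}

flagged = deviating_exons(toy_log_fc)
print(sorted(flagged))
```

Treating the largest cluster as the baseline is a simplifying assumption that only holds when most exons are unchanged; a real analysis would need to choose the number of components and clusters from the data.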

Suggested Citation

  • Abrar E Al-Shaer & George R Flentke & Mark E Berres & Ana Garic & Susan M Smith, 2019. "Exon level machine learning analyses elucidate novel candidate miRNA targets in an avian model of fetal alcohol spectrum disorder," PLOS Computational Biology, Public Library of Science, vol. 15(4), pages 1-25, April.
  • Handle: RePEc:plo:pcbi00:1006937
    DOI: 10.1371/journal.pcbi.1006937

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1006937
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1006937&type=printable
    Download Restriction: no



    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Dongya Wu & Enhui Shen & Bowen Jiang & Yu Feng & Wei Tang & Sangting Lao & Lei Jia & Han-Yang Lin & Lingjuan Xie & Xifang Weng & Chenfeng Dong & Qinghong Qian & Feng Lin & Haiming Xu & Huabing Lu & Lu, 2022. "Genomic insights into the evolution of Echinochloa species as weed and orphan crop," Nature Communications, Nature, vol. 13(1), pages 1-16, December.
    2. Wei Ding & Shougang Wang & Peng Qin & Shen Fan & Xiaoyan Su & Peiyan Cai & Jie Lu & Han Cui & Meng Wang & Yi Shu & Yongming Wang & Hui-Hui Fu & Yu-Zhong Zhang & Yong-Xin Li & Weipeng Zhang, 2023. "Anaerobic thiosulfate oxidation by the Roseobacter group is prevalent in marine biofilms," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    3. Irene Stefanini & Monica Di Paola & Gianni Liti & Andrea Marranci & Federico Sebastiani & Enrico Casalone & Duccio Cavalieri, 2022. "Resistance to Arsenite and Arsenate in Saccharomyces cerevisiae Arises through the Subtelomeric Expansion of a Cluster of Yeast Genes," IJERPH, MDPI, vol. 19(13), pages 1-15, July.
    4. Lihong Gu & Feng Wang & Zhemin Lin & Tieshan Xu & Dajie Lin & Manping Xing & Shaoxiong Yang & Zhe Chao & Baoguo Ye & Peng Lin & Chunhui Hui & Lizhi Lu & Shuisheng Hou, 2020. "Genetic characteristics of Jiaji Duck by whole genome re-sequencing," PLOS ONE, Public Library of Science, vol. 15(2), pages 1-15, February.
    5. Pingfen Zhu & Weiqiang Liu & Xiaoxiao Zhang & Meng Li & Gaoming Liu & Yang Yu & Zihao Li & Xuanjing Li & Juan Du & Xiao Wang & Cyril C. Grueter & Ming Li & Xuming Zhou, 2023. "Correlated evolution of social organization and lifespan in mammals," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
