
RFECS: A Random-Forest Based Algorithm for Enhancer Identification from Chromatin State

Author

Listed:
  • Nisha Rajagopal
  • Wei Xie
  • Yan Li
  • Uli Wagner
  • Wei Wang
  • John Stamatoyannopoulos
  • Jason Ernst
  • Manolis Kellis
  • Bing Ren

Abstract

Transcriptional enhancers play critical roles in regulation of gene expression, but their identification in the eukaryotic genome has been challenging. Recently, it was shown that enhancers in the mammalian genome are associated with characteristic histone modification patterns, which have been increasingly exploited for enhancer identification. However, only a limited number of cell types or chromatin marks have previously been investigated for this purpose, leaving the question unanswered whether there exists an optimal set of histone modifications for enhancer prediction in different cell types. Here, we address this issue by exploring genome-wide profiles of 24 histone modifications in two distinct human cell types, embryonic stem cells and lung fibroblasts. We developed a Random-Forest based algorithm, RFECS (Random Forest based Enhancer identification from Chromatin States) to integrate histone modification profiles for identification of enhancers, and used it to identify enhancers in a number of cell-types. We show that RFECS not only leads to more accurate and precise prediction of enhancers than previous methods, but also helps identify the most informative and robust set of three chromatin marks for enhancer prediction.

Author Summary

Enhancers are regions in the genome that can activate the expression of a gene irrespective of their location with respect to the gene. Identifying these elements is critical in understanding regulatory differences between different cell-types. Since enhancers lack characteristic sequence features and can be far away from the gene they regulate, their identification is not trivial. Experimentally determining the genome-wide binding sites of transcriptional co-activator p300 is one way of finding enhancers but it can only identify a subset of enhancers. A few years ago, it was observed that the binding sites of p300 are marked by distinctive, post-translational histone modifications. Several groups have exploited this discovery to predict genome-wide enhancers based on their similarity to the histone modification profiles of p300 binding sites. We here report a novel algorithm for this purpose and show that it has much greater accuracy than existing methods. Another unique feature of our algorithm is the ability to automatically deduce the most informative set of histone modifications required for enhancer prediction. We expect that this method will become increasingly useful with the expanding number of known histone modifications and rapid accumulation of epigenomic datasets for various cell types and species.
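
The general idea described above, training a random forest on histone modification signals and then asking which marks matter most, can be illustrated with a toy sketch. This is not the RFECS implementation: the data are synthetic, the six marks are hypothetical (marks 0 and 1 are made informative by construction), the trees are ordinary axis-aligned decision trees, and the importance score is an accumulated Gini impurity decrease, which is one common choice and not necessarily the measure the authors used.

```python
import random

N_MARKS = 6  # hypothetical: marks 0 and 1 carry signal, marks 2-5 are noise


def make_data(n, rng):
    """Synthetic windows; enhancers (label 1) have elevated signal on marks 0 and 1."""
    rows, labels = [], []
    for _ in range(n):
        y = int(rng.random() < 0.5)
        rows.append([rng.gauss(2.0 if (y and j < 2) else 0.0, 1.0) for j in range(N_MARKS)])
        labels.append(y)
    return rows, labels


def gini(labels):
    """Gini impurity of a list of 0/1 labels."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2.0 * p * (1.0 - p)


def best_split(rows, labels, features):
    """Best (feature, threshold, impurity decrease) among the candidate features."""
    parent, n, best = gini(labels), len(rows), None
    for f in features:
        values = sorted(set(r[f] for r in rows))
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2.0
            left = [y for r, y in zip(rows, labels) if r[f] <= t]
            right = [y for r, y in zip(rows, labels) if r[f] > t]
            gain = parent - (len(left) * gini(left) + len(right) * gini(right)) / n
            if best is None or gain > best[2]:
                best = (f, t, gain)
    return best


def build_tree(rows, labels, depth, n_try, rng, importance):
    """Grow one tree; leaves store the fraction of enhancer windows."""
    leaf = sum(labels) / len(labels)
    if depth == 0 or leaf in (0.0, 1.0):
        return leaf
    # Each node only considers a random subset of marks (the "random" in random forest).
    split = best_split(rows, labels, rng.sample(range(N_MARKS), n_try))
    if split is None or split[2] <= 0.0:
        return leaf
    f, t, gain = split
    left = [(r, y) for r, y in zip(rows, labels) if r[f] <= t]
    right = [(r, y) for r, y in zip(rows, labels) if r[f] > t]
    if not left or not right:
        return leaf
    importance[f] += gain * len(rows)  # sample-weighted impurity decrease
    kids = [build_tree([r for r, _ in part], [y for _, y in part],
                       depth - 1, n_try, rng, importance) for part in (left, right)]
    return (f, t, kids[0], kids[1])


def predict_tree(node, row):
    """Walk from the root to a leaf and return its enhancer fraction."""
    while isinstance(node, tuple):
        f, t, left, right = node
        node = left if row[f] <= t else right
    return node


def train_forest(rows, labels, n_trees=20, depth=3, n_try=3, seed=0):
    """Train trees on bootstrap samples; return (trees, normalized importances)."""
    rng = random.Random(seed)
    importance = [0.0] * N_MARKS
    trees = []
    for _ in range(n_trees):
        idx = [rng.randrange(len(rows)) for _ in rows]  # bootstrap resample
        trees.append(build_tree([rows[i] for i in idx], [labels[i] for i in idx],
                                depth, n_try, rng, importance))
    total = sum(importance) or 1.0
    return trees, [v / total for v in importance]


def predict_forest(trees, row):
    """Average the per-tree enhancer fractions."""
    return sum(predict_tree(t, row) for t in trees) / len(trees)


rng = random.Random(1)
train_rows, train_labels = make_data(150, rng)
test_rows, test_labels = make_data(150, rng)
trees, importance = train_forest(train_rows, train_labels)
hits = sum((predict_forest(trees, r) >= 0.5) == bool(y)
           for r, y in zip(test_rows, test_labels))
print("held-out accuracy: %.2f" % (hits / len(test_rows)))
for j in sorted(range(N_MARKS), key=lambda j: -importance[j]):
    print("mark_%d importance: %.3f" % (j, importance[j]))
```

Ranking marks by importance and keeping only the top few is the kind of step that would let a method like this suggest a small, informative set of marks; in this toy setup the two marks that were given signal should rank first.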

Suggested Citation

  • Nisha Rajagopal & Wei Xie & Yan Li & Uli Wagner & Wei Wang & John Stamatoyannopoulos & Jason Ernst & Manolis Kellis & Bing Ren, 2013. "RFECS: A Random-Forest Based Algorithm for Enhancer Identification from Chromatin State," PLOS Computational Biology, Public Library of Science, vol. 9(3), pages 1-14, March.
  • Handle: RePEc:plo:pcbi00:1002968
    DOI: 10.1371/journal.pcbi.1002968

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1002968
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1002968&type=printable
    Download Restriction: no

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Christina M. Caragine & Victoria T. Le & Meer Mustafa & Bianca Jay Diaz & John A. Morris & Simon Müller & Alejandro Mendez-Mancilla & Evan Geller & Noa Liscovitch-Brauer & Neville E. Sanjana, 2025. "Comprehensive dissection of cis-regulatory elements in a 2.8 Mb topologically associated domain in six human cancers," Nature Communications, Nature, vol. 16(1), pages 1-17, December.
    2. Jessica L Larson & Guo-Cheng Yuan, 2012. "Chromatin States Accurately Classify Cell Differentiation Stages," PLOS ONE, Public Library of Science, vol. 7(2), pages 1-9, February.
    3. Kosei Nagata & Hironori Hojo & Song Ho Chang & Hiroyuki Okada & Fumiko Yano & Ryota Chijimatsu & Yasunori Omata & Daisuke Mori & Yuma Makii & Manabu Kawata & Taizo Kaneko & Yasuhide Iwanaga & Hideki N, 2022. "Runx2 and Runx3 differentially regulate articular chondrocytes during surgically induced osteoarthritis development," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    4. Lauren A. Choate & Gilad Barshad & Pierce W. McMahon & Iskander Said & Edward J. Rice & Paul R. Munn & James J. Lewis & Charles G. Danko, 2021. "Multiple stages of evolutionary change in anthrax toxin receptor expression in humans," Nature Communications, Nature, vol. 12(1), pages 1-12, December.
    5. Kota Hamamoto & Yusuke Umemura & Shiho Makino & Takashi Fukaya, 2023. "Dynamic interplay between non-coding enhancer transcription and gene activity in development," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    6. Jingting Xu & Hong Hu & Yang Dai, 2016. "LMethyR-SVM: Predict Human Enhancers Using Low Methylated Regions based on Weighted Support Vector Machines," PLOS ONE, Public Library of Science, vol. 11(9), pages 1-18, September.
    7. van Iterson Maarten & Duijkers Floor A.M. & Meijerink Jules P.P. & Admiraal Pieter & van Ommen Gert-Jan B. & Boer Judith M. & van Noesel Max M. & Menezes Renee X., 2012. "A Novel and Fast Normalization Method for High-Density Arrays," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 11(4), pages 1-31, July.
    8. Christopher G Bell & Sarah Finer & Cecilia M Lindgren & Gareth A Wilson & Vardhman K Rakyan & Andrew E Teschendorff & Pelin Akan & Elia Stupka & Thomas A Down & Inga Prokopenko & Ian M Morison & Jonat, 2010. "Integrated Genetic and Epigenetic Analysis Identifies Haplotype-Specific Methylation in the FTO Type 2 Diabetes and Obesity Susceptibility Locus," PLOS ONE, Public Library of Science, vol. 5(11), pages 1-12, November.
    9. B. Edginton-White & A. Maytum & S. G. Kellaway & D. K. Goode & P. Keane & I. Pagnuco & S. A. Assi & L. Ames & M. Clarke & P. N. Cockerill & B. Göttgens & J. B. Cazier & C. Bonifer, 2023. "A genome-wide relay of signalling-responsive enhancers drives hematopoietic specification," Nature Communications, Nature, vol. 14(1), pages 1-20, December.
    10. Charles Limouse & Owen K. Smith & David Jukam & Kelsey A. Fryer & William J. Greenleaf & Aaron F. Straight, 2023. "Global mapping of RNA-chromatin contacts reveals a proximity-dominated connectivity model for ncRNA-gene interactions," Nature Communications, Nature, vol. 14(1), pages 1-21, December.
    11. Vladyslava Gorbovytska & Seung-Kyoon Kim & Filiz Kuybu & Michael Götze & Dahun Um & Keunsoo Kang & Andreas Pittroff & Theresia Brennecke & Lisa-Marie Schneider & Alexander Leitner & Tae-Kyung Kim & Cl, 2022. "Enhancer RNAs stimulate Pol II pause release by harnessing multivalent interactions to NELF," Nature Communications, Nature, vol. 13(1), pages 1-22, December.
    12. Annkatrin Bressin & Olga Jasnovidova & Mirjam Arnold & Elisabeth Altendorfer & Filip Trajkovski & Thomas A. Kratz & Joanna E. Handzlik & Denes Hnisz & Andreas Mayer, 2023. "High-sensitive nascent transcript sequencing reveals BRD4-specific control of widespread enhancer and target gene transcription," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
    13. Irene Robles-Rebollo & Sergi Cuartero & Adria Canellas-Socias & Sarah Wells & Mohammad M. Karimi & Elisabetta Mereu & Alexandra G. Chivu & Holger Heyn & Chad Whilding & Dirk Dormann & Samuel Marguerat, 2022. "Cohesin couples transcriptional bursting probabilities of inducible enhancers and promoters," Nature Communications, Nature, vol. 13(1), pages 1-16, December.
    14. Xiaofeng Dai & Wenwen Guo & Chunjun Zhan & Xiuxia Liu & Zhonghu Bai & Yankun Yang, 2015. "WDR5 Expression Is Prognostic of Breast Cancer Outcome," PLOS ONE, Public Library of Science, vol. 10(9), pages 1-15, September.
    15. Ian C McDowell & Dinesh Manandhar & Christopher M Vockley & Amy K Schmid & Timothy E Reddy & Barbara E Engelhardt, 2018. "Clustering gene expression time series data using an infinite Gaussian process mixture model," PLOS Computational Biology, Public Library of Science, vol. 14(1), pages 1-27, January.
