
An Integrated Approach to Identifying Cis-Regulatory Modules in the Human Genome

Authors

  • Kyoung-Jae Won
  • Saurabh Agarwal
  • Li Shen
  • Robert Shoemaker
  • Bing Ren
  • Wei Wang

Abstract

In eukaryotic genomes, it is challenging to determine the target sites of transcription factors (TFs) accurately from sequence information alone. Previous efforts tackled this task by exploiting two observations: TF binding sites tend to be more conserved than other functional sites, and the binding sites of several TFs are often clustered. Recently, ChIP-chip and ChIP-sequencing experiments have accumulated that identify TF binding sites and survey chromatin modification patterns at regulatory elements such as promoters and enhancers. We propose here a hidden Markov model (HMM) that incorporates sequence motif information, TF-DNA interaction data, and chromatin modification patterns to precisely identify cis-regulatory modules (CRMs). We conducted ChIP-chip experiments on four TFs (CREB, E2F1, MAX, and YY1) in 1% of the human genome. We then trained the HMM to label CRMs by incorporating the sequence motifs recognized by these TFs and the ChIP-chip ratio. Chromatin modification data were used to predict functional sites and to further remove false positives. Cross-validation showed that our integrated HMM outperformed other existing methods at predicting CRMs. Incorporating histone signature information successfully penalized false predictions and improved overall performance. The dataset we used and the software are available at http://nash.ucsd.edu/CIS/.
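
To make the idea of labeling CRMs with an HMM concrete, the following is a minimal sketch and not the authors' model. It assumes two hidden states ("background" and "crm"), one observation per genomic window consisting of a motif score and a ChIP-chip log ratio, and independent Gaussian emissions for the two features. Every state name, parameter value, and function name here is hypothetical and chosen only for illustration; the histone-based filtering step described in the abstract is not modeled.

```python
import math

# Hypothetical two-state layout; the published model's states are not given in the abstract.
STATES = ("background", "crm")

# Illustrative parameters only (log probabilities), not values from the paper.
START = {"background": math.log(0.9), "crm": math.log(0.1)}
TRANS = {
    "background": {"background": math.log(0.95), "crm": math.log(0.05)},
    "crm": {"background": math.log(0.10), "crm": math.log(0.90)},
}
# Assumed Gaussian (mean, sd) per state for each feature.
EMIT = {
    "background": {"motif": (0.0, 1.0), "chip": (0.0, 1.0)},
    "crm": {"motif": (3.0, 1.0), "chip": (2.0, 1.0)},
}


def gaussian_logpdf(x, mean, sd):
    """Log density of a normal distribution at x."""
    return -0.5 * math.log(2 * math.pi * sd * sd) - (x - mean) ** 2 / (2 * sd * sd)


def emission_logprob(state, motif_score, chip_log_ratio):
    """Combine the two features, assuming they are independent given the state."""
    m_mean, m_sd = EMIT[state]["motif"]
    c_mean, c_sd = EMIT[state]["chip"]
    return (gaussian_logpdf(motif_score, m_mean, m_sd)
            + gaussian_logpdf(chip_log_ratio, c_mean, c_sd))


def viterbi(observations):
    """Return the most likely state path for a list of (motif, chip) pairs."""
    if not observations:
        return []
    scores = [{s: START[s] + emission_logprob(s, *observations[0]) for s in STATES}]
    backpointers = []
    for obs in observations[1:]:
        prev = scores[-1]
        cur, ptr = {}, {}
        for s in STATES:
            # Best predecessor state for arriving in state s at this window.
            best_prev = max(STATES, key=lambda p: prev[p] + TRANS[p][s])
            cur[s] = prev[best_prev] + TRANS[best_prev][s] + emission_logprob(s, *obs)
            ptr[s] = best_prev
        scores.append(cur)
        backpointers.append(ptr)
    # Trace back from the best final state.
    state = max(STATES, key=lambda s: scores[-1][s])
    path = [state]
    for ptr in reversed(backpointers):
        state = ptr[state]
        path.append(state)
    path.reverse()
    return path


if __name__ == "__main__":
    # Synthetic windows: low signal, then three high-signal windows, then low again.
    windows = [(0.0, 0.0), (0.1, -0.2), (3.1, 2.2), (2.8, 1.9),
               (3.0, 2.1), (0.0, 0.1), (-0.2, 0.0)]
    for window, label in zip(windows, viterbi(windows)):
        print(window, label)
```

The point of the sketch is that a window is called part of a CRM only when both the motif evidence and the binding evidence support it and neighboring windows agree, which is the kind of integration the abstract describes. In practice the parameters would be trained from labeled data rather than set by hand.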

Suggested Citation

  • Kyoung-Jae Won & Saurabh Agarwal & Li Shen & Robert Shoemaker & Bing Ren & Wei Wang, 2009. "An Integrated Approach to Identifying Cis-Regulatory Modules in the Human Genome," PLOS ONE, Public Library of Science, vol. 4(5), pages 1-8, May.
  • Handle: RePEc:plo:pone00:0005501
    DOI: 10.1371/journal.pone.0005501

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0005501
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0005501&type=printable
    Download Restriction: no


    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Eilon Sharon & Shai Lubliner & Eran Segal, 2008. "A Feature-Based Approach to Modeling Protein–DNA Interactions," PLOS Computational Biology, Public Library of Science, vol. 4(8), pages 1-17, August.
    2. Matvei Khoroshkin & Andrey Buyan & Martin Dodel & Albertas Navickas & Johnny Yu & Fathima Trejo & Anthony Doty & Rithvik Baratam & Shaopu Zhou & Sean B. Lee & Tanvi Joshi & Kristle Garcia & Benedict C, 2024. "Systematic identification of post-transcriptional regulatory modules," Nature Communications, Nature, vol. 15(1), pages 1-21, December.
    3. Xinyi Liu & Bin Liu & Zhimin Huang & Ting Shi & Yingyi Chen & Jian Zhang, 2012. "SPPS: A Sequence-Based Method for Predicting Probability of Protein-Protein Interaction Partners," PLOS ONE, Public Library of Science, vol. 7(1), pages 1-6, January.
    4. G. Saharidis & I. Androulakis & M. Ierapetritou, 2011. "Model building using bi-level optimization," Journal of Global Optimization, Springer, vol. 49(1), pages 49-67, January.
    5. Armita Nourmohammad & Michael Lässig, 2011. "Formation of Regulatory Modules by Local Sequence Duplication," PLOS Computational Biology, Public Library of Science, vol. 7(10), pages 1-12, October.
    6. Emily N Manderson & Mohan Malleshaiah & Stephen W Michnick, 2008. "A Novel Genetic Screen Implicates Elm1 in the Inactivation of the Yeast Transcription Factor SBF," PLOS ONE, Public Library of Science, vol. 3(1), pages 1-9, January.
    7. Cheemeng Tan & Robert Phillip Smith & Ming-Chi Tsai & Russell Schwartz & Lingchong You, 2014. "Phenotypic Signatures Arising from Unbalanced Bacterial Growth," PLOS Computational Biology, Public Library of Science, vol. 10(8), pages 1-10, August.
    8. Guo-Cheng Yuan & Jun S Liu, 2008. "Genomic Sequence Is Highly Predictive of Local Nucleosome Depletion," PLOS Computational Biology, Public Library of Science, vol. 4(1), pages 1-11, January.
    9. Joshua S Weitz & Philip N Benfey & Ned S Wingreen, 2007. "Evolution, Interactions, and Biological Networks," PLOS Biology, Public Library of Science, vol. 5(1), pages 1-3, January.
    10. Manikandan Narayanan & Adrian Vetta & Eric E Schadt & Jun Zhu, 2010. "Simultaneous Clustering of Multiple Gene Expression and Physical Interaction Datasets," PLOS Computational Biology, Public Library of Science, vol. 6(4), pages 1-13, April.
    11. Sourav Bandyopadhyay & Ryan Kelley & Nevan J Krogan & Trey Ideker, 2008. "Functional Maps of Protein Complexes from Quantitative Genetic Interaction Data," PLOS Computational Biology, Public Library of Science, vol. 4(4), pages 1-8, April.
    12. Yue Yuan & Qiang Huo & Ziru Zhang & Qun Wang & Juanxia Wang & Shuaikang Chang & Peng Cai & Karen M. Song & David W. Galbraith & Weixiao Zhang & Long Huang & Rentao Song & Zeyang Ma, 2024. "Decoding the gene regulatory network of endosperm differentiation in maize," Nature Communications, Nature, vol. 15(1), pages 1-19, December.
    13. Xiaoyu Tu & Sibo Ren & Wei Shen & Jianjian Li & Yuxiang Li & Chuanshun Li & Yangmeihui Li & Zhanxiang Zong & Weibo Xie & Donald Grierson & Zhangjun Fei & Jim Giovannoni & Pinghua Li & Silin Zhong, 2022. "Limited conservation in cross-species comparison of GLK transcription factor binding suggested wide-spread cistrome divergence," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    14. Xun Lan & Christopher Adams & Mark Landers & Miroslav Dudas & Daniel Krissinger & George Marnellos & Russell Bonneville & Maoxiong Xu & Junbai Wang & Tim H-M Huang & Gavin Meredith & Victor X Jin, 2011. "High Resolution Detection and Analysis of CpG Dinucleotides Methylation Using MBD-Seq Technology," PLOS ONE, Public Library of Science, vol. 6(7), pages 1-11, July.
    15. Xiaoke Ma & Long Gao & Georgios Karamanlidis & Peng Gao & Chi Fung Lee & Lorena Garcia-Menendez & Rong Tian & Kai Tan, 2015. "Revealing Pathway Dynamics in Heart Diseases by Analyzing Multiple Differential Networks," PLOS Computational Biology, Public Library of Science, vol. 11(6), pages 1-19, June.
    16. Shojaie Ali & Michailidis George, 2010. "Network Enrichment Analysis in Complex Experiments," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 9(1), pages 1-36, May.
    17. Zhengdong D Zhang & Joel Rozowsky & Michael Snyder & Joseph Chang & Mark Gerstein, 2008. "Modeling ChIP Sequencing In Silico with Applications," PLOS Computational Biology, Public Library of Science, vol. 4(8), pages 1-10, August.
    18. Phaedra Agius & Aaron Arvey & William Chang & William Stafford Noble & Christina Leslie, 2010. "High Resolution Models of Transcription Factor-DNA Affinities Improve In Vitro and In Vivo Binding Predictions," PLOS Computational Biology, Public Library of Science, vol. 6(9), pages 1-12, September.
    19. Kenzie D MacIsaac & Ernest Fraenkel, 2006. "Practical Strategies for Discovering Regulatory DNA Sequence Motifs," PLOS Computational Biology, Public Library of Science, vol. 2(4), pages 1-10, April.
    20. John E Reid & Lorenz Wernisch, 2014. "STEME: A Robust, Accurate Motif Finder for Large Data Sets," PLOS ONE, Public Library of Science, vol. 9(3), pages 1-11, March.
