IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0138030.html
   My bibliography  Save this article

msCentipede: Modeling Heterogeneity across Genomic Sites and Replicates Improves Accuracy in the Inference of Transcription Factor Binding

Author

Listed:
  • Anil Raj
  • Heejung Shim
  • Yoav Gilad
  • Jonathan K Pritchard
  • Matthew Stephens

Abstract

Understanding global gene regulation depends critically on accurate annotation of regulatory elements that are functional in a given cell type. CENTIPEDE, a powerful, probabilistic framework for identifying transcription factor binding sites from tissue-specific DNase I cleavage patterns and genomic sequence content, leverages the hypersensitivity of factor-bound chromatin and the information in the DNase I spatial cleavage profile characteristic of each DNA binding protein to accurately infer functional factor binding sites. However, the model for the spatial profile in this framework fails to account for the substantial variation in the DNase I cleavage profiles across different binding sites. Neither does it account for variation in the profiles at the same binding site across multiple replicate DNase I experiments, which are increasingly available. In this work, we introduce new methods, based on multi-scale models for inhomogeneous Poisson processes, to account for such variation in DNase I cleavage patterns both within and across binding sites. These models account for the spatial structure in the heterogeneity in DNase I cleavage patterns for each factor. Using DNase-seq measurements assayed in a lymphoblastoid cell line, we demonstrate the improved performance of this model for several transcription factors by comparing against the Chip-seq peaks for those factors. Finally, we explore the effects of DNase I sequence bias on inference of factor binding using a simple extension to our framework that allows for a more flexible background model. The proposed model can also be easily applied to paired-end ATAC-seq and DNase-seq data. msCentipede, a Python implementation of our algorithm, is available at http://rajanil.github.io/msCentipede.

Suggested Citation

  • Anil Raj & Heejung Shim & Yoav Gilad & Jonathan K Pritchard & Matthew Stephens, 2015. "msCentipede: Modeling Heterogeneity across Genomic Sites and Replicates Improves Accuracy in the Inference of Transcription Factor Binding," PLOS ONE, Public Library of Science, vol. 10(9), pages 1-15, September.
  • Handle: RePEc:plo:pone00:0138030
    DOI: 10.1371/journal.pone.0138030
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0138030
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0138030&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0138030?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Robert E. Thurman & Eric Rynes & Richard Humbert & Jeff Vierstra & Matthew T. Maurano & Eric Haugen & Nathan C. Sheffield & Andrew B. Stergachis & Hao Wang & Benjamin Vernot & Kavita Garg & Sam John &, 2012. "The accessible chromatin landscape of the human genome," Nature, Nature, vol. 489(7414), pages 75-82, September.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Liang-Yu Fu & Tao Zhu & Xinkai Zhou & Ranran Yu & Zhaohui He & Peijing Zhang & Zhigui Wu & Ming Chen & Kerstin Kaufmann & Dijun Chen, 2022. "ChIP-Hub provides an integrative platform for exploring plant regulome," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    2. Xuelong Yao & Zongyang Lu & Zhanying Feng & Lei Gao & Xin Zhou & Min Li & Suijuan Zhong & Qian Wu & Zhenbo Liu & Haofeng Zhang & Zeyuan Liu & Lizhi Yi & Tao Zhou & Xudong Zhao & Jun Zhang & Yong Wang , 2022. "Comparison of chromatin accessibility landscapes during early development of prefrontal cortex between rhesus macaque and human," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    3. Fei Sun & Nourhan Nashat Ali & Daniela Londoño-Vásquez & Constantine A. Simintiras & Huanyu Qiao & M. Sofia Ortega & Yuksel Agca & Masashi Takahashi & Rocío M. Rivera & Andrew M. Kelleher & Peter Suto, 2024. "Increased DNA damage in full-grown oocytes is correlated with diminished autophagy activation," Nature Communications, Nature, vol. 15(1), pages 1-20, December.
    4. Chris Wallace & Antony J Cutler & Nikolas Pontikos & Marcin L Pekalski & Oliver S Burren & Jason D Cooper & Arcadio Rubio García & Ricardo C Ferreira & Hui Guo & Neil M Walker & Deborah J Smyth & Step, 2015. "Dissection of a Complex Disease Susceptibility Region Using a Bayesian Stochastic Search Approach to Fine Mapping," PLOS Genetics, Public Library of Science, vol. 11(6), pages 1-22, June.
    5. Siqian Feng & Chaitanya Rastogi & Ryan Loker & William J. Glassford & H. Tomas Rube & Harmen J. Bussemaker & Richard S. Mann, 2022. "Transcription factor paralogs orchestrate alternative gene regulatory networks by context-dependent cooperation with multiple cofactors," Nature Communications, Nature, vol. 13(1), pages 1-19, December.
    6. Grigorios Georgolopoulos & Nikoletta Psatha & Mineo Iwata & Andrew Nishida & Tannishtha Som & Minas Yiangou & John A. Stamatoyannopoulos & Jeff Vierstra, 2021. "Discrete regulatory modules instruct hematopoietic lineage commitment and differentiation," Nature Communications, Nature, vol. 12(1), pages 1-15, December.
    7. Nikolai Schleussner & Pierre Cauchy & Vedran Franke & Maciej Giefing & Oriol Fornes & Naveen Vankadari & Salam A. Assi & Mariantonia Costanza & Marc A. Weniger & Altuna Akalin & Ioannis Anagnostopoulo, 2023. "Transcriptional reprogramming by mutated IRF4 in lymphoma," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
    8. Hillary Koch & Cheryl A. Keller & Guanjue Xiang & Belinda Giardine & Feipeng Zhang & Yicheng Wang & Ross C. Hardison & Qunhua Li, 2022. "CLIMB: High-dimensional association detection in large scale genomic data," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    9. Guang Guo & Lin Wang & Hexuan Liu & Thomas Randall, 2014. "Genomic Assortative Mating in Marriages in the United States," PLOS ONE, Public Library of Science, vol. 9(11), pages 1-14, November.
    10. Alan Selewa & Kaixuan Luo & Michael Wasney & Linsin Smith & Xiaotong Sun & Chenwei Tang & Heather Eckart & Ivan P. Moskowitz & Anindita Basu & Xin He & Sebastian Pott, 2023. "Single-cell genomics improves the discovery of risk variants and genes of atrial fibrillation," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
    11. M d Mesbah Uddin & Ngoc Quynh H. Nguyen & Bing Yu & Jennifer A. Brody & Akhil Pampana & Tetsushi Nakao & Myriam Fornage & Jan Bressler & Nona Sotoodehnia & Joshua S. Weinstock & Michael C. Honigberg &, 2022. "Clonal hematopoiesis of indeterminate potential, DNA methylation, and risk for coronary artery disease," Nature Communications, Nature, vol. 13(1), pages 1-16, December.
    12. Grégoire Vernaz & Milan Malinsky & Hannes Svardal & Mingliu Du & Alexandra M. Tyers & M. Emília Santos & Richard Durbin & Martin J. Genner & George F. Turner & Eric A. Miska, 2021. "Mapping epigenetic divergence in the massive radiation of Lake Malawi cichlid fishes," Nature Communications, Nature, vol. 12(1), pages 1-13, December.
    13. Arthur S. Lee & Lauren J. Ayers & Michael Kosicki & Wai-Man Chan & Lydia N. Fozo & Brandon M. Pratt & Thomas E. Collins & Boxun Zhao & Matthew F. Rose & Alba Sanchis-Juan & Jack M. Fu & Isaac Wong & X, 2024. "A cell type-aware framework for nominating non-coding variants in Mendelian regulatory disorders," Nature Communications, Nature, vol. 15(1), pages 1-26, December.
    14. Kinjal Desai & Siyi Wanggou & Erika Luis & Heather Whetstone & Chunying Yu & Robert J. Vanner & Hayden J. Selvadurai & Lilian Lee & Jinchu Vijay & Julia E. Jaramillo & Jerry Fan & Paul Guilhamon & Mic, 2025. "OLIG2 mediates a rare targetable stem cell fate transition in sonic hedgehog medulloblastoma," Nature Communications, Nature, vol. 16(1), pages 1-20, December.
    15. Emely Möller & Viviane Praz & Sanalkumar Rajendran & Rui Dong & Alexandra Cauderay & Yu-Hang Xing & Lukuo Lee & Carlo Fusco & Liliane C. Broye & Luisa Cironi & Sowmya Iyer & Shruthi Rengarajan & Mary , 2022. "EWSR1-ATF1 dependent 3D connectivity regulates oncogenic and differentiation programs in Clear Cell Sarcoma," Nature Communications, Nature, vol. 13(1), pages 1-18, December.
    16. Vivekanandan Ramalingam & Xinyang Yu & Brian D. Slaughter & Jay R. Unruh & Kaelan J. Brennan & Anastasiia Onyshchenko & Jeffrey J. Lange & Malini Natarajan & Michael Buck & Julia Zeitlinger, 2023. "Lola-I is a promoter pioneer factor that establishes de novo Pol II pausing during development," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    17. Han Wang & Huiying Sun & Bilin Liang & Fang Zhang & Fan Yang & Bowen Cui & Lixia Ding & Xiang Wang & Ronghua Wang & Jiaoyang Cai & Yanjing Tang & Jianan Rao & Wenting Hu & Shuang Zhao & Wenyan Wu & Xi, 2023. "Chromatin accessibility landscape of relapsed pediatric B-lineage acute lymphoblastic leukemia," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    18. Lara Djakovic & Thomas Hennig & Katharina Reinisch & Andrea Milić & Adam W. Whisnant & Katharina Wolf & Elena Weiß & Tobias Haas & Arnhild Grothey & Christopher S. Jürges & Michael Kluge & Elmar Wolf , 2023. "The HSV-1 ICP22 protein selectively impairs histone repositioning upon Pol II transcription downstream of genes," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    19. Alan Yue Yang Teo & Jordan W. Squair & Gregoire Courtine & Michael A. Skinnider, 2024. "Best practices for differential accessibility analysis in single-cell epigenomics," Nature Communications, Nature, vol. 15(1), pages 1-19, December.
    20. Jie Lv & Shu Meng & Qilin Gu & Rongbin Zheng & Xinlei Gao & Jun-dae Kim & Min Chen & Bo Xia & Yihan Zuo & Sen Zhu & Dongyu Zhao & Yanqiang Li & Guangyu Wang & Xin Wang & Qingshu Meng & Qi Cao & John P, 2023. "Epigenetic landscape reveals MECOM as an endothelial lineage regulator," Nature Communications, Nature, vol. 14(1), pages 1-17, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0138030. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.