IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v13y2022i1d10.1038_s41467-022-31337-w.html
   My bibliography  Save this article

Learning representations of chromatin contacts using a recurrent neural network identifies genomic drivers of conformation

Author

Listed:
  • Kevin B. Dsouza

    (University of British Columbia)

  • Alexandra Maslova

    (Simon Fraser University)

  • Ediem Al-Jibury

    (Imperial College London
    Imperial College London)

  • Matthias Merkenschlager

    (Imperial College London)

  • Vijay K. Bhargava

    (University of British Columbia)

  • Maxwell W. Libbrecht

    (Simon Fraser University)

Abstract

Despite the availability of chromatin conformation capture experiments, discerning the relationship between the 1D genome and 3D conformation remains a challenge, which limits our understanding of their affect on gene expression and disease. We propose Hi-C-LSTM, a method that produces low-dimensional latent representations that summarize intra-chromosomal Hi-C contacts via a recurrent long short-term memory neural network model. We find that these representations contain all the information needed to recreate the observed Hi-C matrix with high accuracy, outperforming existing methods. These representations enable the identification of a variety of conformation-defining genomic elements, including nuclear compartments and conformation-related transcription factors. They furthermore enable in-silico perturbation experiments that measure the influence of cis-regulatory elements on conformation.

Suggested Citation

  • Kevin B. Dsouza & Alexandra Maslova & Ediem Al-Jibury & Matthias Merkenschlager & Vijay K. Bhargava & Maxwell W. Libbrecht, 2022. "Learning representations of chromatin contacts using a recurrent neural network identifies genomic drivers of conformation," Nature Communications, Nature, vol. 13(1), pages 1-19, December.
  • Handle: RePEc:nat:natcom:v:13:y:2022:i:1:d:10.1038_s41467-022-31337-w
    DOI: 10.1038/s41467-022-31337-w
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-022-31337-w
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-022-31337-w?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Pau Farré & Eldon Emberly, 2018. "A maximum-entropy model for predicting chromatin contacts," PLOS Computational Biology, Public Library of Science, vol. 14(2), pages 1-16, February.
    2. Jesse R. Dixon & Siddarth Selvaraj & Feng Yue & Audrey Kim & Yan Li & Yin Shen & Ming Hu & Jun S. Liu & Bing Ren, 2012. "Topological domains in mammalian genomes identified by analysis of chromatin interactions," Nature, Nature, vol. 485(7398), pages 376-380, May.
    3. Wenbao Yu & Bing He & Kai Tan, 2017. "Identifying topologically associating domains and subdomains by Gaussian Mixture model And Proportion test," Nature Communications, Nature, vol. 8(1), pages 1-9, December.
    4. Kyle Xiong & Jian Ma, 2019. "Revealing Hi-C subcompartments by imputing inter-chromosomal chromatin interactions," Nature Communications, Nature, vol. 10(1), pages 1-12, December.
    5. Qian Du & Saul A. Bert & Nicola J. Armstrong & C. Elizabeth Caldon & Jenny Z. Song & Shalima S. Nair & Cathryn M. Gould & Phuc-Loi Luu & Timothy Peters & Amanda Khoury & Wenjia Qu & Elena Zotenko & Cl, 2019. "Replication timing and epigenome remodelling are associated with the nature of chromosomal rearrangements in cancer," Nature Communications, Nature, vol. 10(1), pages 1-15, December.
    6. Shilu Zhang & Deborah Chasman & Sara Knaack & Sushmita Roy, 2019. "In silico prediction of high-resolution Hi-C interaction matrices," Nature Communications, Nature, vol. 10(1), pages 1-18, December.
    7. Amanda Khoury & Joanna Achinger-Kawecka & Saul A. Bert & Grady C. Smith & Hugh J. French & Phuc-Loi Luu & Timothy J. Peters & Qian Du & Aled J. Parry & Fatima Valdes-Mora & Phillippa C. Taberlay & Cla, 2020. "Constitutively bound CTCF sites maintain 3D chromatin architecture and long-range epigenetically regulated domains," Nature Communications, Nature, vol. 11(1), pages 1-13, December.
    8. Peter K Koo & Sean R Eddy, 2019. "Representation learning of genomic sequence motifs with convolutional neural networks," PLOS Computational Biology, Public Library of Science, vol. 15(12), pages 1-17, December.
    9. Yun Zhu & Zhao Chen & Kai Zhang & Mengchi Wang & David Medovoy & John W. Whitaker & Bo Ding & Nan Li & Lina Zheng & Wei Wang, 2016. "Constructing 3D interaction maps from 1D epigenomes," Nature Communications, Nature, vol. 7(1), pages 1-11, April.
    10. Yan Zhang & Lin An & Jie Xu & Bo Zhang & W. Jim Zheng & Ming Hu & Jijun Tang & Feng Yue, 2018. "Enhancing Hi-C data resolution with deep convolutional neural network HiCPlus," Nature Communications, Nature, vol. 9(1), pages 1-9, December.
    11. Haitham Ashoor & Xiaowen Chen & Wojciech Rosikiewicz & Jiahui Wang & Albert Cheng & Ping Wang & Yijun Ruan & Sheng Li, 2020. "Graph embedding and unsupervised learning predict genomic sub-compartments from HiC chromatin interaction data," Nature Communications, Nature, vol. 11(1), pages 1-11, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Yanlin Zhang & Mathieu Blanchette, 2022. "Reference panel guided topological structure annotation of Hi-C data," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    2. Zhinian Shu & Xiaorong Li, 2024. "The detection method of continuous outliers in complex network data streams based on C-LSTM," International Journal of System Assurance Engineering and Management, Springer;The Society for Reliability, Engineering Quality and Operations Management (SREQOM),India, and Division of Operation and Maintenance, Lulea University of Technology, Sweden, vol. 15(9), pages 4582-4593, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jingxuan Xu & Xiang Xu & Dandan Huang & Yawen Luo & Lin Lin & Xuemei Bai & Yang Zheng & Qian Yang & Yu Cheng & An Huang & Jingyi Shi & Xiaochen Bo & Jin Gu & Hebing Chen, 2024. "A comprehensive benchmarking with interpretation and operational guidance for the hierarchy of topologically associating domains," Nature Communications, Nature, vol. 15(1), pages 1-19, December.
    2. Lina Zheng & Wei Wang, 2022. "Regulation associated modules reflect 3D genome modularity associated with chromatin activity," Nature Communications, Nature, vol. 13(1), pages 1-11, December.
    3. Yanlin Zhang & Mathieu Blanchette, 2022. "Reference panel guided topological structure annotation of Hi-C data," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    4. Fan Feng & Yuan Yao & Xue Qing David Wang & Xiaotian Zhang & Jie Liu, 2022. "Connecting high-resolution 3D chromatin organization with epigenomics," Nature Communications, Nature, vol. 13(1), pages 1-10, December.
    5. Julia Minderjahn & Alexander Fischer & Konstantin Maier & Karina Mendes & Margit Nuetzel & Johanna Raithel & Hanna Stanewsky & Ute Ackermann & Robert Månsson & Claudia Gebhard & Michael Rehli, 2022. "Postmitotic differentiation of human monocytes requires cohesin-structured chromatin," Nature Communications, Nature, vol. 13(1), pages 1-19, December.
    6. Yufan Zhou & Tian Li & Lavanya Choppavarapu & Kun Fang & Shili Lin & Victor X. Jin, 2024. "Integration of scHi-C and scRNA-seq data defines distinct 3D-regulated and biological-context dependent cell subpopulations," Nature Communications, Nature, vol. 15(1), pages 1-11, December.
    7. Guang Shi & D. Thirumalai, 2023. "A maximum-entropy model to predict 3D structural ensembles of chromatin from pairwise distances with applications to interphase chromosomes and structural variants," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    8. Hao Wang & Jiaxin Yang & Yu Zhang & Jianliang Qian & Jianrong Wang, 2022. "Reconstruct high-resolution 3D genome structures for diverse cell-types using FLAMINGO," Nature Communications, Nature, vol. 13(1), pages 1-18, December.
    9. Surya K Ghosh & Daniel Jost, 2018. "How epigenome drives chromatin folding and dynamics, insights from efficient coarse-grained models of chromosomes," PLOS Computational Biology, Public Library of Science, vol. 14(5), pages 1-26, May.
    10. Mattia Conte & Ehsan Irani & Andrea M. Chiariello & Alex Abraham & Simona Bianco & Andrea Esposito & Mario Nicodemi, 2022. "Loop-extrusion and polymer phase-separation can co-exist at the single-molecule level to shape chromatin folding," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    11. Zhen-Hui Wang & Xin-Feng Wang & Tianyuan Lu & Ming-Rui Li & Peng Jiang & Jing Zhao & Si-Tong Liu & Xue-Qi Fu & Jonathan F. Wendel & Yves Peer & Bao Liu & Lin-Feng Li, 2022. "Reshuffling of the ancestral core-eudicot genome shaped chromatin topology and epigenetic modification in Panax," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    12. Matthias Wielscher & Pooja R. Mandaviya & Brigitte Kuehnel & Roby Joehanes & Rima Mustafa & Oliver Robinson & Yan Zhang & Barbara Bodinier & Esther Walton & Pashupati P. Mishra & Pascal Schlosser & Ro, 2022. "DNA methylation signature of chronic low-grade inflammation and its role in cardio-respiratory diseases," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    13. Li, Xiao & Matsuda, Takeru & Komaki, Fumiyasu, 2024. "Empirical Bayes Poisson matrix completion," Computational Statistics & Data Analysis, Elsevier, vol. 197(C).
    14. Bhuwan Khatri & Kandice L. Tessneer & Astrid Rasmussen & Farhang Aghakhanian & Tove Ragna Reksten & Adam Adler & Ilias Alevizos & Juan-Manuel Anaya & Lara A. Aqrawi & Eva Baecklund & Johan G. Brun & S, 2022. "Genome-wide association study identifies Sjögren’s risk loci with functional implications in immune and glandular cells," Nature Communications, Nature, vol. 13(1), pages 1-17, December.
    15. Chong Wang & Xiang Liu & Jun Liang & Yohei Narita & Weiyue Ding & Difei Li & Luyao Zhang & Hongbo Wang & Merrin Man Long Leong & Isabella Hou & Catherine Gerdt & Chang Jiang & Qian Zhong & Zhonghui Ta, 2023. "A DNA tumor virus globally reprograms host 3D genome architecture to achieve immortal growth," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    16. Liyuan Zhou & Qiongzi Qiu & Qing Zhou & Jianwei Li & Mengqian Yu & Kezhen Li & Lingling Xu & Xiaohui Ke & Haiming Xu & Bingjian Lu & Hui Wang & Weiguo Lu & Pengyuan Liu & Yan Lu, 2022. "Long-read sequencing unveils high-resolution HPV integration and its oncogenic progression in cervical cancer," Nature Communications, Nature, vol. 13(1), pages 1-18, December.
    17. Vinícius G. Contessoto & Olga Dudchenko & Erez Lieberman Aiden & Peter G. Wolynes & José N. Onuchic & Michele Pierro, 2023. "Interphase chromosomes of the Aedes aegypti mosquito are liquid crystalline and can sense mechanical cues," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    18. Yi Li & James Lee & Lu Bai, 2024. "DNA methylation-based high-resolution mapping of long-distance chromosomal interactions in nucleosome-depleted regions," Nature Communications, Nature, vol. 15(1), pages 1-16, December.
    19. Hossein Salari & Geneviève Fourel & Daniel Jost, 2024. "Transcription regulates the spatio-temporal dynamics of genes through micro-compartmentalization," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    20. Alon Diament & Tamir Tuller, 2015. "Improving 3D Genome Reconstructions Using Orthologous and Functional Constraints," PLOS Computational Biology, Public Library of Science, vol. 11(5), pages 1-22, May.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:13:y:2022:i:1:d:10.1038_s41467-022-31337-w. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.