IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v15y2024i1d10.1038_s41467-024-54441-5.html
   My bibliography  Save this article

Predicting cell type-specific epigenomic profiles accounting for distal genetic effects

Author

Listed:
  • Alan E. Murphy

    (UK Dementia Research Institute at Imperial College London
    Imperial College London)

  • William Beardall

    (Imperial College London)

  • Marek Rei

    (Imperial College London)

  • Mike Phuycharoen

    (University of Manchester)

  • Nathan G. Skene

    (UK Dementia Research Institute at Imperial College London
    Imperial College London)

Abstract

Understanding how genetic variants affect the epigenome is key to interpreting GWAS, yet profiling these effects across the non-coding genome remains challenging due to experimental scalability. This necessitates accurate computational models. Existing machine learning approaches, while progressively improving, are confined to the cell types they were trained on, limiting their applicability. Here, we introduce Enformer Celltyping, a deep learning model which incorporates distal effects of DNA interactions, up to 100,000 base-pairs away, to predict epigenetic signals in previously unseen cell types. Using DNA and chromatin accessibility data for epigenetic imputation, Enformer Celltyping outperforms current best-in-class approaches and generalises across cell types and biological regions. Moreover, we propose a framework for evaluating models on genetic variant effect prediction using regulatory quantitative trait loci mapping studies, highlighting current limitations in genomic deep learning models. Despite this, Enformer Celltyping can also be used to study cell type-specific genetic enrichment of complex traits.

Suggested Citation

  • Alan E. Murphy & William Beardall & Marek Rei & Mike Phuycharoen & Nathan G. Skene, 2024. "Predicting cell type-specific epigenomic profiles accounting for distal genetic effects," Nature Communications, Nature, vol. 15(1), pages 1-19, December.
  • Handle: RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-024-54441-5
    DOI: 10.1038/s41467-024-54441-5
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-024-54441-5
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-024-54441-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Anshul Kundaje & Wouter Meuleman & Jason Ernst & Misha Bilenky & Angela Yen & Alireza Heravi-Moussavi & Pouya Kheradpour & Zhizhuo Zhang & Jianrong Wang & Michael J. Ziller & Viren Amin & John W. Whit, 2015. "Integrative analysis of 111 reference human epigenomes," Nature, Nature, vol. 518(7539), pages 317-330, February.
    2. David R Kelley, 2020. "Cross-species regulatory sequence activity prediction," PLOS Computational Biology, Public Library of Science, vol. 16(7), pages 1-27, July.
    3. Timothy J. Durham & Maxwell W. Libbrecht & J. Jeffry Howbert & Jeff Bilmes & William Stafford Noble, 2018. "PREDICTD PaRallel Epigenomics Data Imputation with Cloud-based Tensor Decomposition," Nature Communications, Nature, vol. 9(1), pages 1-15, December.
    4. Carles A. Boix & Benjamin T. James & Yongjin P. Park & Wouter Meuleman & Manolis Kellis, 2021. "Regulatory genomic circuitry of human disease loci by integrative epigenomics," Nature, Nature, vol. 590(7845), pages 300-307, February.
    5. Vassily Trubetskoy & Antonio F. Pardiñas & Ting Qi & Georgia Panagiotaropoulou & Swapnil Awasthi & Tim B. Bigdeli & Julien Bryois & Chia-Yen Chen & Charlotte A. Dennison & Lynsey S. Hall & Max Lam & K, 2022. "Mapping genomic loci implicates genes and synaptic biology in schizophrenia," Nature, Nature, vol. 604(7906), pages 502-508, April.
    6. Jill E. Moore & Michael J. Purcaro & Henry E. Pratt & Charles B. Epstein & Noam Shoresh & Jessika Adrian & Trupti Kawli & Carrie A. Davis & Alexander Dobin & Rajinder Kaul & Jessica Halow & Eric L. No, 2020. "Expanded encyclopaedias of DNA elements in the human and mouse genomes," Nature, Nature, vol. 583(7818), pages 699-710, July.
    7. Yeda Wu & Graham K. Murray & Enda M. Byrne & Julia Sidorenko & Peter M. Visscher & Naomi R. Wray, 2021. "GWAS of peptic ulcer disease implicates Helicobacter pylori infection, other gastrointestinal disorders and depression," Nature Communications, Nature, vol. 12(1), pages 1-17, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Samuel S. Kim & Buu Truong & Karthik Jagadeesh & Kushal K. Dey & Amber Z. Shen & Soumya Raychaudhuri & Manolis Kellis & Alkes L. Price, 2024. "Leveraging single-cell ATAC-seq and RNA-seq to identify disease-critical fetal and adult brain cell types," Nature Communications, Nature, vol. 15(1), pages 1-11, December.
    2. Wendiao Zhang & Ming Zhang & Zhenhong Xu & Hongye Yan & Huimin Wang & Jiamei Jiang & Juan Wan & Beisha Tang & Chunyu Liu & Chao Chen & Qingtuan Meng, 2023. "Human forebrain organoid-based multi-omics analyses of PCCB as a schizophrenia associated gene linked to GABAergic pathways," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    3. Pingting Ying & Can Chen & Zequn Lu & Shuoni Chen & Ming Zhang & Yimin Cai & Fuwei Zhang & Jinyu Huang & Linyun Fan & Caibo Ning & Yanmin Li & Wenzhuo Wang & Hui Geng & Yizhuo Liu & Wen Tian & Zhiyong, 2023. "Genome-wide enhancer-gene regulatory maps link causal variants to target genes underlying human cancer risk," Nature Communications, Nature, vol. 14(1), pages 1-20, December.
    4. Alex Hawkins-Hooker & Giovanni Visonà & Tanmayee Narendra & Mateo Rojas-Carulla & Bernhard Schölkopf & Gabriele Schweikert, 2023. "Getting personal with epigenetics: towards individual-specific epigenomic imputation with machine learning," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    5. Milton Pividori & Sumei Lu & Binglan Li & Chun Su & Matthew E. Johnson & Wei-Qi Wei & Qiping Feng & Bahram Namjou & Krzysztof Kiryluk & Iftikhar J. Kullo & Yuan Luo & Blair D. Sullivan & Benjamin F. V, 2023. "Projecting genetic associations through gene expression patterns highlights disease etiology and drug mechanisms," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
    6. Kosmidis, Kosmas & Hütt, Marc-Thorsten, 2023. "DNA visibility graphs," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 626(C).
    7. Andrew D. Grotzinger & Travis T. Mallard & Zhaowen Liu & Jakob Seidlitz & Tian Ge & Jordan W. Smoller, 2023. "Multivariate genomic architecture of cortical thickness and surface area at multiple levels of analysis," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    8. Andrew D. Grotzinger & Javier de la Fuente & Gail Davies & Michel G. Nivard & Elliot M. Tucker-Drob, 2022. "Transcriptome-wide and stratified genomic structural equation modeling identify neurobiological pathways shared across diverse cognitive traits," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    9. Alan Selewa & Kaixuan Luo & Michael Wasney & Linsin Smith & Xiaotong Sun & Chenwei Tang & Heather Eckart & Ivan P. Moskowitz & Anindita Basu & Xin He & Sebastian Pott, 2023. "Single-cell genomics improves the discovery of risk variants and genes of atrial fibrillation," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
    10. Yirong Shi & Yiwei Niu & Peng Zhang & Huaxia Luo & Shuai Liu & Sijia Zhang & Jiajia Wang & Yanyan Li & Xinyue Liu & Tingrui Song & Tao Xu & Shunmin He, 2023. "Characterization of genome-wide STR variation in 6487 human genomes," Nature Communications, Nature, vol. 14(1), pages 1-18, December.
    11. Caitlin E. Carey & Rebecca Shafee & Robbee Wedow & Amanda Elliott & Duncan S. Palmer & John Compitello & Masahiro Kanai & Liam Abbott & Patrick Schultz & Konrad J. Karczewski & Samuel C. Bryant & Caro, 2024. "Principled distillation of UK Biobank phenotype data reveals underlying structure in human variation," Nature Human Behaviour, Nature, vol. 8(8), pages 1599-1615, August.
    12. Clarice K. Y. Hong & Yawei Wu & Alyssa A. Erickson & Jie Li & Arnold J. Federico & Barak A. Cohen, 2024. "Massively parallel characterization of insulator activity across the genome," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    13. Shahram Bahrami & Kaja Nordengen & Jaroslav Rokicki & Alexey A. Shadrin & Zillur Rahman & Olav B. Smeland & Piotr P. Jaholkowski & Nadine Parker & Pravesh Parekh & Kevin S. O’Connell & Torbjørn Elvsås, 2024. "The genetic landscape of basal ganglia and implications for common brain disorders," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    14. Matthias Wielscher & Pooja R. Mandaviya & Brigitte Kuehnel & Roby Joehanes & Rima Mustafa & Oliver Robinson & Yan Zhang & Barbara Bodinier & Esther Walton & Pashupati P. Mishra & Pascal Schlosser & Ro, 2022. "DNA methylation signature of chronic low-grade inflammation and its role in cardio-respiratory diseases," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    15. Matthew Tegtmeyer & Jatin Arora & Samira Asgari & Beth A. Cimini & Ajay Nadig & Emily Peirent & Dhara Liyanage & Gregory P. Way & Erin Weisbart & Aparna Nathan & Tiffany Amariuta & Kevin Eggan & Marzi, 2024. "High-dimensional phenotyping to define the genetic basis of cellular morphology," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    16. Xuelong Yao & Zongyang Lu & Zhanying Feng & Lei Gao & Xin Zhou & Min Li & Suijuan Zhong & Qian Wu & Zhenbo Liu & Haofeng Zhang & Zeyuan Liu & Lizhi Yi & Tao Zhou & Xudong Zhao & Jun Zhang & Yong Wang , 2022. "Comparison of chromatin accessibility landscapes during early development of prefrontal cortex between rhesus macaque and human," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    17. Yash Patel & Jean Shin & Eeva Sliz & Ariana Tang & Aniket Mishra & Rui Xia & Edith Hofer & Hema Sekhar Reddy Rajula & Ruiqi Wang & Frauke Beyer & Katrin Horn & Max Riedl & Jing Yu & Henry Völzke & Rob, 2024. "Genetic risk factors underlying white matter hyperintensities and cortical atrophy," Nature Communications, Nature, vol. 15(1), pages 1-11, December.
    18. Gabriele Marcassa & Dan Dascenco & Blanca Lorente-Echeverría & Danie Daaboul & Jeroen Vandensteen & Elke Leysen & Lucas Baltussen & Andrew J. M. Howden & Joris Wit, 2025. "Synaptic signatures and disease vulnerabilities of layer 5 pyramidal neurons," Nature Communications, Nature, vol. 16(1), pages 1-16, December.
    19. Jordi Manuello & Joosung Min & Paul McCarthy & Fidel Alfaro-Almagro & Soojin Lee & Stephen Smith & Lloyd T. Elliott & Anderson M. Winkler & Gwenaëlle Douaud, 2024. "The effects of genetic and modifiable risk factors on brain regions vulnerable to ageing and disease," Nature Communications, Nature, vol. 15(1), pages 1-11, December.
    20. Ruoyu Tian & Tian Ge & Hyeokmoon Kweon & Daniel B. Rocha & Max Lam & Jimmy Z. Liu & Kritika Singh & Daniel F. Levey & Joel Gelernter & Murray B. Stein & Ellen A. Tsai & Hailiang Huang & Christopher F., 2024. "Whole-exome sequencing in UK Biobank reveals rare genetic architecture for depression," Nature Communications, Nature, vol. 15(1), pages 1-12, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-024-54441-5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.