IDEAS home Printed from https://ideas.repec.org/a/nat/nature/v634y2024i8036d10.1038_s41586-024-08070-z.html
   My bibliography  Save this article

Machine-guided design of cell-type-targeting cis-regulatory elements

Author

Listed:
  • Sager J. Gosai

    (Broad Institute of MIT and Harvard
    Harvard Graduate Program in Biological and Biomedical Science
    Harvard University
    Howard Hughes Medical Institute)

  • Rodrigo I. Castro

    (The Jackson Laboratory)

  • Natalia Fuentes

    (The Jackson Laboratory
    Harvard University)

  • John C. Butts

    (The Jackson Laboratory
    University of Maine)

  • Kousuke Mouri

    (The Jackson Laboratory)

  • Michael Alasoadura

    (The Jackson Laboratory)

  • Susan Kales

    (The Jackson Laboratory)

  • Thanh Thanh L. Nguyen

    (Yale School of Medicine)

  • Ramil R. Noche

    (Yale School of Medicine
    Yale School of Medicine)

  • Arya S. Rao

    (Broad Institute of MIT and Harvard
    Harvard Medical School)

  • Mary T. Joy

    (The Jackson Laboratory)

  • Pardis C. Sabeti

    (Broad Institute of MIT and Harvard
    Harvard University
    Howard Hughes Medical Institute
    Harvard T H Chan School of Public Health, Harvard University)

  • Steven K. Reilly

    (Yale School of Medicine
    Yale University)

  • Ryan Tewhey

    (The Jackson Laboratory
    University of Maine
    Tufts University School of Medicine)

Abstract

Cis-regulatory elements (CREs) control gene expression, orchestrating tissue identity, developmental timing and stimulus responses, which collectively define the thousands of unique cell types in the body1–3. While there is great potential for strategically incorporating CREs in therapeutic or biotechnology applications that require tissue specificity, there is no guarantee that an optimal CRE for these intended purposes has arisen naturally. Here we present a platform to engineer and validate synthetic CREs capable of driving gene expression with programmed cell-type specificity. We take advantage of innovations in deep neural network modelling of CRE activity across three cell types, efficient in silico optimization and massively parallel reporter assays to design and empirically test thousands of CREs4–8. Through large-scale in vitro validation, we show that synthetic sequences are more effective at driving cell-type-specific expression in three cell lines compared with natural sequences from the human genome and achieve specificity in analogous tissues when tested in vivo. Synthetic sequences exhibit distinct motif vocabulary associated with activity in the on-target cell type and a simultaneous reduction in the activity of off-target cells. Together, we provide a generalizable framework to prospectively engineer CREs from massively parallel reporter assay models and demonstrate the required literacy to write fit-for-purpose regulatory code.

Suggested Citation

  • Sager J. Gosai & Rodrigo I. Castro & Natalia Fuentes & John C. Butts & Kousuke Mouri & Michael Alasoadura & Susan Kales & Thanh Thanh L. Nguyen & Ramil R. Noche & Arya S. Rao & Mary T. Joy & Pardis C., 2024. "Machine-guided design of cell-type-targeting cis-regulatory elements," Nature, Nature, vol. 634(8036), pages 1211-1220, October.
  • Handle: RePEc:nat:nature:v:634:y:2024:i:8036:d:10.1038_s41586-024-08070-z
    DOI: 10.1038/s41586-024-08070-z
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41586-024-08070-z
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1038/s41586-024-08070-z?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:nature:v:634:y:2024:i:8036:d:10.1038_s41586-024-08070-z. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.