Author
Listed:
- Jonathan C. Chen
(Broad Institute of Harvard and MIT
Harvard University
Harvard University)
- Jonathan P. Chen
(Uber Technologies, Inc.
Meta Platforms)
- Max W. Shen
(Broad Institute of Harvard and MIT
Harvard University
Harvard University
Massachusetts Institute of Technology)
- Michael Wornow
(Broad Institute of Harvard and MIT
Harvard University)
- Minwoo Bae
(Broad Institute of Harvard and MIT
Harvard University)
- Wei-Hsi Yeh
(Broad Institute of Harvard and MIT
Harvard University
Harvard University
Harvard Medical School)
- Alvin Hsu
(Broad Institute of Harvard and MIT
Harvard University
Harvard University)
- David R. Liu
(Broad Institute of Harvard and MIT
Harvard University
Harvard University)
Abstract
In vitro selection queries large combinatorial libraries for sequence-defined polymers with target binding and reaction catalysis activity. While the total sequence space of these libraries can extend beyond 1022 sequences, practical considerations limit starting sequences to ≤~1015 distinct molecules. Selection-induced sequence convergence and limited sequencing depth further constrain experimentally observable sequence space. To address these limitations, we integrate experimental and machine learning approaches to explore regions of sequence space unrelated to experimentally derived variants. We perform in vitro selections to discover highly side-chain-functionalized nucleic acid polymers (HFNAPs) with potent affinities for a target small molecule (daunomycin KD = 5–65 nM). We then use the selection data to train a conditional variational autoencoder (CVAE) machine learning model to generate diverse and unique HFNAP sequences with high daunomycin affinities (KD = 9–26 nM), even though they are unrelated in sequence to experimental polymers. Coupling in vitro selection with a machine learning model thus enables direct generation of active variants, demonstrating a new approach to the discovery of functional biopolymers.
Suggested Citation
Jonathan C. Chen & Jonathan P. Chen & Max W. Shen & Michael Wornow & Minwoo Bae & Wei-Hsi Yeh & Alvin Hsu & David R. Liu, 2022.
"Generating experimentally unrelated target molecule-binding highly functionalized nucleic-acid polymers using machine learning,"
Nature Communications, Nature, vol. 13(1), pages 1-17, December.
Handle:
RePEc:nat:natcom:v:13:y:2022:i:1:d:10.1038_s41467-022-31955-4
DOI: 10.1038/s41467-022-31955-4
Download full text from publisher
Most related items
These are the items that most often cite the same works as this one and are cited by the same works as this one.
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:13:y:2022:i:1:d:10.1038_s41467-022-31955-4. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.