IDEAS home Printed from https://ideas.repec.org/a/nat/nature/v570y2019i7761d10.1038_s41586-019-1274-4.html
   My bibliography  Save this article

De novo protein design by citizen scientists

Author

Listed:
  • Brian Koepnick

    (University of Washington
    University of Washington)

  • Jeff Flatten

    (University of Washington)

  • Tamir Husain

    (University of Washington)

  • Alex Ford

    (University of Washington
    University of Washington)

  • Daniel-Adriano Silva

    (University of Washington
    University of Washington)

  • Matthew J. Bick

    (University of Washington
    University of Washington)

  • Aaron Bauer

    (University of Washington)

  • Gaohua Liu

    (Rutgers University The State University of New Jersey
    Nexomics Biosciences)

  • Yojiro Ishida

    (Rutgers The State University of New Jersey)

  • Alexander Boykov
  • Roger D. Estep
  • Susan Kleinfelter
  • Toke Nørgård-Solano
  • Linda Wei
  • Foldit Players
  • Gaetano T. Montelione

    (Rutgers University The State University of New Jersey
    Rutgers The State University of New Jersey)

  • Frank DiMaio

    (University of Washington
    University of Washington)

  • Zoran Popović

    (University of Washington)

  • Firas Khatib

    (University of Massachusetts Dartmouth)

  • Seth Cooper

    (Northeastern University)

  • David Baker

    (University of Washington
    University of Washington
    University of Washington)

Abstract

Online citizen science projects such as GalaxyZoo1, Eyewire2 and Phylo3 have proven very successful for data collection, annotation and processing, but for the most part have harnessed human pattern-recognition skills rather than human creativity. An exception is the game EteRNA4, in which game players learn to build new RNA structures by exploring the discrete two-dimensional space of Watson–Crick base pairing possibilities. Building new proteins, however, is a more challenging task to present in a game, as both the representation and evaluation of a protein structure are intrinsically three-dimensional. We posed the challenge of de novo protein design in the online protein-folding game Foldit5. Players were presented with a fully extended peptide chain and challenged to craft a folded protein structure and an amino acid sequence encoding that structure. After many iterations of player design, analysis of the top-scoring solutions and subsequent game improvement, Foldit players can now—starting from an extended polypeptide chain—generate a diversity of protein structures and sequences that encode them in silico. One hundred forty-six Foldit player designs with sequences unrelated to naturally occurring proteins were encoded in synthetic genes; 56 were found to be expressed and soluble in Escherichia coli, and to adopt stable monomeric folded structures in solution. The diversity of these structures is unprecedented in de novo protein design, representing 20 different folds—including a new fold not observed in natural proteins. High-resolution structures were determined for four of the designs, and are nearly identical to the player models. This work makes explicit the considerable implicit knowledge that contributes to success in de novo protein design, and shows that citizen scientists can discover creative new solutions to outstanding scientific challenges such as the protein design problem.

Suggested Citation

  • Brian Koepnick & Jeff Flatten & Tamir Husain & Alex Ford & Daniel-Adriano Silva & Matthew J. Bick & Aaron Bauer & Gaohua Liu & Yojiro Ishida & Alexander Boykov & Roger D. Estep & Susan Kleinfelter & T, 2019. "De novo protein design by citizen scientists," Nature, Nature, vol. 570(7761), pages 390-394, June.
  • Handle: RePEc:nat:nature:v:570:y:2019:i:7761:d:10.1038_s41586-019-1274-4
    DOI: 10.1038/s41586-019-1274-4
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41586-019-1274-4
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1038/s41586-019-1274-4?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Fatima A. Davila-Hernandez & Biao Jin & Harley Pyles & Shuai Zhang & Zheming Wang & Timothy F. Huddy & Asim K. Bera & Alex Kang & Chun-Long Chen & James J. Yoreo & David Baker, 2023. "Directing polymorph specific calcium carbonate formation with de novo protein templates," Nature Communications, Nature, vol. 14(1), pages 1-11, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:nature:v:570:y:2019:i:7761:d:10.1038_s41586-019-1274-4. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.