IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v15y2024i1d10.1038_s41467-024-51895-5.html
   My bibliography  Save this article

The simplicity of protein sequence-function relationships

Author

Listed:
  • Yeonwoo Park

    (University of Chicago
    Institute for Basic Science)

  • Brian P. H. Metzger

    (University of Chicago
    Purdue University)

  • Joseph W. Thornton

    (University of Chicago
    University of Chicago)

Abstract

How complex are the rules by which a protein’s sequence determines its function? High-order epistatic interactions among residues are thought to be pervasive, suggesting an idiosyncratic and unpredictable sequence-function relationship. But many prior studies may have overestimated epistasis, because they analyzed sequence-function relationships relative to a single reference sequence—which causes measurement noise and local idiosyncrasies to snowball into high-order epistasis—or they did not fully account for global nonlinearities. Here we present a reference-free method that jointly infers specific epistatic interactions and global nonlinearity using a bird’s-eye view of sequence space. This technique yields the simplest explanation of sequence-function relationships and is more robust than existing methods to measurement noise, missing data, and model misspecification. We reanalyze 20 experimental datasets and find that context-independent amino acid effects and pairwise interactions, along with a simple nonlinearity to account for limited dynamic range, explain a median of 96% of phenotypic variance and over 92% in every case. Only a tiny fraction of genotypes are strongly affected by higher-order epistasis. Sequence-function relationships are also sparse: a miniscule fraction of amino acids and interactions account for 90% of phenotypic variance. Sequence-function causality across these datasets is therefore simple, opening the way for tractable approaches to characterize proteins’ genetic architecture.

Suggested Citation

  • Yeonwoo Park & Brian P. H. Metzger & Joseph W. Thornton, 2024. "The simplicity of protein sequence-function relationships," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
  • Handle: RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-024-51895-5
    DOI: 10.1038/s41467-024-51895-5
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-024-51895-5
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-024-51895-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Adam C. Palmer & Erdal Toprak & Michael Baym & Seungsoo Kim & Adrian Veres & Shimon Bershtein & Roy Kishony, 2015. "Delayed commitment to evolutionary fate in antibiotic resistance fitness landscapes," Nature Communications, Nature, vol. 6(1), pages 1-8, November.
    2. Frank J. Poelwijk & Michael Socolich & Rama Ranganathan, 2019. "Learning the pattern of epistasis linking genotype and phenotype in a protein," Nature Communications, Nature, vol. 10(1), pages 1-11, December.
    3. Dave W. Anderson & Florian Baier & Gloria Yang & Nobuhiko Tokuriki, 2021. "The adaptive landscape of a metallo-enzyme is shaped by environment-dependent epistasis," Nature Communications, Nature, vol. 12(1), pages 1-11, December.
    4. Karol Buda & Charlotte M. Miton & Nobuhiko Tokuriki, 2023. "Pervasive epistasis exposes intramolecular networks in adaptive enzyme evolution," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    5. Júlia Domingo & Guillaume Diss & Ben Lehner, 2018. "Pairwise and higher-order genetic interactions during the evolution of a tRNA," Nature, Nature, vol. 558(7708), pages 117-121, June.
    6. David Ding & Ada Y. Shaw & Sam Sinai & Nathan Rollins & Noam Prywes & David F. Savage & Michael T. Laub & Debora S. Marks, 2024. "Protein design using structure-based residue preferences," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    7. Nicholas C. Wu & Jakub Otwinowski & Andrew J. Thompson & Corwin M. Nycholat & Armita Nourmohammad & Ian A. Wilson, 2020. "Major antigenic site B of human influenza H3N2 viruses has an evolving local fitness landscape," Nature Communications, Nature, vol. 11(1), pages 1-10, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Karol Buda & Charlotte M. Miton & Nobuhiko Tokuriki, 2023. "Pervasive epistasis exposes intramolecular networks in adaptive enzyme evolution," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    2. Andreas Wagner, 2023. "Evolvability-enhancing mutations in the fitness landscapes of an RNA and a protein," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
    3. Solip Park & Fran Supek & Ben Lehner, 2021. "Higher order genetic interactions switch cancer genes from two-hit to one-hit drivers," Nature Communications, Nature, vol. 12(1), pages 1-10, December.
    4. Rachapun Rotrattanadumrong & Yohei Yokobayashi, 2022. "Experimental exploration of a ribozyme neutral network using evolutionary algorithm and deep learning," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    5. Brian M. Petersen & Monica B. Kirby & Karson M. Chrispens & Olivia M. Irvin & Isabell K. Strawn & Cyrus M. Haas & Alexis M. Walker & Zachary T. Baumer & Sophia A. Ulmer & Edgardo Ayala & Emily R. Rhod, 2024. "An integrated technology for quantitative wide mutational scanning of human antibody Fab libraries," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    6. Javier Santos-Moreno & Eve Tasiudi & Hadiastri Kusumawardhani & Joerg Stelling & Yolanda Schaerli, 2023. "Robustness and innovation in synthetic genotype networks," Nature Communications, Nature, vol. 14(1), pages 1-17, December.
    7. Jonathan Yaacov Weinstein & Carlos Martí-Gómez & Rosalie Lipsh-Sokolik & Shlomo Yakir Hoch & Demian Liebermann & Reinat Nevo & Haim Weissman & Ekaterina Petrovich-Kopitman & David Margulies & Dmitry I, 2023. "Designed active-site library reveals thousands of functional GFP variants," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    8. Syed Awais W. Shah & Daniel P. Palomar & Ian Barr & Leo L. M. Poon & Ahmed Abdul Quadeer & Matthew R. McKay, 2024. "Seasonal antigenic prediction of influenza A H3N2 using machine learning," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    9. Federica Luppino & Ivan A. Adzhubei & Christopher A. Cassa & Agnes Toth-Petroczy, 2023. "DeMAG predicts the effects of variants in clinically actionable genes by integrating structural and evolutionary epistatic features," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    10. David C. Marciano & Chen Wang & Teng-Kuei Hsu & Thomas Bourquard & Benu Atri & Ralf B. Nehring & Nicholas S. Abel & Elizabeth A. Bowling & Taylor J. Chen & Pamela D. Lurie & Panagiotis Katsonis & Susa, 2022. "Evolutionary action of mutations reveals antimicrobial resistance genes in Escherichia coli," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    11. David Ding & Ada Y. Shaw & Sam Sinai & Nathan Rollins & Noam Prywes & David F. Savage & Michael T. Laub & Debora S. Marks, 2024. "Protein design using structure-based residue preferences," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    12. Takahiro Nemoto & Tommaso Ocari & Arthur Planul & Muge Tekinsoy & Emilia A. Zin & Deniz Dalkara & Ulisse Ferrari, 2023. "ACIDES: on-line monitoring of forward genetic screens for protein engineering," Nature Communications, Nature, vol. 14(1), pages 1-11, December.
    13. Christos S. Karamitros & Kyle Murray & Yoichi Kumada & Kenneth A. Johnson & Sheena D’Arcy & George Georgiou, 2024. "Mechanistic conformational and substrate selectivity profiles emerging in the evolution of enzymes via parallel trajectories," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    14. Remkes A. Scheele & Laurens H. Lindenburg & Maya Petek & Markus Schober & Kevin N. Dalby & Florian Hollfelder, 2022. "Droplet-based screening of phosphate transfer catalysis reveals how epistasis shapes MAP kinase interactions with substrates," Nature Communications, Nature, vol. 13(1), pages 1-14, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:15:y:2024:i:1:d:10.1038_s41467-024-51895-5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.