IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0007002.html
   My bibliography  Save this article

Design Parameters to Control Synthetic Gene Expression in Escherichia coli

Author

Listed:
  • Mark Welch
  • Sridhar Govindarajan
  • Jon E Ness
  • Alan Villalobos
  • Austin Gurney
  • Jeremy Minshull
  • Claes Gustafsson

Abstract

Background: Production of proteins as therapeutic agents, research reagents and molecular tools frequently depends on expression in heterologous hosts. Synthetic genes are increasingly used for protein production because sequence information is easier to obtain than the corresponding physical DNA. Protein-coding sequences are commonly re-designed to enhance expression, but there are no experimentally supported design principles. Principal Findings: To identify sequence features that affect protein expression we synthesized and expressed in E. coli two sets of 40 genes encoding two commercially valuable proteins, a DNA polymerase and a single chain antibody. Genes differing only in synonymous codon usage expressed protein at levels ranging from undetectable to 30% of cellular protein. Using partial least squares regression we tested the correlation of protein production levels with parameters that have been reported to affect expression. We found that the amount of protein produced in E. coli was strongly dependent on the codons used to encode a subset of amino acids. Favorable codons were predominantly those read by tRNAs that are most highly charged during amino acid starvation, not codons that are most abundant in highly expressed E. coli proteins. Finally we confirmed the validity of our models by designing, synthesizing and testing new genes using codon biases predicted to perform well. Conclusion: The systematic analysis of gene design parameters shown in this study has allowed us to identify codon usage within a gene as a critical determinant of achievable protein expression levels in E. coli. We propose a biochemical basis for this, as well as design algorithms to ensure high protein production from synthetic genes. Replication of this methodology should allow similar design algorithms to be empirically derived for any expression system.

Suggested Citation

  • Mark Welch & Sridhar Govindarajan & Jon E Ness & Alan Villalobos & Austin Gurney & Jeremy Minshull & Claes Gustafsson, 2009. "Design Parameters to Control Synthetic Gene Expression in Escherichia coli," PLOS ONE, Public Library of Science, vol. 4(9), pages 1-10, September.
  • Handle: RePEc:plo:pone00:0007002
    DOI: 10.1371/journal.pone.0007002
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0007002
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0007002&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0007002?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Alexey A Gritsenko & Marc Hulsman & Marcel J T Reinders & Dick de Ridder, 2015. "Unbiased Quantitative Models of Protein Translation Derived from Ribosome Profiling Data," PLOS Computational Biology, Public Library of Science, vol. 11(8), pages 1-26, August.
    2. Armando Fernandes & Susana Vinga, 2016. "Improving Protein Expression Prediction Using Extra Features and Ensemble Averaging," PLOS ONE, Public Library of Science, vol. 11(3), pages 1-15, March.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0007002. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.