IDEAS home Printed from https://ideas.repec.org/a/plo/pcbi00/1005786.html
   My bibliography  Save this article

Machine learning to design integral membrane channelrhodopsins for efficient eukaryotic expression and plasma membrane localization

Author

Listed:
  • Claire N Bedbrook
  • Kevin K Yang
  • Austin J Rice
  • Viviana Gradinaru
  • Frances H Arnold

Abstract

There is growing interest in studying and engineering integral membrane proteins (MPs) that play key roles in sensing and regulating cellular response to diverse external signals. A MP must be expressed, correctly inserted and folded in a lipid bilayer, and trafficked to the proper cellular location in order to function. The sequence and structural determinants of these processes are complex and highly constrained. Here we describe a predictive, machine-learning approach that captures this complexity to facilitate successful MP engineering and design. Machine learning on carefully-chosen training sequences made by structure-guided SCHEMA recombination has enabled us to accurately predict the rare sequences in a diverse library of channelrhodopsins (ChRs) that express and localize to the plasma membrane of mammalian cells. These light-gated channel proteins of microbial origin are of interest for neuroscience applications, where expression and localization to the plasma membrane is a prerequisite for function. We trained Gaussian process (GP) classification and regression models with expression and localization data from 218 ChR chimeras chosen from a 118,098-variant library designed by SCHEMA recombination of three parent ChRs. We use these GP models to identify ChRs that express and localize well and show that our models can elucidate sequence and structure elements important for these processes. We also used the predictive models to convert a naturally occurring ChR incapable of mammalian localization into one that localizes well.Author summary: A protein’s amino acid sequence determines how it will fold, traffic to subcellular locations, and carry out specific functions within the cell. Understanding this process would enable the design of protein sequences capable of useful functions; unfortunately, we cannot predict in detail how sequence encodes function. However, machine-learning models have the potential to infer the complex protein sequence-function relationship by identifying patterns or features that are important for function from sequences with known functions. We used machine learning to learn about and design membrane proteins (MPs). To function, a MP must be expressed, correctly folded in a lipid membrane, and trafficked to the proper cellular location. We built predictive, machine-learning models for this complex process from a set of >200 chimeric MPs and used them to design new sequences with optimal performance on the challenging task of membrane localization. This general approach to understanding and designing MPs could be broadly useful for important pharmaceutical and engineering MP targets.

Suggested Citation

  • Claire N Bedbrook & Kevin K Yang & Austin J Rice & Viviana Gradinaru & Frances H Arnold, 2017. "Machine learning to design integral membrane channelrhodopsins for efficient eukaryotic expression and plasma membrane localization," PLOS Computational Biology, Public Library of Science, vol. 13(10), pages 1-21, October.
  • Handle: RePEc:plo:pcbi00:1005786
    DOI: 10.1371/journal.pcbi.1005786
    as

    Download full text from publisher

    File URL: https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1005786
    Download Restriction: no

    File URL: https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1005786&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pcbi.1005786?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Hideaki E. Kato & Feng Zhang & Ofer Yizhar & Charu Ramakrishnan & Tomohiro Nishizawa & Kunio Hirata & Jumpei Ito & Yusuke Aita & Tomoya Tsukazaki & Shigehiko Hayashi & Peter Hegemann & Andrés D. Matur, 2012. "Crystal structure of the channelrhodopsin light-gated cation channel," Nature, Nature, vol. 482(7385), pages 369-374, February.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Ribeiro, Barbara & Shapira, Philip, 2019. "Anticipating governance challenges in synthetic biology: Insights from biosynthetic menthol," Technological Forecasting and Social Change, Elsevier, vol. 139(C), pages 311-320.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yuanyue Shan & Liping Zhao & Meiyu Chen & Xiao Li & Mingfeng Zhang & Duanqing Pei, 2024. "Channelrhodopsins with distinct chromophores and binding patterns," Nature Communications, Nature, vol. 15(1), pages 1-10, December.
    2. Kyle Tucker & Savitha Sridharan & Hillel Adesnik & Stephen G. Brohawn, 2022. "Cryo-EM structures of the channelrhodopsin ChRmine in lipid nanodiscs," Nature Communications, Nature, vol. 13(1), pages 1-12, December.
    3. Takefumi Morizumi & Kyumhyuk Kim & Hai Li & Elena G. Govorunova & Oleg A. Sineshchekov & Yumei Wang & Lei Zheng & Éva Bertalan & Ana-Nicoleta Bondar & Azam Askari & Leonid S. Brown & John L. Spudich &, 2023. "Structures of channelrhodopsin paralogs in peptidiscs explain their contrasting K+ and Na+ selectivities," Nature Communications, Nature, vol. 14(1), pages 1-13, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1005786. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.