IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v14y2023i1d10.1038_s41467-023-40129-9.html
   My bibliography  Save this article

MSBooster: improving peptide identification rates using deep learning-based features

Author

Listed:
  • Kevin L. Yang

    (University of Michigan)

  • Fengchao Yu

    (University of Michigan)

  • Guo Ci Teo

    (University of Michigan)

  • Kai Li

    (University of Michigan)

  • Vadim Demichev

    (Charité Universitätsmedizin
    University of Cambridge)

  • Markus Ralser

    (Charité Universitätsmedizin
    University of Oxford
    Max Planck Institute for Molecular Genetics)

  • Alexey I. Nesvizhskii

    (University of Michigan
    University of Michigan)

Abstract

Peptide identification in liquid chromatography-tandem mass spectrometry (LC-MS/MS) experiments relies on computational algorithms for matching acquired MS/MS spectra against sequences of candidate peptides using database search tools, such as MSFragger. Here, we present a new tool, MSBooster, for rescoring peptide-to-spectrum matches using additional features incorporating deep learning-based predictions of peptide properties, such as LC retention time, ion mobility, and MS/MS spectra. We demonstrate the utility of MSBooster, in tandem with MSFragger and Percolator, in several different workflows, including nonspecific searches (immunopeptidomics), direct identification of peptides from data independent acquisition data, single-cell proteomics, and data generated on an ion mobility separation-enabled timsTOF MS platform. MSBooster is fast, robust, and fully integrated into the widely used FragPipe computational platform.

Suggested Citation

  • Kevin L. Yang & Fengchao Yu & Guo Ci Teo & Kai Li & Vadim Demichev & Markus Ralser & Alexey I. Nesvizhskii, 2023. "MSBooster: improving peptide identification rates using deep learning-based features," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
  • Handle: RePEc:nat:natcom:v:14:y:2023:i:1:d:10.1038_s41467-023-40129-9
    DOI: 10.1038/s41467-023-40129-9
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-023-40129-9
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-023-40129-9?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Florian Meier & Niklas D. Köhler & Andreas-David Brunner & Jean-Marc H. Wanka & Eugenia Voytik & Maximilian T. Strauss & Fabian J. Theis & Matthias Mann, 2021. "Deep learning the collisional cross sections of the peptide universe from a million experimental values," Nature Communications, Nature, vol. 12(1), pages 1-12, December.
    2. Wen-Feng Zeng & Xie-Xuan Zhou & Sander Willems & Constantin Ammar & Maria Wahle & Isabell Bludau & Eugenia Voytik & Maximillian T. Strauss & Matthias Mann, 2022. "AlphaPeptDeep: a modular deep learning framework to predict peptide properties for proteomics," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    3. Ruedi Aebersold & Matthias Mann, 2016. "Mass-spectrometric exploration of proteome structure and function," Nature, Nature, vol. 537(7620), pages 347-355, September.
    4. Michal Bassani-Sternberg & Eva Bräunlein & Richard Klar & Thomas Engleitner & Pavel Sinitcyn & Stefan Audehm & Melanie Straub & Julia Weber & Julia Slotta-Huspenina & Katja Specht & Marc E. Martignoni, 2016. "Direct identification of clinically relevant neoepitopes presented on native human melanoma tissue by mass spectrometry," Nature Communications, Nature, vol. 7(1), pages 1-16, December.
    5. Bo Wen & Kai Li & Yun Zhang & Bing Zhang, 2020. "Cancer neoantigen prioritization through sensitive and reliable proteogenomics analysis," Nature Communications, Nature, vol. 11(1), pages 1-14, December.
    6. Mathias Wilhelm & Daniel P. Zolg & Michael Graber & Siegfried Gessulat & Tobias Schmidt & Karsten Schnatbaum & Celina Schwencke-Westphal & Philipp Seifert & Niklas Andrade Krätzig & Johannes Zerweck &, 2021. "Deep learning boosts sensitivity of mass spectrometry-based immunopeptidomics," Nature Communications, Nature, vol. 12(1), pages 1-12, December.
    7. Yi Yang & Xiaohui Liu & Chengpin Shen & Yu Lin & Pengyuan Yang & Liang Qiao, 2020. "In silico spectral libraries by deep learning facilitate data-independent acquisition proteomics," Nature Communications, Nature, vol. 11(1), pages 1-11, December.
    8. Mathias Wilhelm & Daniel P. Zolg & Michael Graber & Siegfried Gessulat & Tobias Schmidt & Karsten Schnatbaum & Celina Schwencke-Westphal & Philipp Seifert & Niklas Andrade Krätzig & Johannes Zerweck &, 2021. "Author Correction: Deep learning boosts sensitivity of mass spectrometry-based immunopeptidomics," Nature Communications, Nature, vol. 12(1), pages 1-1, December.
    9. Fengchao Yu & Guo Ci Teo & Andy T. Kong & Klemens Fröhlich & Ginny Xiaohe Li & Vadim Demichev & Alexey I. Nesvizhskii, 2023. "Analysis of DIA proteomics data using MSFragger-DIA and FragPipe computational platform," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Humberto J. Ferreira & Brian J. Stevenson & HuiSong Pak & Fengchao Yu & Jessica Almeida Oliveira & Florian Huber & Marie Taillandier-Coindard & Justine Michaux & Emma Ricart-Altimiras & Anne I. Kraeme, 2024. "Immunopeptidomics-based identification of naturally presented non-canonical circRNA-derived peptides," Nature Communications, Nature, vol. 15(1), pages 1-18, December.
    2. Eduardo Vieira de Souza & Angie L. Bookout & Christopher A. Barnes & Brendan Miller & Pablo Machado & Luiz A. Basso & Cristiano V. Bizarro & Alan Saghatelian, 2024. "Rp3: Ribosome profiling-assisted proteogenomics improves coverage and confidence during microprotein discovery," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    3. Yi Yang & Qun Fang, 2024. "Prediction of glycopeptide fragment mass spectra by deep learning," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    4. Hui Peng & He Wang & Weijia Kong & Jinyan Li & Wilson Wen Bin Goh, 2024. "Optimizing differential expression analysis for proteomics data via high-performing rules and ensemble inference," Nature Communications, Nature, vol. 15(1), pages 1-18, December.
    5. Charlotte Adams & Wassim Gabriel & Kris Laukens & Mario Picciani & Mathias Wilhelm & Wout Bittremieux & Kurt Boonen, 2024. "Fragment ion intensity prediction improves the identification rate of non-tryptic peptides in timsTOF," Nature Communications, Nature, vol. 15(1), pages 1-11, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yi Yang & Qun Fang, 2024. "Prediction of glycopeptide fragment mass spectra by deep learning," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    2. Wen-Feng Zeng & Xie-Xuan Zhou & Sander Willems & Constantin Ammar & Maria Wahle & Isabell Bludau & Eugenia Voytik & Maximillian T. Strauss & Matthias Mann, 2022. "AlphaPeptDeep: a modular deep learning framework to predict peptide properties for proteomics," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    3. Lei Xin & Rui Qiao & Xin Chen & Hieu Tran & Shengying Pan & Sahar Rabinoviz & Haibo Bian & Xianliang He & Brenton Morse & Baozhen Shan & Ming Li, 2022. "A streamlined platform for analyzing tera-scale DDA and DIA mass spectrometry data enables highly sensitive immunopeptidomics," Nature Communications, Nature, vol. 13(1), pages 1-9, December.
    4. Charlotte Adams & Wassim Gabriel & Kris Laukens & Mario Picciani & Mathias Wilhelm & Wout Bittremieux & Kurt Boonen, 2024. "Fragment ion intensity prediction improves the identification rate of non-tryptic peptides in timsTOF," Nature Communications, Nature, vol. 15(1), pages 1-11, December.
    5. Henry Webel & Lili Niu & Annelaura Bach Nielsen & Marie Locard-Paulet & Matthias Mann & Lars Juhl Jensen & Simon Rasmussen, 2024. "Imputation of label-free quantitative mass spectrometry-based proteomics data using self-supervised deep learning," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    6. Daniela Klaproth-Andrade & Johannes Hingerl & Yanik Bruns & Nicholas H. Smith & Jakob Träuble & Mathias Wilhelm & Julien Gagneur, 2024. "Deep learning-driven fragment ion series classification enables highly precise and sensitive de novo peptide sequencing," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    7. Weiping Sun & Qianqiu Zhang & Xiyue Zhang & Ngoc Hieu Tran & M. Ziaur Rahman & Zheng Chen & Chao Peng & Jun Ma & Ming Li & Lei Xin & Baozhen Shan, 2023. "Glycopeptide database search and de novo sequencing with PEAKS GlycanFinder enable highly sensitive glycoproteomics," Nature Communications, Nature, vol. 14(1), pages 1-15, December.
    8. Hanqing Liao & Carolina Barra & Zhicheng Zhou & Xu Peng & Isaac Woodhouse & Arun Tailor & Robert Parker & Alexia Carré & Persephone Borrow & Michael J. Hogan & Wayne Paes & Laurence C. Eisenlohr & Rob, 2024. "MARS an improved de novo peptide candidate selection method for non-canonical antigen target discovery in cancer," Nature Communications, Nature, vol. 15(1), pages 1-16, December.
    9. Celina Tretter & Niklas Andrade Krätzig & Matteo Pecoraro & Sebastian Lange & Philipp Seifert & Clara Frankenberg & Johannes Untch & Gabriela Zuleger & Mathias Wilhelm & Daniel P. Zolg & Florian S. Dr, 2023. "Proteogenomic analysis reveals RNA as a source for tumor-agnostic neoantigen identification," Nature Communications, Nature, vol. 14(1), pages 1-22, December.
    10. David Gomez-Zepeda & Danielle Arnold-Schild & Julian Beyrle & Arthur Declercq & Ralf Gabriels & Elena Kumm & Annica Preikschat & Mateusz Krzysztof Łącki & Aurélie Hirschler & Jeewan Babu Rijal & Chris, 2024. "Thunder-DDA-PASEF enables high-coverage immunopeptidomics and is boosted by MS2Rescore with MS2PIP timsTOF fragmentation prediction model," Nature Communications, Nature, vol. 15(1), pages 1-18, December.
    11. Samuel Rivero-Hinojosa & Melanie Grant & Aswini Panigrahi & Huizhen Zhang & Veronika Caisova & Catherine M. Bollard & Brian R. Rood, 2021. "Proteogenomic discovery of neoantigens facilitates personalized multi-antigen targeted T cell immunotherapy for brain tumors," Nature Communications, Nature, vol. 12(1), pages 1-15, December.
    12. Naomi Hoenisch Gravel & Annika Nelde & Jens Bauer & Lena Mühlenbruch & Sarah M. Schroeder & Marian C. Neidert & Jonas Scheid & Steffen Lemke & Marissa L. Dubbelaar & Marcel Wacker & Anna Dengler & Rei, 2023. "TOFIMS mass spectrometry-based immunopeptidomics refines tumor antigen identification," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    13. Fengchao Yu & Guo Ci Teo & Andy T. Kong & Klemens Fröhlich & Ginny Xiaohe Li & Vadim Demichev & Alexey I. Nesvizhskii, 2023. "Analysis of DIA proteomics data using MSFragger-DIA and FragPipe computational platform," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    14. Georges Bedran & Daniel A. Polasky & Yi Hsiao & Fengchao Yu & Felipe Veiga Leprevost & Javier A. Alfaro & Marcin Cieslik & Alexey I. Nesvizhskii, 2023. "Unraveling the glycosylated immunopeptidome with HLA-Glyco," Nature Communications, Nature, vol. 14(1), pages 1-12, December.
    15. Sofani Tafesse Gebreyesus & Asad Ali Siyal & Reta Birhanu Kitata & Eric Sheng-Wen Chen & Bayarmaa Enkhbayar & Takashi Angata & Kuo-I Lin & Yu-Ju Chen & Hsiung-Lin Tu, 2022. "Streamlined single-cell proteomics by an integrated microfluidic chip and data-independent acquisition mass spectrometry," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    16. Maximilian T. Strauss & Isabell Bludau & Wen-Feng Zeng & Eugenia Voytik & Constantin Ammar & Julia P. Schessner & Rajesh Ilango & Michelle Gill & Florian Meier & Sander Willems & Matthias Mann, 2024. "AlphaPept: a modern and open framework for MS-based proteomics," Nature Communications, Nature, vol. 15(1), pages 1-16, December.
    17. Melih Yilmaz & William E. Fondrie & Wout Bittremieux & Carlo F. Melendez & Rowan Nelson & Varun Ananth & Sewoong Oh & William Stafford Noble, 2024. "Sequence-to-sequence translation from mass spectra to peptides with a transformer model," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    18. Jennifer G. Abelin & Erik J. Bergstrom & Keith D. Rivera & Hannah B. Taylor & Susan Klaeger & Charles Xu & Eva K. Verzani & C. Jackson White & Hilina B. Woldemichael & Maya Virshup & Meagan E. Olive &, 2023. "Workflow enabling deepscale immunopeptidome, proteome, ubiquitylome, phosphoproteome, and acetylome analyses of sample-limited tissues," Nature Communications, Nature, vol. 14(1), pages 1-22, December.
    19. Hao Hu & Wei Hu & An-Di Guo & Linhui Zhai & Song Ma & Hui-Jun Nie & Bin-Shan Zhou & Tianxian Liu & Xinglong Jia & Xing Liu & Xuebiao Yao & Minjia Tan & Xiao-Hua Chen, 2024. "Spatiotemporal and direct capturing global substrates of lysine-modifying enzymes in living cells," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    20. Jens Bauer & Natalie Köhler & Yacine Maringer & Philip Bucher & Tatjana Bilich & Melissa Zwick & Severin Dicks & Annika Nelde & Marissa Dubbelaar & Jonas Scheid & Marcel Wacker & Jonas S. Heitmann & S, 2022. "The oncogenic fusion protein DNAJB1-PRKACA can be specifically targeted by peptide-based immunotherapy in fibrolamellar hepatocellular carcinoma," Nature Communications, Nature, vol. 13(1), pages 1-16, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:14:y:2023:i:1:d:10.1038_s41467-023-40129-9. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.