IDEAS home Printed from https://ideas.repec.org/a/nat/natcom/v16y2025i1d10.1038_s41467-024-54871-1.html
   My bibliography  Save this article

iDIA-QC: AI-empowered data-independent acquisition mass spectrometry-based quality control

Author

Listed:
  • Huanhuan Gao

    (Westlake University
    Westlake Laboratory of Life Sciences and Biomedicine
    Westlake University)

  • Yi Zhu

    (Westlake University
    Westlake Laboratory of Life Sciences and Biomedicine
    Westlake University)

  • Dongxue Wang

    (Beijing Institute of Lifeomics
    International Academy of Phronesis Medicine)

  • Zongxiang Nie

    (Ltd.)

  • He Wang

    (Westlake University
    Westlake Laboratory of Life Sciences and Biomedicine
    Westlake University)

  • Guibin Wang

    (Beijing Institute of Lifeomics)

  • Shuang Liang

    (Zhejiang Academy of Agricultural Sciences)

  • Yuting Xie

    (Westlake University
    Westlake Laboratory of Life Sciences and Biomedicine
    Westlake University)

  • Yingying Sun

    (Westlake University
    Westlake Laboratory of Life Sciences and Biomedicine
    Westlake University)

  • Wenhao Jiang

    (Westlake University
    Westlake Laboratory of Life Sciences and Biomedicine
    Westlake University)

  • Zhen Dong

    (Westlake University
    Westlake Laboratory of Life Sciences and Biomedicine
    Westlake University)

  • Liqin Qian

    (Westlake University
    Westlake Laboratory of Life Sciences and Biomedicine
    Westlake University)

  • Xufei Wang

    (Guangzhou Medical University)

  • Mengdi Liang

    (Guangzhou Medical University)

  • Min Chen

    (Ltd)

  • Houqi Fang

    (Ltd)

  • Qiufang Zeng

    (Ltd)

  • Jiao Tian

    (Ltd)

  • Zeyu Sun

    (Zhejiang University)

  • Juan Xue

    (Hubei University of Medicine
    Huazhong Agricultural University)

  • Shan Li

    (Hubei University of Medicine
    Huazhong Agricultural University)

  • Chen Chen

    (SCIEX)

  • Xiang Liu

    (SCIEX)

  • Xiaolei Lyu

    (SCIEX)

  • Zhenchang Guo

    (Thermo Fisher Scientific)

  • Yingzi Qi

    (Thermo Fisher Scientific)

  • Ruoyu Wu

    (Bruker Daltonics)

  • Xiaoxian Du

    (Bruker Daltonics)

  • Tingde Tong

    (Thermo Fisher Scientific)

  • Fengchun Kong

    (SCIEX)

  • Liming Han

    (Bruker Daltonics)

  • Minghui Wang

    (Bruker Daltonics)

  • Yang Zhao

    (National Institute of Metrology)

  • Xinhua Dai

    (National Institute of Metrology)

  • Fuchu He

    (Beijing Institute of Lifeomics
    International Academy of Phronesis Medicine)

  • Tiannan Guo

    (Westlake University
    Westlake Laboratory of Life Sciences and Biomedicine
    Westlake University)

Abstract

Quality control (QC) in mass spectrometry (MS)-based proteomics is mainly based on data-dependent acquisition (DDA) analysis of standard samples. Here, we collect 2754 files acquired by data independent acquisition (DIA) and paired 2638 DDA files from mouse liver digests using 21 mass spectrometers across nine laboratories over 31 months. Our data demonstrate that DIA-based LC-MS/MS-related consensus QC metrics exhibit higher sensitivity compared to DDA-based QC metrics in detecting changes in LC-MS status. We then prioritize 15 metrics and invite 21 experts to manually assess the quality of 2754 DIA files based on those metrics. We develop an AI model for DIA-based QC using 2110 training files. It achieves AUCs of 0.91 (LC) and 0.97 (MS) in the first validation dataset (n = 528), and 0.78 (LC) and 0.94 (MS) in an independent validation dataset (n = 116). Finally, we develop an offline software called iDIA-QC for convenient adoption of this methodology.

Suggested Citation

  • Huanhuan Gao & Yi Zhu & Dongxue Wang & Zongxiang Nie & He Wang & Guibin Wang & Shuang Liang & Yuting Xie & Yingying Sun & Wenhao Jiang & Zhen Dong & Liqin Qian & Xufei Wang & Mengdi Liang & Min Chen &, 2025. "iDIA-QC: AI-empowered data-independent acquisition mass spectrometry-based quality control," Nature Communications, Nature, vol. 16(1), pages 1-13, December.
  • Handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-024-54871-1
    DOI: 10.1038/s41467-024-54871-1
    as

    Download full text from publisher

    File URL: https://www.nature.com/articles/s41467-024-54871-1
    File Function: Abstract
    Download Restriction: no

    File URL: https://libkey.io/10.1038/s41467-024-54871-1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Ruedi Aebersold & Matthias Mann, 2016. "Mass-spectrometric exploration of proteome structure and function," Nature, Nature, vol. 537(7620), pages 347-355, September.
    2. Vadim Demichev & Lukasz Szyrwiel & Fengchao Yu & Guo Ci Teo & George Rosenberger & Agathe Niewienda & Daniela Ludwig & Jens Decker & Stephanie Kaspar-Schoenefeld & Kathryn S. Lilley & Michael Mülleder, 2022. "dia-PASEF data analysis using FragPipe and DIA-NN for deep proteomics of low sample amounts," Nature Communications, Nature, vol. 13(1), pages 1-8, December.
    3. Yue Xuan & Nicholas W. Bateman & Sebastien Gallien & Sandra Goetze & Yue Zhou & Pedro Navarro & Mo Hu & Niyati Parikh & Brian L. Hood & Kelly A. Conrads & Christina Loosse & Reta Birhanu Kitata & Sand, 2020. "Standardization and harmonization of distributed multi-center proteotype analysis supporting precision medicine studies," Nature Communications, Nature, vol. 11(1), pages 1-12, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yi Yang & Qun Fang, 2024. "Prediction of glycopeptide fragment mass spectra by deep learning," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    2. Kai Li & Guo Ci Teo & Kevin L. Yang & Fengchao Yu & Alexey I. Nesvizhskii, 2025. "diaTracer enables spectrum-centric analysis of diaPASEF proteomics data," Nature Communications, Nature, vol. 16(1), pages 1-14, December.
    3. Valdemaras Petrosius & Pedro Aragon-Fernandez & Nil Üresin & Gergo Kovacs & Teeradon Phlairaharn & Benjamin Furtwängler & Jeff Op De Beeck & Sarah L. Skovbakke & Steffen Goletz & Simon Francis Thomsen, 2023. "Exploration of cell state heterogeneity using single-cell proteomics through sensitivity-tailored data-independent acquisition," Nature Communications, Nature, vol. 14(1), pages 1-16, December.
    4. Fengchao Yu & Guo Ci Teo & Andy T. Kong & Klemens Fröhlich & Ginny Xiaohe Li & Vadim Demichev & Alexey I. Nesvizhskii, 2023. "Analysis of DIA proteomics data using MSFragger-DIA and FragPipe computational platform," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    5. Henry Webel & Lili Niu & Annelaura Bach Nielsen & Marie Locard-Paulet & Matthias Mann & Lars Juhl Jensen & Simon Rasmussen, 2024. "Imputation of label-free quantitative mass spectrometry-based proteomics data using self-supervised deep learning," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    6. Guilherme Reis-de-Oliveira & Victor Corasolla Carregari & Gabriel Rodrigues dos Reis de Sousa & Daniel Martins-de-Souza, 2024. "OmicScope unravels systems-level insights from quantitative proteomics data," Nature Communications, Nature, vol. 15(1), pages 1-12, December.
    7. Lei Xin & Rui Qiao & Xin Chen & Hieu Tran & Shengying Pan & Sahar Rabinoviz & Haibo Bian & Xianliang He & Brenton Morse & Baozhen Shan & Ming Li, 2022. "A streamlined platform for analyzing tera-scale DDA and DIA mass spectrometry data enables highly sensitive immunopeptidomics," Nature Communications, Nature, vol. 13(1), pages 1-9, December.
    8. Sofani Tafesse Gebreyesus & Asad Ali Siyal & Reta Birhanu Kitata & Eric Sheng-Wen Chen & Bayarmaa Enkhbayar & Takashi Angata & Kuo-I Lin & Yu-Ju Chen & Hsiung-Lin Tu, 2022. "Streamlined single-cell proteomics by an integrated microfluidic chip and data-independent acquisition mass spectrometry," Nature Communications, Nature, vol. 13(1), pages 1-13, December.
    9. Melih Yilmaz & William E. Fondrie & Wout Bittremieux & Carlo F. Melendez & Rowan Nelson & Varun Ananth & Sewoong Oh & William Stafford Noble, 2024. "Sequence-to-sequence translation from mass spectra to peptides with a transformer model," Nature Communications, Nature, vol. 15(1), pages 1-13, December.
    10. Hao Hu & Wei Hu & An-Di Guo & Linhui Zhai & Song Ma & Hui-Jun Nie & Bin-Shan Zhou & Tianxian Liu & Xinglong Jia & Xing Liu & Xuebiao Yao & Minjia Tan & Xiao-Hua Chen, 2024. "Spatiotemporal and direct capturing global substrates of lysine-modifying enzymes in living cells," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    11. Yuwan Chen & Wen Zhou & Yufei Xia & Weijie Zhang & Qun Zhao & Xinwei Li & Hang Gao & Zhen Liang & Guanghui Ma & Kaiguang Yang & Lihua Zhang & Yukui Zhang, 2023. "Targeted cross-linker delivery for the in situ mapping of protein conformations and interactions in mitochondria," Nature Communications, Nature, vol. 14(1), pages 1-16, December.
    12. Yulin Sun & Zhengguang Guo & Xiaoyan Liu & Lijun Yang & Zongpan Jing & Meng Cai & Zhaoxu Zheng & Chen Shao & Yefan Zhang & Haidan Sun & Li Wang & Minjie Wang & Jun Li & Lusong Tian & Yue Han & Shuangm, 2022. "Noninvasive urinary protein signatures associated with colorectal cancer diagnosis and metastasis," Nature Communications, Nature, vol. 13(1), pages 1-15, December.
    13. Karsten Suhre & Guhan Ram Venkataraman & Harendra Guturu & Anna Halama & Nisha Stephan & Gaurav Thareja & Hina Sarwath & Khatereh Motamedchaboki & Margaret K. R. Donovan & Asim Siddiqui & Serafim Batz, 2024. "Nanoparticle enrichment mass-spectrometry proteomics identifies protein-altering variants for precise pQTL mapping," Nature Communications, Nature, vol. 15(1), pages 1-11, December.
    14. Martin Mehnert & Rodolfo Ciuffa & Fabian Frommelt & Federico Uliana & Audrey Drogen & Kilian Ruminski & Matthias Gstaiger & Ruedi Aebersold, 2020. "Multi-layered proteomic analyses decode compositional and functional effects of cancer mutations on kinase complexes," Nature Communications, Nature, vol. 11(1), pages 1-18, December.
    15. Zhen Dong & Wenhao Jiang & Chunlong Wu & Ting Chen & Jiayi Chen & Xuan Ding & Shu Zheng & Kiryl D. Piatkevich & Yi Zhu & Tiannan Guo, 2024. "Spatial proteomics of single cells and organelles on tissue slides using filter-aided expansion proteomics," Nature Communications, Nature, vol. 15(1), pages 1-15, December.
    16. Daniela Klaproth-Andrade & Johannes Hingerl & Yanik Bruns & Nicholas H. Smith & Jakob Träuble & Mathias Wilhelm & Julien Gagneur, 2024. "Deep learning-driven fragment ion series classification enables highly precise and sensitive de novo peptide sequencing," Nature Communications, Nature, vol. 15(1), pages 1-14, December.
    17. Karl K. Krull & Syed Azmal Ali & Jeroen Krijgsveld, 2024. "Enhanced feature matching in single-cell proteomics characterizes IFN-γ response and co-existence of cell states," Nature Communications, Nature, vol. 15(1), pages 1-17, December.
    18. Wen-Feng Zeng & Xie-Xuan Zhou & Sander Willems & Constantin Ammar & Maria Wahle & Isabell Bludau & Eugenia Voytik & Maximillian T. Strauss & Matthias Mann, 2022. "AlphaPeptDeep: a modular deep learning framework to predict peptide properties for proteomics," Nature Communications, Nature, vol. 13(1), pages 1-14, December.
    19. Erik Hartman & Aaron M. Scott & Christofer Karlsson & Tirthankar Mohanty & Suvi T. Vaara & Adam Linder & Lars Malmström & Johan Malmström, 2023. "Interpreting biologically informed neural networks for enhanced proteomic biomarker discovery and pathway analysis," Nature Communications, Nature, vol. 14(1), pages 1-13, December.
    20. Fushi Wang & Chunxiao Zhao & Pinlong Zhao & Fanfan Chen & Dan Qiao & Jiandong Feng, 2023. "MoS2 nanopore identifies single amino acids with sub-1 Dalton resolution," Nature Communications, Nature, vol. 14(1), pages 1-8, December.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:nat:natcom:v:16:y:2025:i:1:d:10.1038_s41467-024-54871-1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.nature.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.