Center for Cell and Gene Therapy, Clinical Research Center for Cell-based Immunotherapy, Shanghai Pudong Hospital, State Key Laboratory of Genetic Engineering, School of Life Sciences, Human Phenome Institute, Fudan University, E301, School of Life Sciences, No. 2005, Songhu Road, Yangpu District, Shanghai 200438, P.R. China.
Department of Pediatric Orthopedics, Xinhua Hospital affiliated to Shanghai Jiao Tong University School of Medicine, No. 1665, Kongjiang Road, Yangpu District, Shanghai 200092, China.
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae553.
Mass spectrometry (MS)-based proteomics has become instrumental in comprehensively investigating complex biological systems. Data-independent acquisition (DIA)-MS, utilizing hybrid spectral library search strategies, allows for the simultaneous quantification of thousands of proteins, showing promise in enhancing protein identification and quantification precision. However, low-quality profiles can considerably undermine quantitative precision, resulting in inaccurate protein quantification. To tackle this challenge, we introduced STAVER, a novel algorithm that leverages standardized benchmark datasets to reduce non-biological variation in large-scale DIA-MS analyses. By eliminating unwanted noise in MS signals, STAVER significantly improved protein quantification precision, especially in hybrid spectral library searches. Moreover, we validated STAVER's robustness and applicability across multiple large-scale DIA datasets, demonstrating significantly enhanced precision and reproducibility of protein quantification. STAVER offers an innovative and effective approach for enhancing the quality of large-scale DIA proteomic data, facilitating cross-platform and cross-laboratory comparative analyses. This advancement significantly enhances the consistency and reliability of findings in clinical research. The complete package is available at https://github.com/Ran485/STAVER.
基于质谱(MS)的蛋白质组学已成为全面研究复杂生物系统的有力工具。数据非依赖性采集(DIA)-MS 利用混合谱库搜索策略,可以同时定量数千种蛋白质,有望提高蛋白质鉴定和定量的精度。然而,低质量的图谱会极大地降低定量精度,导致蛋白质定量不准确。为了解决这个挑战,我们引入了 STAVER,这是一种利用标准化基准数据集来减少大规模 DIA-MS 分析中非生物学变异的新算法。通过消除 MS 信号中的无用噪声,STAVER 显著提高了蛋白质定量的精度,特别是在混合谱库搜索中。此外,我们验证了 STAVER 在多个大规模 DIA 数据集上的稳健性和适用性,证明了蛋白质定量的精度和重现性得到了显著提高。STAVER 为提高大规模 DIA 蛋白质组数据的质量提供了一种创新而有效的方法,促进了跨平台和跨实验室的比较分析。这一进展显著提高了临床研究结果的一致性和可靠性。完整的软件包可在 https://github.com/Ran485/STAVER 上获得。