• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

MSpectraAI:一个强大的平台,用于使用深度神经网络破译多肿瘤质谱数据的蛋白质组谱。

MSpectraAI: a powerful platform for deciphering proteome profiling of multi-tumor mass spectrometry data by using deep neural networks.

机构信息

West China-Washington Mitochondria and Metabolism Research Center; Key Lab of Transplant Engineering and Immu-Nology, MOH, Regenerative Medicine Research Center, West China Hospital, Sichuan University, No. 88, Keyuan South Road, Hi-tech Zone, Chengdu, 610041, China.

Shanghai Institute of Materia Medica, Chinese Academy of Sciences, Shanghai, China.

出版信息

BMC Bioinformatics. 2020 Oct 7;21(1):439. doi: 10.1186/s12859-020-03783-0.

DOI:10.1186/s12859-020-03783-0
PMID:33028193
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7539376/
Abstract

BACKGROUND

Mass spectrometry (MS) has become a promising analytical technique to acquire proteomics information for the characterization of biological samples. Nevertheless, most studies focus on the final proteins identified through a suite of algorithms by using partial MS spectra to compare with the sequence database, while the pattern recognition and classification of raw mass-spectrometric data remain unresolved.

RESULTS

We developed an open-source and comprehensive platform, named MSpectraAI, for analyzing large-scale MS data through deep neural networks (DNNs); this system involves spectral-feature swath extraction, classification, and visualization. Moreover, this platform allows users to create their own DNN model by using Keras. To evaluate this tool, we collected the publicly available proteomics datasets of six tumor types (a total of 7,997,805 mass spectra) from the ProteomeXchange consortium and classified the samples based on the spectra profiling. The results suggest that MSpectraAI can distinguish different types of samples based on the fingerprint spectrum and achieve better prediction accuracy in MS1 level (average 0.967).

CONCLUSION

This study deciphers proteome profiling of raw mass spectrometry data and broadens the promising application of the classification and prediction of proteomics data from multi-tumor samples using deep learning methods. MSpectraAI also shows a better performance compared to the other classical machine learning approaches.

摘要

背景

质谱(MS)已成为一种很有前途的分析技术,可用于获取蛋白质组学信息以对生物样本进行特征分析。然而,大多数研究都集中在通过使用部分 MS 谱与序列数据库进行比较的一系列算法来最终确定蛋白质,而原始质谱数据的模式识别和分类仍然没有得到解决。

结果

我们开发了一个名为 MSpectraAI 的开源且全面的平台,用于通过深度神经网络(DNN)分析大规模 MS 数据;该系统涉及光谱特征条带提取、分类和可视化。此外,该平台允许用户使用 Keras 创建自己的 DNN 模型。为了评估这个工具,我们从 ProteomeXchange 联盟收集了六个肿瘤类型的公开可用蛋白质组学数据集(共计 7,997,805 个质谱),并根据谱图分析对样本进行分类。结果表明,MSpectraAI 可以根据指纹谱区分不同类型的样本,并在 MS1 水平上实现更好的预测精度(平均 0.967)。

结论

本研究揭示了原始质谱数据的蛋白质组特征,并通过深度学习方法拓宽了对多肿瘤样本的蛋白质组数据分类和预测的应用前景。与其他经典机器学习方法相比,MSpectraAI 也表现出更好的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0020/7539376/e29dc826667c/12859_2020_3783_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0020/7539376/a772ca0ba9a1/12859_2020_3783_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0020/7539376/72cf7eded322/12859_2020_3783_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0020/7539376/fe79726bdcd8/12859_2020_3783_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0020/7539376/62b326b5ee8d/12859_2020_3783_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0020/7539376/e29dc826667c/12859_2020_3783_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0020/7539376/a772ca0ba9a1/12859_2020_3783_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0020/7539376/72cf7eded322/12859_2020_3783_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0020/7539376/fe79726bdcd8/12859_2020_3783_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0020/7539376/62b326b5ee8d/12859_2020_3783_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0020/7539376/e29dc826667c/12859_2020_3783_Fig5_HTML.jpg

相似文献

1
MSpectraAI: a powerful platform for deciphering proteome profiling of multi-tumor mass spectrometry data by using deep neural networks.MSpectraAI:一个强大的平台,用于使用深度神经网络破译多肿瘤质谱数据的蛋白质组谱。
BMC Bioinformatics. 2020 Oct 7;21(1):439. doi: 10.1186/s12859-020-03783-0.
2
On the feasibility of deep learning applications using raw mass spectrometry data.利用原始质谱数据进行深度学习应用的可行性研究。
Bioinformatics. 2021 Jul 12;37(Suppl_1):i245-i253. doi: 10.1093/bioinformatics/btab311.
3
A Deep Learning-Based Tumor Classifier Directly Using MS Raw Data.一种直接使用质谱原始数据的基于深度学习的肿瘤分类器。
Proteomics. 2020 Nov;20(21-22):e1900344. doi: 10.1002/pmic.201900344. Epub 2020 Jul 26.
4
Human follicular fluid proteomic and peptidomic composition quantitative studies by SWATH-MS methodology. Applicability of high pH RP-HPLC fractionation.SWATH-MS 方法学对人卵泡液蛋白质组学和肽组学组成的定量研究。高 pH 值反相 HPLC 分级分离的适用性。
J Proteomics. 2019 Jan 16;191:131-142. doi: 10.1016/j.jprot.2018.03.010. Epub 2018 Mar 10.
5
The APEX Quantitative Proteomics Tool: generating protein quantitation estimates from LC-MS/MS proteomics results.APEX定量蛋白质组学工具:从液相色谱-串联质谱蛋白质组学结果生成蛋白质定量估计值。
BMC Bioinformatics. 2008 Dec 9;9:529. doi: 10.1186/1471-2105-9-529.
6
Corra: Computational framework and tools for LC-MS discovery and targeted mass spectrometry-based proteomics.科拉:用于液相色谱-质谱联用发现和基于靶向质谱的蛋白质组学的计算框架及工具。
BMC Bioinformatics. 2008 Dec 16;9:542. doi: 10.1186/1471-2105-9-542.
7
PRIDE Inspector Toolsuite: Moving Toward a Universal Visualization Tool for Proteomics Data Standard Formats and Quality Assessment of ProteomeXchange Datasets.PRIDE Inspector工具套件:迈向用于蛋白质组学数据标准格式和ProteomeXchange数据集质量评估的通用可视化工具
Mol Cell Proteomics. 2016 Jan;15(1):305-17. doi: 10.1074/mcp.O115.050229. Epub 2015 Nov 6.
8
Multi-laboratory assessment of reproducibility, qualitative and quantitative performance of SWATH-mass spectrometry.SWATH质谱技术的可重复性、定性和定量性能的多实验室评估。
Nat Commun. 2017 Aug 21;8(1):291. doi: 10.1038/s41467-017-00249-5.
9
iProX: an integrated proteome resource.iProX:一个综合蛋白质组资源库。
Nucleic Acids Res. 2019 Jan 8;47(D1):D1211-D1217. doi: 10.1093/nar/gky869.
10
Proteome analysis using machine learning approaches and its applications to diseases.基于机器学习方法的蛋白质组学分析及其在疾病中的应用。
Adv Protein Chem Struct Biol. 2021;127:161-216. doi: 10.1016/bs.apcsb.2021.02.003. Epub 2021 Mar 24.

引用本文的文献

1
Progress and trends on machine learning in proteomics during 1997-2024: a bibliometric analysis.1997 - 2024年蛋白质组学中机器学习的进展与趋势:文献计量分析
Front Med (Lausanne). 2025 Aug 15;12:1594442. doi: 10.3389/fmed.2025.1594442. eCollection 2025.
2
Towards robust medical machine olfaction: Debiasing GC-MS data enhances prostate cancer diagnosis from urine volatiles.迈向稳健的医用机器嗅觉:去偏置气相色谱-质谱数据可增强基于尿液挥发性成分的前列腺癌诊断。
PLoS One. 2025 May 30;20(5):e0314742. doi: 10.1371/journal.pone.0314742. eCollection 2025.
3
A deep learning framework for hepatocellular carcinoma diagnosis using MS1 data.

本文引用的文献

1
Combining discovery and targeted proteomics reveals a prognostic signature in oral cancer.联合发现和靶向蛋白质组学揭示口腔癌的预后标志物。
Nat Commun. 2018 Sep 5;9(1):3598. doi: 10.1038/s41467-018-05696-2.
2
In vivo phosphoproteomics reveals kinase activity profiles that predict treatment outcome in triple-negative breast cancer.体内磷酸蛋白质组学揭示激酶活性谱,可预测三阴性乳腺癌的治疗结果。
Nat Commun. 2018 Aug 29;9(1):3501. doi: 10.1038/s41467-018-05742-z.
3
MetaboGroup S: A Group Entropy-Based Web Platform for Evaluating Normalization Methods in Blood Metabolomics Data from Maintenance Hemodialysis Patients.
基于 MS1 数据的肝细胞癌诊断深度学习框架。
Sci Rep. 2024 Nov 4;14(1):26705. doi: 10.1038/s41598-024-77494-4.
4
Novel research and future prospects of artificial intelligence in cancer diagnosis and treatment.人工智能在癌症诊断和治疗中的新研究与未来展望。
J Hematol Oncol. 2023 Nov 27;16(1):114. doi: 10.1186/s13045-023-01514-5.
5
LC-MS-based serum metabolomics analysis for the screening and monitoring of colorectal cancer.基于液相色谱-质谱联用的血清代谢组学分析用于结直肠癌的筛查与监测
Front Oncol. 2023 Jun 28;13:1173424. doi: 10.3389/fonc.2023.1173424. eCollection 2023.
6
Omics Data and Data Representations for Deep Learning-Based Predictive Modeling.基于组学数据和数据表示的深度学习预测模型
Int J Mol Sci. 2022 Oct 14;23(20):12272. doi: 10.3390/ijms232012272.
MetaboGroup S:一个基于组内信息熵的网站平台,用于评估维持性血液透析患者血液代谢组学数据标准化方法。
Anal Chem. 2018 Sep 18;90(18):11124-11130. doi: 10.1021/acs.analchem.8b03065. Epub 2018 Aug 31.
4
Comparative proteomics reveals a diagnostic signature for pulmonary head-and-neck cancer metastasis.比较蛋白质组学揭示了用于诊断肺头颈部癌症转移的特征性生物标记物。
EMBO Mol Med. 2018 Sep;10(9). doi: 10.15252/emmm.201708428.
5
Mapping the HLA Ligandome of Colorectal Cancer Reveals an Imprint of Malignant Cell Transformation.绘制结直肠癌 HLA 配体组图谱揭示恶性细胞转化的印记
Cancer Res. 2018 Aug 15;78(16):4627-4641. doi: 10.1158/0008-5472.CAN-17-1745. Epub 2018 May 22.
6
Comparative proteomic analysis of human malignant ascitic fluids for the development of gastric cancer biomarkers.用于胃癌生物标志物开发的人恶性腹水的比较蛋白质组学分析
Clin Biochem. 2018 Jun;56:55-61. doi: 10.1016/j.clinbiochem.2018.04.003. Epub 2018 Apr 11.
7
Ordinal logistic regression.有序逻辑回归
Am J Orthod Dentofacial Orthop. 2018 Jan;153(1):157-158. doi: 10.1016/j.ajodo.2017.11.011.
8
Systematic proteome and proteostasis profiling in human Trisomy 21 fibroblast cells.人类 21 三体综合征成纤维细胞的系统蛋白质组和蛋白质稳态分析。
Nat Commun. 2017 Oct 31;8(1):1212. doi: 10.1038/s41467-017-01422-6.
9
Phosphoproteomics Profiling of Nonsmall Cell Lung Cancer Cells Treated with a Novel Phosphatase Activator.用新型磷酸酶激活剂处理的非小细胞肺癌细胞的磷酸化蛋白质组学分析
Proteomics. 2017 Nov;17(22). doi: 10.1002/pmic.201700214. Epub 2017 Oct 25.
10
Data Conversion with ProteoWizard msConvert.使用ProteoWizard msConvert进行数据转换。
Methods Mol Biol. 2017;1550:339-368. doi: 10.1007/978-1-4939-6747-6_23.