A Deep Learning-Based Tumor Classifier Directly Using MS Raw Data.

Authors

Dong Hao, Liu Yi, Zeng Wen-Feng, Shu Kunxian, Zhu Yunping, Chang Cheng

Affiliations

State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing, 102206, China.

School of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing, 400065, China.

Publication information

Proteomics. 2020 Nov;20(21-22):e1900344. doi: 10.1002/pmic.201900344. Epub 2020 Jul 26.

DOI:10.1002/pmic.201900344
PMID:32643271
Abstract

Since the launch of the Chinese Human Proteome Project (CNHPP) and the Clinical Proteomic Tumor Analysis Consortium (CPTAC), large-scale mass spectrometry (MS)-based proteomic profiling of different kinds of human tumor samples has provided a huge amount of valuable data for both basic and clinical researchers. Accurate prediction of tumor and non-tumor samples, as well as of tumor types, has become a key step in biological and medical research, such as biomarker discovery, diagnosis, and monitoring of diseases. The traditional MS-based classification strategy mainly depends on the identification and quantification results of MS data, which has some inherent limitations, such as the low identification rate of MS data. Here, a deep learning-based tumor classifier that directly uses MS raw data is proposed, which is independent of the identification and quantification results of MS data. First, the potential precursors, with their intensities and retention times, are detected and extracted from the MS data as input. Then, a deep learning-based classifier is trained, which can accurately distinguish between tumor and non-tumor samples. Finally, it is demonstrated that the deep learning-based classifier performs well compared with other machine learning methods and may help researchers find potential biomarkers that are likely to be missed by the traditional strategy.
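The workflow in the abstract (extract precursors with intensities and retention times, then train a classifier on them) can be illustrated with a small sketch. This is not the authors' pipeline: the abstract does not specify the input encoding or the network, so everything here is an assumption made for illustration. The precursors are fabricated, the m/z and retention-time windows, grid size, and `LOG_SCALE` constant are invented, and a single-layer logistic model trained by gradient descent stands in for the deep network.

```python
import math
import random

MZ_RANGE = (300.0, 1500.0)  # assumed m/z window
RT_RANGE = (0.0, 120.0)     # assumed retention-time window (minutes)
MZ_BINS, RT_BINS = 4, 3     # coarse grid, for illustration only
LOG_SCALE = 20.0            # assumed divisor keeping features roughly in [0, 1]


def featurize(precursors):
    """Map (m/z, retention time, intensity) triples to a fixed-length vector."""
    grid = [0.0] * (MZ_BINS * RT_BINS)
    for mz, rt, intensity in precursors:
        if not (MZ_RANGE[0] <= mz < MZ_RANGE[1] and RT_RANGE[0] <= rt < RT_RANGE[1]):
            continue  # ignore precursors outside the assumed windows
        i = min(int((mz - MZ_RANGE[0]) / (MZ_RANGE[1] - MZ_RANGE[0]) * MZ_BINS), MZ_BINS - 1)
        j = min(int((rt - RT_RANGE[0]) / (RT_RANGE[1] - RT_RANGE[0]) * RT_BINS), RT_BINS - 1)
        grid[i * RT_BINS + j] += intensity  # sum intensities per grid cell
    return [math.log1p(v) / LOG_SCALE for v in grid]


def synthetic_sample(rng, is_tumor):
    """Fabricated run: one background precursor per cell, plus a made-up tumor signal."""
    mz_width = (MZ_RANGE[1] - MZ_RANGE[0]) / MZ_BINS
    rt_width = (RT_RANGE[1] - RT_RANGE[0]) / RT_BINS
    precursors = []
    for i in range(MZ_BINS):
        for j in range(RT_BINS):
            mz = MZ_RANGE[0] + (i + rng.uniform(0.1, 0.9)) * mz_width
            rt = RT_RANGE[0] + (j + rng.uniform(0.1, 0.9)) * rt_width
            precursors.append((mz, rt, rng.uniform(1e4, 2e4)))
    if is_tumor:
        precursors.append((650.0, 45.0, 1e8))  # hypothetical tumor-specific precursor
    return precursors


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def train_logistic(X, y, epochs=300, lr=0.5):
    """Full-batch gradient descent on the logistic loss (stand-in for a deep model)."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(sum(wk * xk for wk, xk in zip(w, xi)) + b) - yi
            grad_b += err
            for k in range(d):
                grad_w[k] += err * xi[k]
        w = [wk - lr * gk / n for wk, gk in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


def predict(w, b, x):
    """Return 1 (tumor) or 0 (non-tumor)."""
    return 1 if sum(wk * xk for wk, xk in zip(w, x)) + b >= 0.0 else 0


rng = random.Random(0)
train_labels = [k % 2 for k in range(40)]
train_X = [featurize(synthetic_sample(rng, bool(lab))) for lab in train_labels]
w, b = train_logistic(train_X, train_labels)

test_labels = [k % 2 for k in range(20)]
test_X = [featurize(synthetic_sample(rng, bool(lab))) for lab in test_labels]
accuracy = sum(predict(w, b, x) == lab for x, lab in zip(test_X, test_labels)) / len(test_labels)
print(f"held-out accuracy on synthetic data: {accuracy:.2f}")
```

The point of the sketch is the data flow: no peptide identification or quantification step appears anywhere, and the classifier sees only binned precursor intensities. Real raw files would need a feature-detection step in place of `synthetic_sample`, and a far finer grid than the 4 × 3 one assumed here.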

Similar articles

1
A Deep Learning-Based Tumor Classifier Directly Using MS Raw Data.
Proteomics. 2020 Nov;20(21-22):e1900344. doi: 10.1002/pmic.201900344. Epub 2020 Jul 26.
2
On the feasibility of deep learning applications using raw mass spectrometry data.
Bioinformatics. 2021 Jul 12;37(Suppl_1):i245-i253. doi: 10.1093/bioinformatics/btab311.
3
MSpectraAI: a powerful platform for deciphering proteome profiling of multi-tumor mass spectrometry data by using deep neural networks.
BMC Bioinformatics. 2020 Oct 7;21(1):439. doi: 10.1186/s12859-020-03783-0.
4
Proteome analysis using machine learning approaches and its applications to diseases.
Adv Protein Chem Struct Biol. 2021;127:161-216. doi: 10.1016/bs.apcsb.2021.02.003. Epub 2021 Mar 24.
5
Deep Learning Powers Protein Identification from Precursor MS Information.
J Proteome Res. 2024 Sep 6;23(9):3837-3846. doi: 10.1021/acs.jproteome.4c00118. Epub 2024 Aug 21.
6
[Research progress of feature selection and machine learning methods for mass spectrometry-based protein biomarker discovery].
Sheng Wu Gong Cheng Xue Bao. 2019 Sep 25;35(9):1619-1632. doi: 10.13345/j.cjb.190064.
7
Phenotype Classification using Proteome Data in a Data-Independent Acquisition Tensor Format.
J Am Soc Mass Spectrom. 2020 Nov 4;31(11):2296-2304. doi: 10.1021/jasms.0c00254. Epub 2020 Oct 26.
8
Clinically Applicable Deep Learning Algorithm Using Quantitative Proteomic Data.
J Proteome Res. 2019 Aug 2;18(8):3195-3202. doi: 10.1021/acs.jproteome.9b00268. Epub 2019 Jul 17.
9
Proteomic cancer classification with mass spectrometry data.
Am J Pharmacogenomics. 2005;5(5):281-92. doi: 10.2165/00129785-200505050-00001.
10
PB-Net: Automatic peak integration by sequential deep learning for multiple reaction monitoring.
J Proteomics. 2020 Jul 15;223:103820. doi: 10.1016/j.jprot.2020.103820. Epub 2020 May 13.

Cited by

1
Toward molecular diagnosis of major depressive disorder by plasma peptides using a deep learning approach.
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae554.
2
Unveiling diagnostic and therapeutic strategies for cervical cancer: biomarker discovery through proteomics approaches and exploring the role of cervical cancer stem cells.
Front Oncol. 2024 Jan 24;13:1277772. doi: 10.3389/fonc.2023.1277772. eCollection 2023.
3
DeepRTAlign: toward accurate retention time alignment for large cohort mass spectrometry data analysis.
Nat Commun. 2023 Dec 11;14(1):8188. doi: 10.1038/s41467-023-43909-5.
4
Novel research and future prospects of artificial intelligence in cancer diagnosis and treatment.
J Hematol Oncol. 2023 Nov 27;16(1):114. doi: 10.1186/s13045-023-01514-5.
5
Early Diagnosis: End-to-End CNN-LSTM Models for Mass Spectrometry Data Classification.
Anal Chem. 2023 Sep 12;95(36):13431-13437. doi: 10.1021/acs.analchem.3c00613. Epub 2023 Aug 25.
6
Artificial Intelligence-Based Medical Data Mining.
J Pers Med. 2022 Aug 24;12(9):1359. doi: 10.3390/jpm12091359.
7
Managing of Unassigned Mass Spectrometric Data by Neural Network for Cancer Phenotypes Classification.
J Pers Med. 2021 Dec 3;11(12):1288. doi: 10.3390/jpm11121288.
8
Deep Learning in Proteomics.
Proteomics. 2020 Nov;20(21-22):e1900335. doi: 10.1002/pmic.201900335. Epub 2020 Oct 30.