• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Alpha-Tri:一种用于评分预测谱和实测谱之间相似度的深度神经网络,可提高 DIA 数据的肽鉴定。

Alpha-Tri: a deep neural network for scoring the similarity between predicted and measured spectra improves peptide identification of DIA data.

机构信息

Zhejiang University, Hangzhou, Zhejiang Province, China.

School of Engineering, Westlake University, Hangzhou, Zhejiang Province 310024, China.

出版信息

Bioinformatics. 2022 Mar 4;38(6):1525-1531. doi: 10.1093/bioinformatics/btab878.

DOI:10.1093/bioinformatics/btab878
PMID:34999750
Abstract

MOTIVATION

Peptide identification of data-independent acquisition (DIA) mass spectrometry applying the peptide-centric approach heavily relies on the spectral library matching, such as the fragment intensity similarity. If the intensity similarity is calculated through all possible fragment ions of a targeted peptide instead of just a few fragment ions provided by the spectral library, the matching will be more comprehensive and reliable, and thus the identification will be more confident. In addition, the emergence of high precision spectrum predictors, like Prosit, also makes it possible to capitalize on the predicted spectrum, which contains all possible fragment ion intensities, to calculate the intensity similarity for DIA data.

RESULTS

In this work, we propose Alpha-Tri, a neural-network-based model to calculate intensity similarity as a post-processing score using the predicted spectrum, measured spectrum and correlation spectrum (triple-spectrum). The predicted spectrum is generated by Prosit, the measured spectrum is retrieved from the apex of the chromatograms of all possible fragment ions and the correlation spectrum is used to indicate the present probabilities of these fragment ions as the link between the precursor and its fragment ions is lost in DIA. By adopting a data-driven method, Alpha-Tri is able to learn the intensity similarity from the triple-spectrum. This learned value is appended to initial scores from DIA-NN, allowing the ensuing statistical validation tool to report more peptides at the same false discovery rate (FDR). In our evaluation of the HeLa dataset with gradient lengths ranging from 0.5 to 2 h, Alpha-Tri delivered 3.0-7.2% gains in peptide detections at 1% FDR. On LFQbench dataset, a mixed-species dataset with known ratios, Alpha-Tri identified more peptides and proteins fell within the valid ratio ranges by up to 8.6% and 7.6%, respectively, compared with DIA-NN solely.

AVAILABILITY AND IMPLEMENTATION

The original datasets for benchmarks are downloaded from the ProteomeXchange with the identifiers PXD005573, PXD000954 and PXD002952. Source code is available at https://github.com/YuAirLab/Alpha-Tri.

摘要

动机

应用基于肽的方法对数据独立采集 (DIA) 质谱进行肽鉴定严重依赖于谱库匹配,例如片段强度相似性。如果通过靶向肽的所有可能的片段离子而不是仅通过谱库提供的少数几个片段离子来计算强度相似性,则匹配将更加全面和可靠,从而鉴定将更加有信心。此外,高精度谱预测器(如 Prosit)的出现也使得可以利用包含所有可能的片段离子强度的预测谱来计算 DIA 数据的强度相似性。

结果

在这项工作中,我们提出了基于神经网络的模型 Alpha-Tri,该模型使用预测谱、测量谱和相关谱(三谱)作为后处理评分来计算强度相似性。预测谱由 Prosit 生成,测量谱从所有可能的片段离子色谱峰的顶点中检索,相关谱用于指示这些片段离子的存在概率,因为在 DIA 中,前体与其片段离子之间的连接丢失。通过采用数据驱动的方法,Alpha-Tri 能够从三谱中学习强度相似性。这个学习到的值被添加到来自 DIA-NN 的初始得分中,从而使随后的统计验证工具能够在相同的假发现率 (FDR) 下报告更多的肽。在我们对 HeLa 数据集的评估中,梯度长度从 0.5 到 2 小时不等,Alpha-Tri 在 1% FDR 下提供了 3.0-7.2%的肽检测增益。在 LFQbench 数据集上,一个具有已知比例的混合物种数据集,与仅使用 DIA-NN 相比,Alpha-Tri 分别确定了更多的肽和蛋白质落在有效比例范围内,最多可达 8.6%和 7.6%。

可用性和实现

基准测试的原始数据集从 ProteomeXchange 下载,标识符为 PXD005573、PXD000954 和 PXD002952。源代码可在 https://github.com/YuAirLab/Alpha-Tri 上获得。

相似文献

1
Alpha-Tri: a deep neural network for scoring the similarity between predicted and measured spectra improves peptide identification of DIA data.Alpha-Tri:一种用于评分预测谱和实测谱之间相似度的深度神经网络,可提高 DIA 数据的肽鉴定。
Bioinformatics. 2022 Mar 4;38(6):1525-1531. doi: 10.1093/bioinformatics/btab878.
2
Alpha-XIC: a deep neural network for scoring the coelution of peak groups improves peptide identification by data-independent acquisition mass spectrometry.Alpha-XIC:一种用于评分峰组共洗脱的深度神经网络,通过数据非依赖性采集质谱法提高肽鉴定的性能。
Bioinformatics. 2021 Dec 22;38(1):38-43. doi: 10.1093/bioinformatics/btab544.
3
MSSort-DIA: A deep learning classification tool of the peptide precursors quantified by OpenSWATH.MSSort-DIA:一种基于深度学习的 OpenSWATH 定量肽前体分类工具。
J Proteomics. 2022 May 15;259:104542. doi: 10.1016/j.jprot.2022.104542. Epub 2022 Feb 26.
4
Prosit: proteome-wide prediction of peptide tandem mass spectra by deep learning.Prosit:基于深度学习的肽串联质谱的蛋白质组范围预测。
Nat Methods. 2019 Jun;16(6):509-518. doi: 10.1038/s41592-019-0426-7. Epub 2019 May 27.
5
[Advances of peptide-centric data-independent acquisition analysis algorithms and software tools].[以肽段为中心的数据非依赖采集分析算法和软件工具的进展]
Sheng Wu Gong Cheng Xue Bao. 2023 Sep 25;39(9):3579-3593. doi: 10.13345/j.cjb.230079.
6
Data-Independent Acquisition Coupled to Visible Laser-Induced Dissociation at 473 nm (DIA-LID) for Peptide-Centric Specific Analysis of Cysteine-Containing Peptide Subset.基于数据非依赖采集的可见光激光诱导解离(DIA-LID)在含半胱氨酸肽亚组的以肽为中心的特异性分析中的应用。
Anal Chem. 2018 Mar 20;90(6):3928-3935. doi: 10.1021/acs.analchem.7b04821. Epub 2018 Feb 28.
7
UniSpec: Deep Learning for Predicting the Full Range of Peptide Fragment Ion Series to Enhance the Proteomics Data Analysis Workflow.UniSpec:用于预测全范围肽段碎片离子系列的深度学习,以增强蛋白质组学数据分析工作流程。
Anal Chem. 2024 Feb 8. doi: 10.1021/acs.analchem.3c02321.
8
[Research progress and application of retention time prediction method based on deep learning].基于深度学习的保留时间预测方法的研究进展与应用
Se Pu. 2021 Mar;39(3):211-218. doi: 10.3724/SP.J.1123.2020.08015.
9
Improved identification and quantification of peptides in mass spectrometry data via chemical and random additive noise elimination (CRANE).通过化学和随机加性噪声消除(CRANE)提高质谱数据中肽的鉴定和定量。
Bioinformatics. 2021 Dec 11;37(24):4719-4726. doi: 10.1093/bioinformatics/btab563.
10
Calib-RT: an open source python package for peptide retention time calibration in DIA mass spectrometry data.Calib-RT:一个用于 DIA 质谱数据中肽保留时间校准的开源 Python 包。
Bioinformatics. 2024 Jul 1;40(7). doi: 10.1093/bioinformatics/btae417.

引用本文的文献

1
DIA-BERT: pre-trained end-to-end transformer models for enhanced DIA proteomics data analysis.DIA-BERT:用于增强DIA蛋白质组学数据分析的预训练端到端Transformer模型。
Nat Commun. 2025 Apr 14;16(1):3530. doi: 10.1038/s41467-025-58866-4.