• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

UniNovo:从头测序肽的通用工具。

UniNovo: a universal tool for de novo peptide sequencing.

机构信息

Department of Electrical and Computer Engineering and Department of Computer Science and Engineering, University of California-San Diego, CA 92093, USA.

出版信息

Bioinformatics. 2013 Aug 15;29(16):1953-62. doi: 10.1093/bioinformatics/btt338. Epub 2013 Jun 12.

DOI:10.1093/bioinformatics/btt338
PMID:23766417
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3722526/
Abstract

MOTIVATION

Mass spectrometry (MS) instruments and experimental protocols are rapidly advancing, but de novo peptide sequencing algorithms to analyze tandem mass (MS/MS) spectra are lagging behind. Although existing de novo sequencing tools perform well on certain types of spectra [e.g. Collision Induced Dissociation (CID) spectra of tryptic peptides], their performance often deteriorates on other types of spectra, such as Electron Transfer Dissociation (ETD), Higher-energy Collisional Dissociation (HCD) spectra or spectra of non-tryptic digests. Thus, rather than developing a new algorithm for each type of spectra, we develop a universal de novo sequencing algorithm called UniNovo that works well for all types of spectra or even for spectral pairs (e.g. CID/ETD spectral pairs). UniNovo uses an improved scoring function that captures the dependences between different ion types, where such dependencies are learned automatically using a modified offset frequency function.

RESULTS

The performance of UniNovo is compared with PepNovo+, PEAKS and pNovo using various types of spectra. The results show that the performance of UniNovo is superior to other tools for ETD spectra and superior or comparable with others for CID and HCD spectra. UniNovo also estimates the probability that each reported reconstruction is correct, using simple statistics that are readily obtained from a small training dataset. We demonstrate that the estimation is accurate for all tested types of spectra (including CID, HCD, ETD, CID/ETD and HCD/ETD spectra of trypsin, LysC or AspN digested peptides).

AVAILABILITY

UniNovo is implemented in JAVA and tested on Windows, Ubuntu and OS X machines. UniNovo is available at http://proteomics.ucsd.edu/Software/UniNovo.html along with the manual.

摘要

动机

质谱(MS)仪器和实验方案正在迅速发展,但用于分析串联质谱(MS/MS)谱的从头多肽测序算法却滞后了。尽管现有的从头测序工具在某些类型的光谱[例如胰蛋白酶肽的碰撞诱导解离(CID)光谱]上表现良好,但它们在其他类型的光谱上的性能往往会恶化,例如电子转移解离(ETD)、更高能量的碰撞解离(HCD)光谱或非胰蛋白酶消化的光谱。因此,我们没有为每种类型的光谱开发新的算法,而是开发了一种通用的从头测序算法 UniNovo,它适用于所有类型的光谱,甚至适用于光谱对(例如 CID/ETD 光谱对)。UniNovo 使用改进的评分函数来捕获不同离子类型之间的依赖关系,其中这些依赖关系是使用修改后的偏移频率函数自动学习的。

结果

使用各种类型的光谱比较了 UniNovo、PepNovo+、PEAKS 和 pNovo 的性能。结果表明,UniNovo 在 ETD 光谱上的性能优于其他工具,在 CID 和 HCD 光谱上的性能优于或可与其他工具相媲美。UniNovo 还使用简单的统计信息来估计每个报告重建的正确性的概率,这些信息可以从一个小的训练数据集轻松获得。我们证明,该估计对于所有测试类型的光谱(包括 CID、HCD、ETD、CID/ETD 和 HCD/ETD 胰蛋白酶、LysC 或 AspN 消化肽的光谱)都是准确的。

可用性

UniNovo 用 Java 实现,并在 Windows、Ubuntu 和 OS X 机器上进行了测试。UniNovo 可在 http://proteomics.ucsd.edu/Software/UniNovo.html 上获得,同时提供手册。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6882/3722526/16a8ce02df13/btt338f6p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6882/3722526/6884ce2eaff5/btt338f1p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6882/3722526/f9b457891697/btt338f2p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6882/3722526/ac7fcea02c3f/btt338f3p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6882/3722526/c411963fef98/btt338f4p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6882/3722526/6ad1ce7fbd61/btt338f5p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6882/3722526/16a8ce02df13/btt338f6p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6882/3722526/6884ce2eaff5/btt338f1p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6882/3722526/f9b457891697/btt338f2p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6882/3722526/ac7fcea02c3f/btt338f3p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6882/3722526/c411963fef98/btt338f4p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6882/3722526/6ad1ce7fbd61/btt338f5p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6882/3722526/16a8ce02df13/btt338f6p.jpg

相似文献

1
UniNovo: a universal tool for de novo peptide sequencing.UniNovo:从头测序肽的通用工具。
Bioinformatics. 2013 Aug 15;29(16):1953-62. doi: 10.1093/bioinformatics/btt338. Epub 2013 Jun 12.
2
pNovo+: de novo peptide sequencing using complementary HCD and ETD tandem mass spectra.pNovo+:使用互补的 HCD 和 ETD 串联质谱进行从头多肽测序。
J Proteome Res. 2013 Feb 1;12(2):615-25. doi: 10.1021/pr3006843. Epub 2012 Dec 28.
3
De novo peptide sequencing using CID and HCD spectra pairs.使用CID和HCD光谱对进行从头肽测序。
Proteomics. 2016 Oct;16(20):2615-2624. doi: 10.1002/pmic.201500251. Epub 2016 Aug 24.
4
pNovo: de novo peptide sequencing and identification using HCD spectra.pNovo:利用 HCD 谱进行从头多肽测序和鉴定。
J Proteome Res. 2010 May 7;9(5):2713-24. doi: 10.1021/pr100182k.
5
Spectra library assisted de novo peptide sequencing for HCD and ETD spectra pairs.用于HCD和ETD谱图对的光谱库辅助从头肽测序
BMC Bioinformatics. 2016 Dec 23;17(Suppl 17):538. doi: 10.1186/s12859-016-1330-0.
6
Mass spectrometric de novo sequencing of natural non-tryptic peptides: comparing peculiarities of collision-induced dissociation (CID) and high energy collision dissociation (HCD).天然非胰蛋白酶解肽段的质谱从头测序:比较碰撞诱导解离(CID)和高能碰撞解离(HCD)的特点
Rapid Commun Mass Spectrom. 2014 Dec 15;28(23):2595-604. doi: 10.1002/rcm.7049.
7
The generating function of CID, ETD, and CID/ETD pairs of tandem mass spectra: applications to database search.串联质谱的 CID、ETD 和 CID/ETD 对的生成函数:在数据库搜索中的应用。
Mol Cell Proteomics. 2010 Dec;9(12):2840-52. doi: 10.1074/mcp.M110.003731. Epub 2010 Sep 9.
8
Evaluating de novo sequencing in proteomics: already an accurate alternative to database-driven peptide identification?评估蛋白质组学中的从头测序:是否已经成为数据库驱动肽鉴定的准确替代方法?
Brief Bioinform. 2018 Sep 28;19(5):954-970. doi: 10.1093/bib/bbx033.
9
De novo peptide sequencing by tandem MS using complementary CID and electron transfer dissociation.采用互补的 CID 和电子转移解离的串联 MS 进行从头肽测序。
Electrophoresis. 2009 Nov;30(21):3736-47. doi: 10.1002/elps.200900332.
10
Sequencing-grade de novo analysis of MS/MS triplets (CID/HCD/ETD) from overlapping peptides.从头分析(CID/HCD/ETD)重叠肽的 MS/MS 三重体的测序级分析。
J Proteome Res. 2013 Jun 7;12(6):2846-57. doi: 10.1021/pr400173d. Epub 2013 May 30.

引用本文的文献

1
A Shotgun Proteomic-Based Approach with a Q-Exactive Hybrid Quadrupole-Orbitrap High-Resolution Mass Spectrometer for the Assessment of Pesticide Mixture-Induced Neurotoxicity on a 3D-Developed Neurospheroid Model from Human Brain Meningiomas: Identification of Trityl-Post-Translational Modification.基于鸟枪法蛋白质组学的方法,结合Q-Exactive混合型四极杆-轨道阱高分辨率质谱仪,用于评估农药混合物对源自人脑膜瘤的3D培养神经球模型的神经毒性:三苯甲基翻译后修饰的鉴定
J Proteome Res. 2024 Dec 6;23(12):5554-5576. doi: 10.1021/acs.jproteome.4c00804. Epub 2024 Nov 18.
2
De Novo Sequencing of Peptides from Tandem Mass Spectra and Applications in Proteogenomics.从头测序肽段的串联质谱分析及其在蛋白质基因组学中的应用。
Methods Mol Biol. 2025;2859:1-19. doi: 10.1007/978-1-0716-4152-1_1.
3

本文引用的文献

1
An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database.一种将肽的串联质谱数据与蛋白质数据库中氨基酸序列相关联的方法。
J Am Soc Mass Spectrom. 1994 Nov;5(11):976-89. doi: 10.1016/1044-0305(94)80016-2.
2
De novo sequencing and homology searching.从头测序和同源搜索。
Mol Cell Proteomics. 2012 Feb;11(2):O111.014902. doi: 10.1074/mcp.O111.014902. Epub 2011 Nov 16.
3
Gapped spectral dictionaries and their applications for database searches of tandem mass spectra.有间隙的光谱字典及其在串联质谱数据库搜索中的应用。
Expanding N-glycopeptide identifications by modeling fragmentation, elution, and glycome connectivity.通过模拟碎片化、洗脱和糖基连接来扩展 N-糖肽的鉴定。
Nat Commun. 2024 Jul 22;15(1):6168. doi: 10.1038/s41467-024-50338-5.
4
Accurate de novo peptide sequencing using fully convolutional neural networks.利用全卷积神经网络进行精确从头肽测序。
Nat Commun. 2023 Dec 2;14(1):7974. doi: 10.1038/s41467-023-43010-x.
5
Shotgun Proteomic-Based Approach with a Q-Exactive Hybrid Quadrupole-Orbitrap High-Resolution Mass Spectrometer for Protein Adductomics on a 3D Human Brain Tumor Neurospheroid Culture Model: The Identification of Adduct Formation in Calmodulin-Dependent Protein Kinase-2 and Annexin-A1 Induced by Pesticide Mixture.基于 shotgun 蛋白质组学的方法,结合 Q-Exactive 杂交四极杆轨道阱高分辨率质谱仪,用于在 3D 人脑肿瘤神经球培养模型上进行蛋白质加合物组学研究:农药混合物诱导钙调蛋白依赖性蛋白激酶-2 和膜联蛋白 A1 加合物形成的鉴定。
J Proteome Res. 2023 Dec 1;22(12):3811-3832. doi: 10.1021/acs.jproteome.3c00484. Epub 2023 Oct 31.
6
Illuminating the oral microbiome and its host interactions: recent advancements in omics and bioinformatics technologies in the context of oral microbiome research.阐明口腔微生物组及其宿主相互作用:组学和生物信息学技术在口腔微生物组研究背景下的最新进展。
FEMS Microbiol Rev. 2023 Sep 5;47(5). doi: 10.1093/femsre/fuad051.
7
Peptidomics Methods Applied to the Study of Flower Development.肽组学方法在花发育研究中的应用。
Methods Mol Biol. 2023;2686:509-536. doi: 10.1007/978-1-0716-3299-4_24.
8
The Current State-of-the-Art Identification of Unknown Proteins Using Mass Spectrometry Exemplified on De Novo Sequencing of a Venom Protease from .利用质谱法对来自 的毒液蛋白酶从头测序鉴定未知蛋白的最新技术进展举例说明
Molecules. 2022 Aug 5;27(15):4976. doi: 10.3390/molecules27154976.
9
Metaproteomics of the human gut microbiota: Challenges and contributions to other OMICS.人类肠道微生物群的宏蛋白质组学:挑战及对其他组学的贡献
Clin Mass Spectrom. 2019 Jun 4;14 Pt A:18-30. doi: 10.1016/j.clinms.2019.06.001. eCollection 2019 Sep.
10
Recent advances in mass spectrometry analysis of neuropeptides.近年来神经肽的质谱分析进展。
Mass Spectrom Rev. 2023 Mar;42(2):706-750. doi: 10.1002/mas.21734. Epub 2021 Sep 24.
Mol Cell Proteomics. 2011 Jun;10(6):M110.002220. doi: 10.1074/mcp.M110.002220. Epub 2011 Mar 28.
4
Improved peptide identification by targeted fragmentation using CID, HCD and ETD on an LTQ-Orbitrap Velos.在 LTQ-Orbitrap Velos 上使用 CID、HCD 和 ETD 进行靶向碎裂可提高肽段鉴定。
J Proteome Res. 2011 May 6;10(5):2377-88. doi: 10.1021/pr1011729. Epub 2011 Apr 1.
5
ADEPTS: advanced peptide de novo sequencing with a pair of tandem mass spectra.ADEPTS:利用一对串联质谱进行高级肽段从头测序
J Bioinform Comput Biol. 2010 Dec;8(6):981-94. doi: 10.1142/s0219720010005099.
6
The generating function of CID, ETD, and CID/ETD pairs of tandem mass spectra: applications to database search.串联质谱的 CID、ETD 和 CID/ETD 对的生成函数:在数据库搜索中的应用。
Mol Cell Proteomics. 2010 Dec;9(12):2840-52. doi: 10.1074/mcp.M110.003731. Epub 2010 Sep 9.
7
A survey of computational methods and error rate estimation procedures for peptide and protein identification in shotgun proteomics.用于在鸟枪法蛋白质组学中鉴定肽和蛋白质的计算方法和错误率估计程序的调查。
J Proteomics. 2010 Oct 10;73(11):2092-123. doi: 10.1016/j.jprot.2010.08.009. Epub 2010 Sep 8.
8
pNovo: de novo peptide sequencing and identification using HCD spectra.pNovo:利用 HCD 谱进行从头多肽测序和鉴定。
J Proteome Res. 2010 May 7;9(5):2713-24. doi: 10.1021/pr100182k.
9
Better score function for peptide identification with ETD MS/MS spectra.具有 ETD MS/MS 谱的肽鉴定的更好得分函数。
BMC Bioinformatics. 2010 Jan 18;11 Suppl 1(Suppl 1):S4. doi: 10.1186/1471-2105-11-S1-S4.
10
Value of using multiple proteases for large-scale mass spectrometry-based proteomics.利用多种蛋白酶进行大规模基于质谱的蛋白质组学研究的价值。
J Proteome Res. 2010 Mar 5;9(3):1323-9. doi: 10.1021/pr900863u.