SPIDER: software for protein identification from sequence tags with de novo sequencing error.

Authors

Han Yonghua, Ma Bin, Zhang Kaizhong

Affiliation

Department of Computer Science, University of Western Ontario, London, Canada.

Publication information

Proc IEEE Comput Syst Bioinform Conf. 2004:206-15. doi: 10.1109/csb.2004.1332434.

DOI:10.1109/csb.2004.1332434

PMID:16448014

Abstract

For the identification of novel proteins using MS/MS, de novo sequencing software computes one or several possible amino acid sequences (called sequence tags) for each MS/MS spectrum. Those tags are then matched against the sequences in a protein database, allowing for amino acid mutations. If de novo sequencing gives correct tags, homologs of the proteins can be identified by this approach, and software such as MS-BLAST is available for the matching. However, de novo sequencing very often gives only partially correct tags. The most common error is that a segment of amino acids is replaced by another segment with approximately the same mass. We developed a new efficient algorithm to match sequence tags with errors to database sequences for the purpose of protein and peptide identification. A software package, SPIDER, was developed and made available on the Internet for free public use. This paper describes the algorithms and features of the SPIDER software.
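
The error model in the abstract (a segment replaced by another segment of approximately the same mass) can be illustrated with a small sketch. This is not the SPIDER algorithm, which the abstract does not specify; it is a toy dynamic program that ignores homology mutations entirely. The function names, the 0.01 Da tolerance, and the maximum segment length are assumptions chosen for illustration; the residue masses are standard monoisotopic values.

```python
# Standard monoisotopic residue masses in daltons.
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}


def segment_mass(seg):
    """Sum of residue masses for a segment of amino acids."""
    return sum(MONO[r] for r in seg)


def same_mass(a, b, tol=0.01):
    """True if two segments have equal mass within tol (hypothetical tolerance)."""
    return abs(segment_mass(a) - segment_mass(b)) <= tol


def matches_with_mass_swaps(tag, db_seq, tol=0.01, max_seg=3):
    """Toy check: can tag and db_seq be split into aligned blocks of equal mass?

    reach[i][j] is True when tag[:i] and db_seq[:j] can be fully paired up
    as blocks (each at most max_seg residues long) whose masses agree.
    Identical residues trivially have equal mass, so exact matches pass.
    """
    n, m = len(tag), len(db_seq)
    reach = [[False] * (m + 1) for _ in range(n + 1)]
    reach[0][0] = True
    for i in range(n + 1):
        for j in range(m + 1):
            if not reach[i][j]:
                continue
            # Try extending the alignment with one more pair of blocks.
            for a in range(1, min(max_seg, n - i) + 1):
                for b in range(1, min(max_seg, m - j) + 1):
                    if same_mass(tag[i:i + a], db_seq[j:j + b], tol):
                        reach[i + a][j + b] = True
    return reach[n][m]
```

For instance, under these assumptions `matches_with_mass_swaps("LGGK", "LNK")` succeeds because GG and N have nearly identical mass, whereas `matches_with_mass_swaps("LGGK", "LDK")` fails because no block pairing balances the masses.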

Similar articles

SPIDER: software for protein identification from sequence tags with de novo sequencing error.

Proc IEEE Comput Syst Bioinform Conf. 2004:206-15. doi: 10.1109/csb.2004.1332434.

SPIDER: software for protein identification from sequence tags with de novo sequencing error.

J Bioinform Comput Biol. 2005 Jun;3(3):697-716. doi: 10.1142/s0219720005001247.

Searching sequence databases via de novo peptide sequencing by tandem mass spectrometry.

Mol Biotechnol. 2002 Nov;22(3):301-15. doi: 10.1385/MB:22:3:301.

DeNovoID: a web-based tool for identifying peptides from sequence and mass tags deduced from de novo peptide sequencing by mass spectroscopy.

Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W376-81. doi: 10.1093/nar/gki461.

TANDEM: matching proteins with tandem mass spectra.

Bioinformatics. 2004 Jun 12;20(9):1466-7. doi: 10.1093/bioinformatics/bth092. Epub 2004 Feb 19.

Mass spectrometry-based protein identification by integrating de novo sequencing with database searching.

BMC Bioinformatics. 2013;14 Suppl 2(Suppl 2):S24. doi: 10.1186/1471-2105-14-S2-S24. Epub 2013 Jan 21.

Evaluating de novo sequencing in proteomics: already an accurate alternative to database-driven peptide identification?

Brief Bioinform. 2018 Sep 28;19(5):954-970. doi: 10.1093/bib/bbx033.

Robust accurate identification of peptides (RAId): deciphering MS2 data using a structured library search with de novo based statistics.

Bioinformatics. 2005 Oct 1;21(19):3726-32. doi: 10.1093/bioinformatics/bti620. Epub 2005 Aug 16.

MagicMatch: cross-referencing sequence identifiers across databases.

Bioinformatics. 2005 Aug 15;21(16):3429-30. doi: 10.1093/bioinformatics/bti548. Epub 2005 Jun 16.

Finding homologs in amino acid sequences using network BLAST searches.

Curr Protoc Bioinformatics. 2003 Feb;Chapter 3:Unit 3.4. doi: 10.1002/0471250953.bi0304s00.

Cited by

Phylogenetically informative proteins from an Early Miocene rhinocerotid.

Nature. 2025 Jul 9. doi: 10.1038/s41586-025-09231-4.

NovoLign: metaproteomics by sequence alignment.

ISME Commun. 2024 Oct 12;4(1):ycae121. doi: 10.1093/ismeco/ycae121. eCollection 2024 Jan.

Coexistence of nonfluorescent chromoproteins and fluorescent proteins in massive spp. corals manifesting a pink pigmentation response.

Front Physiol. 2024 Jun 17;15:1339907. doi: 10.3389/fphys.2024.1339907. eCollection 2024.

SeqWiz: a modularized toolkit for next-generation protein sequence database management and analysis.

BMC Bioinformatics. 2023 May 17;24(1):201. doi: 10.1186/s12859-023-05334-9.

The Experimental Proteome of Promastigote and Its Usefulness for Improving Gene Annotations.

Genes (Basel). 2020 Sep 2;11(9):1036. doi: 10.3390/genes11091036.

Human cells adapt to translational errors by modulating protein synthesis rate and protein turnover.

RNA Biol. 2020 Jan;17(1):135-149. doi: 10.1080/15476286.2019.1670039. Epub 2019 Oct 1.

Mass Spectrometry-Based Methodologies for Targeted and Untargeted Identification of Protein Covalent Adducts (Adductomics): Current Status and Challenges.

High Throughput. 2019 Apr 23;8(2):9. doi: 10.3390/ht8020009.

Codon misreading tRNAs promote tumor growth in mice.

RNA Biol. 2018;15(6):773-786. doi: 10.1080/15476286.2018.1454244. Epub 2018 Jun 7.

The Progestin Receptor Interactome in the Female Mouse Hypothalamus: Interactions with Synaptic Proteins Are Isoform Specific and Ligand Dependent.

eNeuro. 2017 Sep 20;4(5). doi: 10.1523/ENEURO.0272-17.2017. eCollection 2017 Sep-Oct.

An enhanced algorithm for multiple sequence alignment of protein sequences using genetic algorithm.

EXCLI J. 2015 Dec 15;14:1232-55. doi: 10.17179/excli2015-302. eCollection 2015.
