通过比较多种串联质谱搜索算法鉴定黄曲霉中的可变剪接异构体。

Identification of alternative splice variants in Aspergillus flavus through comparison of multiple tandem MS search algorithms.

机构信息

Bioinformatics Research Center, North Carolina State University, Raleigh, NC 27695, USA.

出版信息

BMC Genomics. 2011 Jul 11;12:358. doi: 10.1186/1471-2164-12-358.

DOI:10.1186/1471-2164-12-358

PMID:21745387

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3146456/

Abstract

BACKGROUND

Database searching is the most frequently used approach for automated peptide assignment and protein inference of tandem mass spectra. The results, however, depend on the sequences in target databases and on search algorithms. Recently by using an alternative splicing database, we identified more proteins than with the annotated proteins in Aspergillus flavus. In this study, we aimed at finding a greater number of eligible splice variants based on newly available transcript sequences and the latest genome annotation. The improved database was then used to compare four search algorithms: Mascot, OMSSA, X! Tandem, and InsPecT.

RESULTS

The updated alternative splicing database predicted 15833 putative protein variants, 61% more than the previous results. There was transcript evidence for 50% of the updated genes compared to the previous 35% coverage. Database searches were conducted using the same set of spectral data, search parameters, and protein database but with different algorithms. The false discovery rates of the peptide-spectrum matches were estimated < 2%. The numbers of the total identified proteins varied from 765 to 867 between algorithms. Whereas 42% (1651/3891) of peptide assignments were unanimous, the comparison showed that 51% (568/1114) of the RefSeq proteins and 15% (11/72) of the putative splice variants were inferred by all algorithms. 12 plausible isoforms were discovered by focusing on the consensus peptides which were detected by at least three different algorithms. The analysis found different conserved domains in two putative isoforms of UDP-galactose 4-epimerase.

CONCLUSIONS

We were able to detect dozens of new peptides using the improved alternative splicing database with the recently updated annotation of the A. flavus genome. Unlike the identifications of the peptides and the RefSeq proteins, large variations existed between the putative splice variants identified by different algorithms. 12 candidates of putative isoforms were reported based on the consensus peptide-spectrum matches. This suggests that applications of multiple search engines effectively reduced the possible false positive results and validated the protein identifications from tandem mass spectra using an alternative splicing database.

摘要

背景

数据库搜索是自动化肽分配和串联质谱蛋白质推断最常用的方法。然而，结果取决于目标数据库中的序列和搜索算法。最近，我们使用替代剪接数据库，鉴定到的蛋白质比黄曲霉中注释的蛋白质更多。在这项研究中，我们旨在根据新获得的转录序列和最新的基因组注释找到更多合格的剪接变体。然后使用改进的数据库比较了四种搜索算法：Mascot、OMSSA、X!Tandem 和 InsPecT。

结果

更新的替代剪接数据库预测了 15833 个假定的蛋白质变体，比之前的结果多 61%。与之前 35%的覆盖范围相比，有转录证据的更新基因占 50%。数据库搜索使用相同的光谱数据集、搜索参数和蛋白质数据库进行，但使用不同的算法。肽谱匹配的假发现率估计<2%。不同算法鉴定的总蛋白质数量在 765 到 867 之间变化。虽然 42%（1651/3891）的肽分配是一致的，但比较表明，所有算法都推断出 51%（568/1114）的 RefSeq 蛋白和 15%（11/72）的假定剪接变体。通过关注至少三种不同算法检测到的共识肽，发现了 12 个合理的同工型。分析发现，两种假定的 UDP-半乳糖 4-差向异构酶同工型中存在不同的保守结构域。

结论

我们能够使用改进的替代剪接数据库和黄曲霉基因组的最新更新注释来检测数十个新的肽。与肽和 RefSeq 蛋白的鉴定不同，不同算法鉴定的假定剪接变体之间存在很大差异。根据共识肽谱匹配，报告了 12 个假定同工型的候选者。这表明应用多个搜索引擎可以有效地减少可能的假阳性结果，并使用替代剪接数据库验证串联质谱的蛋白质鉴定。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f8b9/3146456/818372e46681/1471-2164-12-358-1.jpg

相似文献

Identification of alternative splice variants in Aspergillus flavus through comparison of multiple tandem MS search algorithms.通过比较多种串联质谱搜索算法鉴定黄曲霉中的可变剪接异构体。

BMC Genomics. 2011 Jul 11;12:358. doi: 10.1186/1471-2164-12-358.

Detection of alternative splice variants at the proteome level in Aspergillus flavus.在黄曲霉中进行蛋白质组水平的可变剪接变体检测。

J Proteome Res. 2010 Mar 5;9(3):1209-17. doi: 10.1021/pr900602d.

In-depth analysis of protein inference algorithms using multiple search engines and well-defined metrics.使用多个搜索引擎和明确的指标对蛋白质推断算法进行深入分析。

J Proteomics. 2017 Jan 6;150:170-182. doi: 10.1016/j.jprot.2016.08.002. Epub 2016 Aug 4.

Identification of novel alternative splicing biomarkers for breast cancer with LC/MS/MS and RNA-Seq.利用 LC/MS/MS 和 RNA-Seq 鉴定乳腺癌新型可变剪接生物标志物。

BMC Bioinformatics. 2020 Dec 3;21(Suppl 9):541. doi: 10.1186/s12859-020-03824-8.

PEPPI: a peptidomic database of human protein isoforms for proteomics experiments.PEPPI：人类蛋白质同工型的肽组学数据库，用于蛋白质组学实验。

BMC Bioinformatics. 2010 Oct 7;11 Suppl 6(Suppl 6):S7. doi: 10.1186/1471-2105-11-S6-S7.

Prophossi: automating expert validation of phosphopeptide-spectrum matches from tandem mass spectrometry.Prophossi：自动化磷酸肽谱匹配的专家验证，源自串联质谱技术。

Bioinformatics. 2010 Sep 1;26(17):2153-9. doi: 10.1093/bioinformatics/btq341. Epub 2010 Jul 22.

Enhanced peptide identification by electron transfer dissociation using an improved Mascot Percolator.采用改进的 Mascot Percolator 进行电子转移解离增强肽鉴定。

Mol Cell Proteomics. 2012 Aug;11(8):478-91. doi: 10.1074/mcp.O111.014522. Epub 2012 Apr 6.

Discovery and mass spectrometric analysis of novel splice-junction peptides using RNA-Seq.利用 RNA-Seq 发现和质谱分析新型剪接连接肽。

Mol Cell Proteomics. 2013 Aug;12(8):2341-53. doi: 10.1074/mcp.O113.028142. Epub 2013 Apr 29.

MassMatrix: a database search program for rapid characterization of proteins and peptides from tandem mass spectrometry data.质量矩阵：一种用于从串联质谱数据中快速鉴定蛋白质和肽段的数据库搜索程序。

Proteomics. 2009 Mar;9(6):1548-55. doi: 10.1002/pmic.200700322.

The generating function of CID, ETD, and CID/ETD pairs of tandem mass spectra: applications to database search.串联质谱的 CID、ETD 和 CID/ETD 对的生成函数：在数据库搜索中的应用。

Mol Cell Proteomics. 2010 Dec;9(12):2840-52. doi: 10.1074/mcp.M110.003731. Epub 2010 Sep 9.

引用本文的文献

Transcriptional Landscapes of Long Non-coding RNAs and Alternative Splicing in Revealed by RNA-Seq.RNA测序揭示的长链非编码RNA转录图谱与可变剪接

Front Plant Sci. 2021 Sep 8;12:723636. doi: 10.3389/fpls.2021.723636. eCollection 2021.

Alternative Splicing of the Aflatoxin-Associated Baeyer⁻Villiger Monooxygenase from : Characterisation of MoxY Isoforms.来自：黄曲霉毒素相关 Baeyer-Villiger 单加氧酶的可变剪接。MoxY 同工型的特性。

Toxins (Basel). 2018 Dec 5;10(12):521. doi: 10.3390/toxins10120521.

Comparative transcriptomics uncovers alternative splicing and molecular marker development in radish (Raphanus sativus L.).比较转录组学揭示了萝卜（Raphanus sativus L.）中的可变剪接和分子标记开发。

BMC Genomics. 2017 Jul 3;18(1):505. doi: 10.1186/s12864-017-3874-4.

Alternative Splicing May Not Be the Key to Proteome Complexity.可变剪接可能并非蛋白质组复杂性的关键所在。

Trends Biochem Sci. 2017 Feb;42(2):98-110. doi: 10.1016/j.tibs.2016.08.008. Epub 2016 Oct 3.

Integrative analyses reveal transcriptome-proteome correlation in biological pathways and secondary metabolism clusters in A. flavus in response to temperature.综合分析揭示了黄曲霉在响应温度时生物途径和次生代谢物簇中的转录组-蛋白质组相关性。

Sci Rep. 2015 Sep 29;5:14582. doi: 10.1038/srep14582.

Transcriptome analysis of the filamentous fungus Aspergillus nidulans directed to the global identification of promoters.针对丝状真菌构巢曲霉启动子进行全局鉴定的转录组分析。

BMC Genomics. 2013 Dec 3;14(1):847. doi: 10.1186/1471-2164-14-847.

Draft genome of Omphalotus olearius provides a predictive framework for sesquiterpenoid natural product biosynthesis in Basidiomycota.豹皮香菇的基因组草图为担子菌门倍半萜类天然产物生物合成提供了一个预测框架。

Chem Biol. 2012 Jun 22;19(6):772-83. doi: 10.1016/j.chembiol.2012.05.012.

本文引用的文献

An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database.一种将肽的串联质谱数据与蛋白质数据库中氨基酸序列相关联的方法。

J Am Soc Mass Spectrom. 1994 Nov;5(11):976-89. doi: 10.1016/1044-0305(94)80016-2.

Andromeda: a peptide search engine integrated into the MaxQuant environment.Andromeda：集成到 MaxQuant 环境中的肽搜索引擎。

J Proteome Res. 2011 Apr 1;10(4):1794-805. doi: 10.1021/pr101065j. Epub 2011 Feb 22.

Maximizing the sensitivity and reliability of peptide identification in large-scale proteomic experiments by harnessing multiple search engines.利用多个搜索引擎，最大限度地提高大规模蛋白质组学实验中肽鉴定的灵敏度和可靠性。

Proteomics. 2010 Mar;10(6):1172-89. doi: 10.1002/pmic.200900074.

Detection of alternative splice variants at the proteome level in Aspergillus flavus.在黄曲霉中进行蛋白质组水平的可变剪接变体检测。

J Proteome Res. 2010 Mar 5;9(3):1209-17. doi: 10.1021/pr900602d.

De novo sequencing methods in proteomics.蛋白质组学中的从头测序方法。

Methods Mol Biol. 2010;604:105-21. doi: 10.1007/978-1-60761-444-9_8.

CDD: specific functional annotation with the Conserved Domain Database.CDD：使用保守结构域数据库进行特定功能注释。

Nucleic Acids Res. 2009 Jan;37(Database issue):D205-10. doi: 10.1093/nar/gkn845. Epub 2008 Nov 4.

Temperature-dependent regulation of proteins in Aspergillus flavus: whole organism stable isotope labeling by amino acids.黄曲霉中蛋白质的温度依赖性调控：基于氨基酸的全生物体稳定同位素标记

J Proteome Res. 2008 Jul;7(7):2973-9. doi: 10.1021/pr8001047. Epub 2008 Jun 5.

Improving sensitivity by probabilistically combining results from multiple MS/MS search methodologies.通过概率性合并多种串联质谱（MS/MS）搜索方法的结果来提高灵敏度。

J Proteome Res. 2008 Jan;7(1):245-53. doi: 10.1021/pr070540w.

False discovery rates and related statistical concepts in mass spectrometry-based proteomics.基于质谱的蛋白质组学中的错误发现率及相关统计概念。

J Proteome Res. 2008 Jan;7(1):47-50. doi: 10.1021/pr700747q. Epub 2007 Dec 8.

Assigning significance to peptides identified by tandem mass spectrometry using decoy databases.使用诱饵数据库对通过串联质谱鉴定的肽段赋予显著性。

J Proteome Res. 2008 Jan;7(1):29-34. doi: 10.1021/pr700600n. Epub 2007 Dec 8.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

通过比较多种串联质谱搜索算法鉴定黄曲霉中的可变剪接异构体。

Identification of alternative splice variants in Aspergillus flavus through comparison of multiple tandem MS search algorithms.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献