Suppr超能文献

SpliceProt:一个预测的人类剪接变体的蛋白质序列数据库。

SpliceProt: a protein sequence repository of predicted human splice variants.

作者信息

Tavares Raphael, de Miranda Scherer Nicole, Pauletti Bianca Alves, Araújo Elói, Folador Edson Luiz, Espindola Gabriel, Ferreira Carlos Gil, Paes Leme Adriana Franco, de Oliveira Paulo Sergio Lopes, Passetti Fabio

机构信息

Bioinformatics Unit, Clinical Research Coordination, Instituto Nacional de Câncer (INCA), Rio de Janeiro, Brazil.

出版信息

Proteomics. 2014 Feb;14(2-3):181-5. doi: 10.1002/pmic.201300078.

Abstract

The mechanism of alternative splicing in the transcriptome may increase the proteome diversity in eukaryotes. In proteomics, several studies aim to use protein sequence repositories to annotate MS experiments or to detect differentially expressed proteins. However, the available protein sequence repositories are not designed to fully detect protein isoforms derived from mRNA splice variants. To foster knowledge for the field, here we introduce SpliceProt, a new protein sequence repository of transcriptome experimental data used to investigate for putative splice variants in human proteomes. Current version of SpliceProt contains 159 719 non-redundant putative polypeptide sequences. The assessment of the potential of SpliceProt in detecting new protein isoforms resulting from alternative splicing was performed by using publicly available proteomics data. We detected 173 peptides hypothetically derived from splice variants, which 54 of them are not present in UniprotKB/TrEMBL sequence repository. In comparison to other protein sequence repositories, SpliceProt contains a greater number of unique peptides and is able to detect more splice variants. Therefore, SpliceProt provides a solution for the annotation of proteomics experiments regarding splice isofoms. The repository files containing the translated sequences of the predicted splice variants and a visualization tool are freely available at http://lbbc.inca.gov.br/spliceprot.

摘要

转录组中的可变剪接机制可能会增加真核生物中的蛋白质组多样性。在蛋白质组学中,有几项研究旨在利用蛋白质序列数据库来注释质谱实验或检测差异表达的蛋白质。然而,现有的蛋白质序列数据库并非设计用于全面检测源自mRNA剪接变体的蛋白质异构体。为推动该领域的知识发展,我们在此引入SpliceProt,这是一个新的转录组实验数据蛋白质序列数据库,用于研究人类蛋白质组中的假定剪接变体。SpliceProt的当前版本包含159719个非冗余假定多肽序列。通过使用公开可用的蛋白质组学数据,对SpliceProt检测由可变剪接产生的新蛋白质异构体的潜力进行了评估。我们检测到173个假定源自剪接变体的肽段,其中54个在UniprotKB/TrEMBL序列数据库中不存在。与其他蛋白质序列数据库相比,SpliceProt包含更多独特的肽段,并且能够检测到更多的剪接变体。因此,SpliceProt为蛋白质组学实验中关于剪接异构体的注释提供了解决方案。包含预测剪接变体翻译序列的数据库文件和一个可视化工具可在http://lbbc.inca.gov.br/spliceprot免费获取。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验