SPdb——一个信号肽数据库。
SPdb--a signal peptide database.
作者信息
Choo Khar Heng, Tan Tin Wee, Ranganathan Shoba
机构信息
Department of Biochemistry, National University of Singapore, Singapore.
出版信息
BMC Bioinformatics. 2005 Oct 13;6:249. doi: 10.1186/1471-2105-6-249.
BACKGROUND
The signal peptide plays an important role in protein targeting and protein translocation in both prokaryotic and eukaryotic cells. This transient, short peptide sequence functions like a postal address on an envelope by targeting proteins for secretion or for transfer to specific organelles for further processing. Understanding how signal peptides function is crucial in predicting where proteins are translocated. To support this understanding, we present SPdb signal peptide database http://proline.bic.nus.edu.sg/spdb, a repository of experimentally determined and computationally predicted signal peptides.
RESULTS
SPdb integrates information from two sources (a) Swiss-Prot protein sequence database which is now part of UniProt and (b) EMBL nucleotide sequence database. The database update is semi-automated with human checking and verification of the data to ensure the correctness of the data stored. The latest release SPdb release 3.2 contains 18,146 entries of which 2,584 entries are experimentally verified signal sequences; the remaining 15,562 entries are either signal sequences that fail to meet our filtering criteria or entries that contain unverified signal sequences.
CONCLUSION
SPdb is a manually curated database constructed to support the understanding and analysis of signal peptides. SPdb tracks the major updates of the two underlying primary databases thereby ensuring that its information remains up-to-date.
背景
信号肽在原核细胞和真核细胞的蛋白质靶向和蛋白质转运过程中发挥着重要作用。这个短暂的短肽序列的功能类似于信封上的邮政编码,可将蛋白质靶向分泌或转移到特定细胞器进行进一步加工。了解信号肽的功能对于预测蛋白质的转运位置至关重要。为了辅助这一理解,我们推出了SPdb信号肽数据库http://proline.bic.nus.edu.sg/spdb,这是一个存储通过实验确定和通过计算预测的信号肽的数据库。
结果
SPdb整合了来自两个来源的信息:(a)现已成为UniProt一部分的Swiss-Prot蛋白质序列数据库,以及(b)EMBL核苷酸序列数据库。数据库更新是半自动的,需要人工检查和验证数据,以确保所存储数据的正确性。最新版本SPdb 3.2包含18,146个条目,其中2,584个条目是经过实验验证的信号序列;其余15,562个条目要么是不符合我们筛选标准的信号序列,要么是包含未经验证的信号序列的条目。
结论
SPdb是一个经过人工精心策划的数据库,旨在支持对信号肽的理解和分析。SPdb跟踪两个基础主要数据库的重大更新,从而确保其信息始终保持最新状态。