Sigrist Christian J A, Cerutti Lorenzo, Hulo Nicolas, Gattiker Alexandre, Falquet Laurent, Pagni Marco, Bairoch Amos, Bucher Philipp
Swiss Institute of Bioinformatics CMU, University of Geneva.
Brief Bioinform. 2002 Sep;3(3):265-74. doi: 10.1093/bib/3.3.265.
Among the databases dedicated to identifying protein families and domains, PROSITE was the first to be created and has evolved continuously since. PROSITE currently consists of a large collection of biologically meaningful motifs, described as patterns or profiles and linked to documentation that briefly describes the protein family or domain each motif is designed to detect. The close relationship between PROSITE and the SWISS-PROT protein database makes it possible to evaluate the sensitivity and specificity of PROSITE motifs and to review them periodically. In return, PROSITE is used to help annotate SWISS-PROT entries. This paper reviews the main characteristics of PROSITE and the techniques it uses to identify families and domains.
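To make the idea of evaluating a pattern's sensitivity and specificity concrete, the following is a minimal Python sketch, not the method used by the PROSITE curators. It assumes the basic PROSITE pattern syntax (`x` for any residue, `[...]` for allowed residues, `{...}` for forbidden residues, `(n)` or `(n,m)` for repetition, `<` and `>` for sequence ends) and ignores rarer constructs. The function names `prosite_to_regex` and `evaluate`, the toy sequences, and their labels are all hypothetical. Sensitivity and specificity use the standard statistical definitions, TP/(TP+FN) and TN/(TN+FP), which may differ from the definitions adopted in the paper.

```python
import re


def prosite_to_regex(pattern: str) -> str:
    """Translate a basic PROSITE-style pattern into a Python regex string.

    Assumption: only the common syntax elements are handled
    (x, [..], {..}, (n), (n,m), leading '<', trailing '>').
    """
    pattern = pattern.rstrip(".")
    parts = []
    for element in pattern.split("-"):
        prefix = suffix = ""
        if element.startswith("<"):      # anchored at the N-terminus
            prefix, element = "^", element[1:]
        if element.endswith(">"):        # anchored at the C-terminus
            suffix, element = "$", element[:-1]
        # Separate the residue specification from an optional repeat count.
        match = re.fullmatch(r"(.+?)(?:\((\d+)(?:,(\d+))?\))?", element)
        core, low, high = match.groups()
        if core == "x":
            core = "."                   # any residue
        elif core.startswith("{"):
            core = "[^" + core[1:-1] + "]"   # forbidden residues
        if low and high:
            core += "{" + low + "," + high + "}"
        elif low:
            core += "{" + low + "}"
        parts.append(prefix + core + suffix)
    return "".join(parts)


def evaluate(pattern: str, labelled):
    """Return (sensitivity, specificity) of a pattern on labelled sequences.

    `labelled` is an iterable of (sequence, is_family_member) pairs.
    """
    regex = re.compile(prosite_to_regex(pattern))
    tp = fp = tn = fn = 0
    for sequence, is_member in labelled:
        hit = regex.search(sequence) is not None
        if hit and is_member:
            tp += 1
        elif hit and not is_member:
            fp += 1
        elif not hit and is_member:
            fn += 1
        else:
            tn += 1
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    return sensitivity, specificity


# Hypothetical toy data: made-up sequences with made-up family labels.
TOY_SET = [
    ("AACGHCAA", True),
    ("CKKCMM", True),
    ("ACGGGCA", True),    # family member the pattern misses
    ("MMMKKK", False),
    ("LLCAACLL", False),  # non-member the pattern matches by chance
    ("GGGG", False),
]

sensitivity, specificity = evaluate("C-x(2)-C", TOY_SET)
print(f"sensitivity={sensitivity:.2f} specificity={specificity:.2f}")
```

In practice the labelled set would come from an annotated sequence database such as SWISS-PROT rather than from hand-written strings, which is what makes periodic re-evaluation of each motif possible.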