Attwood Terri K
School of Biological Sciences, Department of Computer Science, The University of Manchester, UK.
Brief Bioinform. 2002 Sep;3(3):252-63. doi: 10.1093/bib/3.3.252.
The PRINTS database houses a collection of protein fingerprints, which may be used to assign family and functional attributes to uncharacterised sequences, such as those currently emanating from the various genome-sequencing projects. The April 2002 release includes 1,700 family fingerprints, encoding approximately 10,500 motifs, covering a range of globular and membrane proteins, modular polypeptides and so on. Fingerprints are groups of conserved motifs that, taken together, provide diagnostic protein family signatures. They derive much of their potency from the biological context afforded by matching motif neighbours; this makes them at once more flexible and powerful than single-motif approaches. The technique further departs from other pattern-matching methods by readily allowing the creation of fingerprints at superfamily-, family- and subfamily-specific levels, thereby allowing more fine-grained diagnoses. Here, we provide an overview of the method of protein fingerprinting and how the results of fingerprint analyses are used to build PRINTS and its relational cousin, PRINTS-S.
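To make the idea of a fingerprint concrete, the following is a minimal illustrative sketch, not the PRINTS scanning method. It assumes a fingerprint can be approximated as an ordered list of regular-expression motifs and counts how many are found, in order, in a query sequence. The motif patterns, the example sequences and the names TOY_FINGERPRINT, match_fingerprint and matched_count are all hypothetical, and regular expressions are used here only as a simple stand-in for the conserved motifs described in the abstract.

```python
import re
from typing import List, Optional, Tuple

# Hypothetical toy fingerprint: an ordered list of motif patterns.
# These patterns are invented for illustration and are not PRINTS entries.
TOY_FINGERPRINT = ["G.GKS", "DE[AV]H", "W..R"]

Span = Tuple[int, int]


def match_fingerprint(sequence: str, motifs: List[str]) -> List[Optional[Span]]:
    """Search for each motif in order, starting after the previous hit.

    Returns one entry per motif: the (start, end) span of the match,
    or None if the motif was not found downstream of the previous hit.
    """
    hits: List[Optional[Span]] = []
    pos = 0
    for pattern in motifs:
        m = re.compile(pattern).search(sequence, pos)
        if m is None:
            hits.append(None)
        else:
            hits.append((m.start(), m.end()))
            pos = m.end()  # later motifs must lie after this one
    return hits


def matched_count(hits: List[Optional[Span]]) -> int:
    """Number of motifs in the fingerprint that were matched."""
    return sum(h is not None for h in hits)


if __name__ == "__main__":
    full = "MAGSGKSTTLDEAHQQWLLRK"   # invented sequence containing all three motifs
    partial = "MAGSGKSTTLQQWLLRK"    # invented sequence lacking the second motif
    for name, seq in (("full", full), ("partial", partial)):
        hits = match_fingerprint(seq, TOY_FINGERPRINT)
        print(name, hits, f"{matched_count(hits)}/{len(TOY_FINGERPRINT)} motifs")
```

The point of the sketch is that a sequence matching only some motifs still yields an interpretable partial result, because the surrounding matches supply context. A single-motif test would give only a yes or no answer.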