Mallika V, Bhaduri Anirban, Sowdhamini R
National Centre for Biological Sciences, UAS-GKVK Campus, Bangalore 560 065, India.
Nucleic Acids Res. 2002 Jan 1;30(1):284-8. doi: 10.1093/nar/30.1.284.
PASS2 is a nearly automated version of CAMPASS and contains sequence alignments of proteins grouped at the level of superfamilies. This database has been created to fall in correspondence with SCOP database (1.53 release) and currently consists of 110 multi-member superfamilies and 613 superfamilies corresponding to single members. In multi-member superfamilies, protein chains with no more than 25% sequence identity have been considered for the alignment and hence the database aims to address sequence alignments which represent 26 219 protein domains under the SCOP 1.53 release. Structure-based sequence alignments have been obtained by COMPARER and the initial equivalences are provided automatically from a MALIGN alignment and subsequently augmented using STAMP4.0. The final sequence alignments have been annotated for the structural features using JOY4.0. Several interesting links are provided to other related databases and genome sequence relatives. Availability of reliable sequence alignments of distantly related proteins, despite poor sequence identity and single-member superfamilies, permit better sampling of structures in libraries for fold recognition of new sequences and for the understanding of protein structure-function relationships of individual superfamilies. The database can be queried by keywords and also by sequence search, interfaced by PSI-BLAST methods. Structure-annotated sequence alignments and several structural accessory files can be retrieved for all the superfamilies including the user-input sequence. The database can be accessed from http://www.ncbs.res.in/%7Efaculty/mini/campass/pass.html.
PASS2是CAMPASS的一个近乎自动化的版本,包含按超家族水平分组的蛋白质序列比对。创建这个数据库是为了与SCOP数据库(1.53版本)相对应,目前由110个多成员超家族和613个对应单成员的超家族组成。在多成员超家族中,序列同一性不超过25%的蛋白质链被用于比对,因此该数据库旨在处理代表SCOP 1.53版本下26219个蛋白质结构域的序列比对。基于结构的序列比对由COMPARER获得,初始等效性自动从MALIGN比对中提供,随后使用STAMP4.0进行扩充。最终的序列比对已使用JOY4.0对结构特征进行了注释。提供了几个到其他相关数据库和基因组序列亲属的有趣链接。尽管序列同一性较差且存在单成员超家族,但可靠的远缘相关蛋白质序列比对的可用性允许在文库中更好地对结构进行采样,以用于新序列的折叠识别以及理解各个超家族的蛋白质结构-功能关系。该数据库可以通过关键词查询,也可以通过序列搜索进行查询,通过PSI-BLAST方法进行接口。可以检索包括用户输入序列在内的所有超家族的结构注释序列比对和几个结构辅助文件。该数据库可从http://www.ncbs.res.in/%7Efaculty/mini/campass/pass.html访问。