基于结构和序列家族的蛋白质重复分类比较。

Comparison of protein repeat classifications based on structure and sequence families.

作者信息

Paladin Lisanna, Tosatto Silvio C E

机构信息

Department of Biomedical Sciences, University of Padua, viale G. Colombo 3, 35121 Padova, Italy.

Department of Biomedical Sciences, University of Padua, viale G. Colombo 3, 35121 Padova, Italy

出版信息

Biochem Soc Trans. 2015 Oct;43(5):832-7. doi: 10.1042/BST20150079.

DOI:10.1042/BST20150079

PMID:26517890

Abstract

Tandem repeats (TR) in proteins are common in nature and have several unique functions. They come in various forms that are frequently difficult to recognize from a sequence. A previously proposed structural classification has been recently implemented in the RepeatsDB database. This defines five main classes, mainly based on repeat unit length, with subclasses representing specific folds. Sequence-based classifications, such as Pfam, provide an alternative classification based on evolutionarily conserved repeat families. Here, we discuss a detailed comparison between the structural classes in RepeatsDB and the corresponding Pfam repeat families and clans. Most instances are found to map one-to-one between structure and sequence. Some notable exceptions such as leucine-rich repeats (LRRs) and α-solenoids are discussed.

摘要

蛋白质中的串联重复序列（TR）在自然界中很常见，并且具有多种独特功能。它们有多种形式，通常很难从序列中识别出来。之前提出的一种结构分类最近已在RepeatsDB数据库中实施。这定义了五个主要类别，主要基于重复单元长度，子类代表特定的折叠结构。基于序列的分类，如Pfam，提供了一种基于进化保守重复家族的替代分类方法。在这里，我们讨论了RepeatsDB中的结构类别与相应的Pfam重复家族和族之间的详细比较。发现大多数实例在结构和序列之间是一一对应的。还讨论了一些显著的例外情况，如富含亮氨酸的重复序列（LRR）和α-螺线管。