Medvedeva Irina V, Demenkov Pavel S, Ivanisenko Vladimir A
BMC Genomics. 2015;16 Suppl 13(Suppl 13):S2. doi: 10.1186/1471-2164-16-S13-S2. Epub 2015 Dec 16.
Study of the relationship between the structural and functional organization of proteins and their coding genes is necessary for an understanding of the evolution of molecular systems and can provide new knowledge for many applications for designing proteins with improved medical and biological properties. It is well known that the functional properties of proteins are determined by their functional sites. Functional sites are usually represented by a small number of amino acid residues that are distantly located from each other in the amino acid sequence. They are highly conserved within their functional group and vary significantly in structure between such groups. According to this facts analysis of the general properties of the structural organization of the functional sites at the protein level and, at the level of exon-intron structure of the coding gene is still an actual problem.
One approach to this analysis is the projection of amino acid residue positions of the functional sites along with the exon boundaries to the gene structure. In this paper, we examined the discontinuity of the functional sites in the exon-intron structure of genes and the distribution of lengths and phases of the functional site encoding exons in vertebrate genes. We have shown that the DNA fragments coding the functional sites were in the same exons, or in close exons. The observed tendency to cluster the exons that code functional sites which could be considered as the unit of protein evolution. We studied the characteristics of the structure of the exon boundaries that code, and do not code, functional sites in 11 Metazoa species. This is accompanied by a reduced frequency of intercodon gaps (phase 0) in exons encoding the amino acid residue functional site, which may be evidence of the existence of evolutionary limitations to the exon shuffling.
These results characterize the features of the coding exon-intron structure that affect the functionality of the encoded protein and allow a better understanding of the emergence of biological diversity.
研究蛋白质的结构与功能组织及其编码基因之间的关系,对于理解分子系统的进化是必要的,并且可以为许多应用提供新知识,这些应用旨在设计具有改善的医学和生物学特性的蛋白质。众所周知,蛋白质的功能特性由其功能位点决定。功能位点通常由少数氨基酸残基表示,这些残基在氨基酸序列中彼此相距甚远。它们在其功能组内高度保守,并且在不同功能组之间结构差异很大。基于这些事实,在蛋白质水平以及编码基因的外显子 - 内含子结构水平上分析功能位点的结构组织的一般特性仍然是一个实际问题。
这种分析的一种方法是将功能位点的氨基酸残基位置以及外显子边界投影到基因结构上。在本文中,我们研究了基因外显子 - 内含子结构中功能位点的不连续性,以及脊椎动物基因中编码功能位点的外显子的长度和相位分布。我们已经表明,编码功能位点的DNA片段位于同一外显子中,或者在相邻外显子中。观察到的编码功能位点的外显子聚类趋势可被视为蛋白质进化的单位。我们研究了11种后生动物物种中编码和不编码功能位点的外显子边界结构的特征。这伴随着编码氨基酸残基功能位点的外显子中密码子间间隙(相位0)频率的降低,这可能是外显子重排存在进化限制的证据。
这些结果表征了影响编码蛋白质功能的编码外显子 - 内含子结构的特征,并有助于更好地理解生物多样性的出现。