Genomic analysis of membrane protein families: abundance and conserved motifs.

Authors

Liu Yang, Engelman Donald M, Gerstein Mark

Affiliation

Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520-8114, USA.

Publication

Genome Biol. 2002 Sep 19;3(10):research0054. doi: 10.1186/gb-2002-3-10-research0054.

Abstract

BACKGROUND

Polytopic membrane proteins can be related to each other on the basis of the number of transmembrane helices and sequence similarities. Building on the Pfam classification of protein domain families, and using transmembrane-helix prediction and sequence-similarity searching, we identified a total of 526 well-characterized membrane protein families in 26 recently sequenced genomes. To this we added a clustering of a number of predicted but unclassified membrane proteins, resulting in a total of 637 membrane protein families.

RESULTS

Analysis of the occurrence and composition of these families revealed several interesting trends. The number of assigned membrane protein domains has an approximately linear relationship to the total number of open reading frames (ORFs) in the 26 genomes studied. Caenorhabditis elegans is an apparent outlier because of its high representation of seven-span transmembrane (7-TM) chemoreceptor families. In all genomes, including that of C. elegans, the number of distinct membrane protein families has a logarithmic relation to the number of ORFs. Glycine, proline, and tyrosine locations tend to be conserved in transmembrane regions within families, whereas isoleucine, valine, and methionine locations are relatively mutable. Analysis of motifs in putative transmembrane helices reveals that GxxxG and GxxxxxxG (which can be written GG4 and GG7, respectively; see Materials and methods) are among the most prevalent. Earlier studies had noted this prevalence; we now find that these motifs are also particularly well conserved within families, especially those corresponding to transporters, symporters, and channels.
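
To make the GG4/GG7 notation concrete, the following is a minimal Python sketch of counting such residue-pair motifs in a single helix. It is not the authors' pipeline: the function name `count_pair_motif` and the example sequence are hypothetical, and the sketch assumes that GG4 means a glycine at position i and another at position i+4 (GxxxG), and GG7 a glycine at i and at i+7 (GxxxxxxG), with overlapping occurrences counted separately.

```python
def count_pair_motif(helix: str, first: str, second: str, spacing: int) -> int:
    """Count positions i where helix[i] == first and helix[i + spacing] == second.

    Overlapping occurrences are counted separately (an assumption of this sketch).
    """
    return sum(
        1
        for i in range(len(helix) - spacing)
        if helix[i] == first and helix[i + spacing] == second
    )


# Hypothetical transmembrane segment, invented for illustration only.
helix = "ALGAVLGIAGGLLVGFIL"

gg4 = count_pair_motif(helix, "G", "G", 4)  # GxxxG
gg7 = count_pair_motif(helix, "G", "G", 7)  # GxxxxxxG
print(f"GG4: {gg4}, GG7: {gg7}")
```

The same function covers other residue pairs by changing `first`, `second`, and `spacing`. Assessing conservation within a family, as the abstract describes, would additionally require aligned family members, which this sketch does not attempt.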

CONCLUSIONS

We carried out a genome-wide analysis of patterns in the classified polytopic membrane protein families and analyzed the distribution of conserved amino acids and motifs in the transmembrane helix regions of these families.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5901/134483/ccc1663e7318/gb-2002-3-10-research0054-1.jpg
