Liu G, McDaniel T K, Falkow S, Karlin S
Department of Mathematics, Stanford University, Stanford, CA 94305-2125, USA.
Proc Natl Acad Sci U S A. 1999 Jun 8;96(12):7011-6. doi: 10.1073/pnas.96.12.7011.
The severity of Helicobacter pylori-related disease is correlated with a pathogenicity island (the Cag region of about 26 genes) whose presence is associated with the up-regulation of an IL-8 cytokine inflammatory response in gastric epithelial cells. Statistical analysis of the Cag gene sequences calculated from the complete genome of strain 26695 revealed several unusual features. The Cag7 sequence (1,927 aa) has two repeat regions. Repeat region I runs 317 aa in a form of AAA proximal to the protein N terminal; repeat region II extends 907 aa in the middle of the protein sequence consisting of 74 contiguous segments composed from selections among six consensus sequences and includes 58 regularly distributed cysteine residues with consecutive cysteines mostly 12, 18, or 24 aa apart. This "regular" cysteine arrangement may provide a scaffolding of linker elements stabilized by disulfide bridges. When Cag7 homologues from different strains are compared, differences were found almost exclusively in the repeat regions, resulting from deletion and/or insertion of repeating units. These observations suggest that the anomalous repetitive structure of the sequence plays an important role in the conformation of Cag7 gene product and potentially in the function of the pathogenicity island. Other facets of the Cag7 sequence show significant charge clusters, high multiplet count, and extremes of amino acid usage.
幽门螺杆菌相关疾病的严重程度与一个致病岛(约26个基因的Cag区域)相关,该致病岛的存在与胃上皮细胞中白细胞介素-8细胞因子炎症反应的上调有关。对从菌株26695的全基因组计算得出的Cag基因序列进行统计分析,发现了几个不同寻常的特征。Cag7序列(1927个氨基酸)有两个重复区域。重复区域I在靠近蛋白质N端的位置以AAA形式延伸317个氨基酸;重复区域II在蛋白质序列中部延伸907个氨基酸,由六个共有序列中的选择组成的74个连续片段组成,包括58个规则分布的半胱氨酸残基,相邻半胱氨酸大多相隔12、18或24个氨基酸。这种“规则”的半胱氨酸排列可能提供了一个由二硫键稳定的连接元件支架。当比较来自不同菌株的Cag7同源物时,几乎只在重复区域发现差异,这是由重复单元的缺失和/或插入导致的。这些观察结果表明,该序列异常的重复结构在Cag7基因产物的构象中起重要作用,并且可能在致病岛的功能中起重要作用。Cag7序列的其他方面显示出显著的电荷簇、高多重性计数以及氨基酸使用的极端情况。