Fields C
Computing Research Laboratory, New Mexico State University, Las Cruces 88003-0001.
J Mol Evol. 1988;28(1-2):55-63. doi: 10.1007/BF02143497.
The amino acid (aa) sequences of the polypeptides encoded by five collagen genes of the nematode Caenorhabditis elegans, col-6, col-7 (partial), col-8, col-14, and col-19, were determined. These collagen polypeptides, as well as those encoded by the previously sequenced C. elegans collagen genes col-1 and col-2, share a common organization into five domains: an amino-terminal leader, a short (30-33 aa) (Gly-X-Y)n domain, a non(Gly-X-Y) spacer, a long (127-132 aa) (Gly-X-Y)n domain, and a short carboxyl-terminal domain. The domain organizations and intron positions of these polypeptides were compared with those of the polypeptides encoded by Drosophila and Strongylocentrotus type IV, and vertebrate types I, II, III, IV, and IX collagen genes; the C. elegans collagen polypeptides are most similar to the vertebrate type IX collagens. It is suggested that the collagen gene family comprises two divergent subfamilies, one of which includes the vertebrate interstitial collagen genes, and the other of which includes the invertebrate collagen genes and the vertebrate type IV and type IX collagen genes. Only the vertebrate interstitial collagen genes display clear evidence of evolution via the tandem duplication of a 54-bp exon.
测定了秀丽隐杆线虫5种胶原蛋白基因(col-6、col-7(部分)、col-8、col-14和col-19)所编码的多肽的氨基酸(aa)序列。这些胶原蛋白多肽,以及先前测序的秀丽隐杆线虫胶原蛋白基因col-1和col-2所编码的多肽,共有一个由五个结构域组成的共同结构:一个氨基末端前导序列、一个短的(30 - 33个氨基酸)(甘氨酸- X - 酪氨酸)n结构域、一个非(甘氨酸- X - 酪氨酸)间隔区、一个长的(127 - 132个氨基酸)(甘氨酸- X - 酪氨酸)n结构域和一个短的羧基末端结构域。将这些多肽的结构域组织和内含子位置与果蝇和海胆IV型以及脊椎动物I型、II型、III型、IV型和IX型胶原蛋白基因所编码的多肽进行了比较;秀丽隐杆线虫的胶原蛋白多肽与脊椎动物IX型胶原蛋白最为相似。有人提出,胶原蛋白基因家族包括两个不同的亚家族,其中一个包括脊椎动物的间质胶原蛋白基因,另一个包括无脊椎动物的胶原蛋白基因以及脊椎动物的IV型和IX型胶原蛋白基因。只有脊椎动物的间质胶原蛋白基因通过一个54bp外显子的串联重复显示出明显的进化证据。