Exposito J Y, Garrone R
Institute of Biology and Chemistry of Proteins, Centre National de la Recherche Scientifique, Lyons, France.
Proc Natl Acad Sci U S A. 1990 Sep;87(17):6669-73. doi: 10.1073/pnas.87.17.6669.
We have characterized cDNA and genomic clones coding for a sponge collagen. The partial cDNA has an open reading frame encoding 547 amino acid residues. The conceptual translation product contains a probably incomplete triple-helical domain (307 amino acids) with one Gly-Xaa-Yaa-Zaa imperfection in the otherwise perfect Gly-Xaa-Yaa repeats and a carboxyl propeptide (240 amino acids) that includes 7 cysteine residues. Amino acid sequence comparisons indicate that this sponge collagen is homologous to vertebrate and sea urchin fibrillar collagens. Partial characterization of the corresponding gene reveals an intron-exon organization clearly related to the fibrillar collagen gene family. The exons coding for the triple-helical domain are 54 base pairs (bp) or multiples thereof, except for a 57-bp exon containing the Gly-Xaa-Yaa-Zaa coding sequence and for two unusual exons of 126 and 18 bp, respectively. This latter 18-bp exon marks the end of the triple-helical domain, contrary to the other known fibrillar collagen genes that contain exons coding for the junction between the triple-helical domain and the carboxyl propeptide. Compared to other fibrillar collagen genes, the introns are remarkably small. Hybridization to blotted RNAs established that the gene transcript is 4.9 kilobases. Together with previous results that showed the existence of a nonfibrillar collagen in the same species, these data demonstrate that at least two collagen gene families are represented in the most primitive metazoa.
我们已对编码一种海绵胶原蛋白的cDNA和基因组克隆进行了特征分析。部分cDNA具有一个开放阅读框,编码547个氨基酸残基。概念翻译产物包含一个可能不完整的三螺旋结构域(307个氨基酸),在原本完美的Gly-Xaa-Yaa重复序列中有一个Gly-Xaa-Yaa-Zaa缺陷,以及一个羧基前肽(240个氨基酸),其中包括7个半胱氨酸残基。氨基酸序列比较表明,这种海绵胶原蛋白与脊椎动物和海胆的纤维状胶原蛋白同源。对相应基因的部分特征分析揭示了一种与纤维状胶原蛋白基因家族明显相关的内含子-外显子组织。编码三螺旋结构域的外显子为54个碱基对(bp)或其倍数,但有一个包含Gly-Xaa-Yaa-Zaa编码序列的57-bp外显子以及两个分别为126 bp和18 bp的异常外显子除外。后一个18-bp外显子标志着三螺旋结构域的末端,这与其他已知的纤维状胶原蛋白基因不同,后者包含编码三螺旋结构域与羧基前肽之间连接区域的外显子。与其他纤维状胶原蛋白基因相比,这些内含子非常小。与印迹RNA的杂交表明该基因转录本为4.9千碱基。结合之前显示同一物种中存在非纤维状胶原蛋白的结果,这些数据表明在最原始的后生动物中至少代表了两个胶原蛋白基因家族。