Kiss I, Deák F, Holloway R G, Delius H, Mebust K A, Frimberger E, Argraves W S, Tsonis P A, Winterbottom N, Goetinck P F
Institute of Biochemistry, Biological Research Center of the Hungarian Academy of Sciences, Szeged.
J Biol Chem. 1989 May 15;264(14):8126-34.
The entire gene for chicken cartilage matrix protein (CMP) has been isolated and characterized by restriction mapping, electron microscopy, nuclease S1 mapping, and sequence analysis. The gene, which is present in a single copy in the chicken genome, is 18 kilobase pairs long and comprises eight exons and seven introns. It has two transcription initiation sites, 8 base pairs from each other. A sequence very homologous to the consensus nuclear factor III binding-site sequence, a CAT- and a TATA-like sequence are found in the promoter region and ATTAAA is used as a polyadenylation signal. The nucleotide sequence defines a primary translation product of 493 amino acids which consists of a 23-amino acid signal peptide and two large repeated domains connected by an epidermal growth factor module. Amino acid sequences homologous to those of the repeated domains are present in the type A repeats of von Willebrand factor, complement factors B and C2, and in the alpha chains of the integrins Mac-1, p150,95, and LFA-1. The exon-intron structure indicates that the CMP gene may have arisen by exon duplication and exon shuffling during evolution. The GT-AG splice rule cannot be applied for the excision of the last intron of the CMP pre-mRNA. The donor splice site of intron G is basically different from the consensus sequence indicating that a novel type of splicing mechanism might exist in cartilage.
鸡软骨基质蛋白(CMP)的完整基因已通过限制性图谱分析、电子显微镜观察、核酸酶S1图谱分析和序列分析进行了分离和鉴定。该基因在鸡基因组中以单拷贝形式存在,长度为18千碱基对,由八个外显子和七个内含子组成。它有两个转录起始位点,彼此相距8个碱基对。在启动子区域发现了一个与共有核因子III结合位点序列非常同源的序列、一个CAT样序列和一个TATA样序列,并且ATTAAA被用作聚腺苷酸化信号。核苷酸序列确定了一个由493个氨基酸组成的初级翻译产物,该产物由一个23个氨基酸的信号肽和两个由表皮生长因子模块连接的大的重复结构域组成。与重复结构域氨基酸序列同源的序列存在于血管性血友病因子的A类重复序列、补体因子B和C2中,以及整合素Mac-1、p150,95和LFA-1的α链中。外显子-内含子结构表明,CMP基因可能是在进化过程中通过外显子重复和外显子重排产生的。GT-AG剪接规则不适用于CMP前体mRNA最后一个内含子的切除。内含子G的供体剪接位点与共有序列基本不同,这表明软骨中可能存在一种新型的剪接机制。