Suppr超能文献

人类聚集蛋白聚糖基因的结构:外显子-内含子组织及其与蛋白质结构域的关联。

Structure of the human aggrecan gene: exon-intron organization and association with the protein domains.

作者信息

Valhmu W B, Palmer G D, Rivers P A, Ebara S, Cheng J F, Fischer S, Ratcliffe A

机构信息

Department of Orthopaedic Surgery, Columbia University, New York, NY 10032, USA.

出版信息

Biochem J. 1995 Jul 15;309 ( Pt 2)(Pt 2):535-42. doi: 10.1042/bj3090535.

Abstract

The complete exon-intron organization of the human aggrecan gene has been defined, and the exon organization has been compared with the individual domains of the protein core. A yeast artificial chromosome containing the aggrecan gene was selected from the Centre d'Etude du Polymorphisme Humaine yeast artificial chromosome library. A cosmid sulibrary was created from this, and direct sequencing of individual cosmids was used to provide the exon-intron organization. The human aggrecan gene was found to be composed of 19 exons ranging in size from 77 to 4224 bp. Exon 1 is non-coding, whereas exons 2-19 code for a protein core of 2454 amino acids with a calculated mass of 254379 Da. Intron 1 of the gene is at least 13 kb. Overall, the sizes of the 18 introns range from 0.5 to greater than 13 kb. Each intron begins with a GT and ends with an AG, thus obeying the GT/AG rule of splice-junction sequences. The entire coding region is contained in 39.4 kb of the gene. The organization of exons is strongly related to the specific domains of the protein core. The A loop of G1 and the interglobular domain are encoded by exons 3 and 7 respectively. The B and B' loops of G1 are encoded by exons 4-6, and those of G2 are encoded by exons 8-10. These sets of exons, coding for the B and B' loops, are identical in size and organization. This is supported by the intron classes associated with these exons. Exon 11 codes for the 5' half of the keratan sulphate-rich region, and exon 12 codes for the 3' half of the keratan sulphate-rich region as well as the entire chondroitin sulphate-rich region. G3 is encoded by exons 13-18, including the alternatively spliced epidermal growth factor-like and complement regulatory protein-like domains. The correspondence between the exon organization and the protein domains argues strongly for modular assembly of the aggrecan gene.

摘要

人类聚集蛋白聚糖基因完整的外显子 - 内含子结构已被确定,并且外显子结构已与蛋白核心的各个结构域进行了比较。从人类多态性研究中心酵母人工染色体文库中筛选出一个包含聚集蛋白聚糖基因的酵母人工染色体。由此构建了一个黏粒亚文库,并通过对单个黏粒进行直接测序来确定外显子 - 内含子结构。发现人类聚集蛋白聚糖基因由19个外显子组成,大小从77到4224 bp不等。外显子1是非编码的,而外显子2 - 19编码一个由2454个氨基酸组成的蛋白核心,计算分子量为254379 Da。该基因的内含子1至少有13 kb。总体而言,18个内含子的大小范围从0.5到大于13 kb。每个内含子以GT起始,以AG结束,因此遵循剪接连接序列的GT/AG规则。整个编码区包含在该基因的39.4 kb中。外显子的结构与蛋白核心的特定结构域密切相关。G1的A环和球间结构域分别由外显子3和7编码。G1的B环和B'环由外显子4 - 6编码,G2的B环和B'环由外显子8 - 10编码。这些编码B环和B'环的外显子组在大小和结构上是相同的。与这些外显子相关的内含子类别也支持这一点。外显子11编码富含硫酸角质素区域的5'半部分,外显子12编码富含硫酸角质素区域的3'半部分以及整个富含硫酸软骨素的区域。G3由外显子13 - 18编码,包括选择性剪接的表皮生长因子样和补体调节蛋白样结构域。外显子结构与蛋白结构域之间的对应关系有力地支持了聚集蛋白聚糖基因的模块化组装。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4062/1135764/36ec23762ade/biochemj00059-0176-a.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验