Regec A, Quadros E V, Platica O, Rothenberg S P
Division of Hematology/Oncology, State University of New York (SUNY)-Health Science Center, Brooklyn 11203, USA.
Blood. 1995 May 15;85(10):2711-9.
Transcobalamin II (TCII) is a plasma protein that binds vitamin B12 (cobalamin; Cbl) and facilitates the cellular uptake of the vitamin by receptor-mediated endocytosis. In genetic disorders that are characterized by congenital deficiency of TCII, intracellular Cbl deficiency occurs, resulting in an early onset of megaloblastic anemia that is sometimes accompanied by a neurologic disorder. To define the genetic basis for TCII deficiency, we have cloned and characterized the human gene that encodes this protein. The gene spans a minimum of 18 kbp and contains nine exons and eight introns, with a polyadenylation signal sequence located 509 bp downstream from the termination codon and a transcription initiation site beginning 158 bp upstream from the ATG translation start site. The 5' flanking DNA does not have a TATA or CCAAT regulatory element, but a 34-nucleotide stretch beginning just upstream of the CAP site contains four tandemly organized 5'-CCCC-3' tetramers. This sequence is a motif for a trans-active transcription factor (ETF) that regulates expression of the epidermal growth factor receptor gene (EGFR), which also lacks TATA and CCAAT regulatory elements. A GC-rich sequence that binds the SP1 protein is located 356 nucleotides upstream from the first of the series of CCCC tetramers. Although this GC sequence is at an unusual location with respect to the CAP site, a 507-bp fragment containing this GC box drives the chloramphenicol acetyltransferase (CAT) reporter gene after transient transfection into NIH 3T3 cells. No CAT activity was observed when a 420-bp fragment lacking this GC box but containing the ETF-binding domains was similarly transfected into this cell line. One consensus and two atypical motifs for the c-myc ligand are located downstream and upstream, respectively, of the GC box, and this could explain the elevated plasma TCII observed in some patients with multiple myeloma, as the c-myc product is overexpressed in some myeloma cells. Restriction endonuclease digestion of genomic DNA from eight normal subjects with Taq I, Hinfl, Msp I, and Bgl I identified three patterns of restriction fragment length polymorphism (RFLP). A number of the exon/intron splice junctions of human TCII, TCI, and IF genes are located in homologous regions of these proteins, providing evidence that these genes have evolved by duplication of an ancestral gene. This characterization of the TCII gene and the RFLP should facilitate the identification of the mutation(s) responsible for the genetic abnormalities of TCII expression.
转钴胺素II(TCII)是一种血浆蛋白,它能结合维生素B12(钴胺素;Cbl),并通过受体介导的内吞作用促进细胞对该维生素的摄取。在以先天性TCII缺乏为特征的遗传性疾病中,细胞内Cbl缺乏会发生,导致巨幼细胞贫血的早期发作,有时还伴有神经系统疾病。为了确定TCII缺乏的遗传基础,我们克隆并鉴定了编码该蛋白的人类基因。该基因跨度至少为18kbp,包含9个外显子和8个内含子,一个多聚腺苷酸化信号序列位于终止密码子下游509bp处,转录起始位点始于ATG翻译起始位点上游158bp处。5'侧翼DNA没有TATA或CCAAT调控元件,但在帽位点上游开始的一个34个核苷酸的片段包含四个串联排列的5'-CCCC-3'四聚体。这个序列是一种反式作用转录因子(ETF)的基序,该转录因子调节表皮生长因子受体基因(EGFR)的表达,而EGFR也缺乏TATA和CCAAT调控元件。一个与SP1蛋白结合的富含GC的序列位于一系列CCCC四聚体中第一个四聚体上游356个核苷酸处。尽管这个GC序列相对于帽位点处于一个不寻常的位置,但一个包含这个GC框的507bp片段在瞬时转染到NIH 3T3细胞后能驱动氯霉素乙酰转移酶(CAT)报告基因。当一个缺乏这个GC框但包含ETF结合结构域的420bp片段以同样方式转染到该细胞系中时,未观察到CAT活性。c-myc配体的一个共有基序和两个非典型基序分别位于GC框的下游和上游,这可以解释在一些多发性骨髓瘤患者中观察到的血浆TCII升高,因为c-myc产物在一些骨髓瘤细胞中过度表达。用Taq I、Hinfl、Msp I和Bgl I对8名正常受试者的基因组DNA进行限制性内切酶消化,确定了三种限制性片段长度多态性(RFLP)模式。人类TCII、TCI和IF基因的许多外显子/内含子剪接连接位于这些蛋白质的同源区域,这为这些基因是通过一个祖先基因的复制进化而来提供了证据。对TCII基因的这种表征和RFLP应该有助于鉴定导致TCII表达遗传异常的突变。