Emmert S, Schneider T D, Khan S G, Kraemer K H
Basic Research Laboratory, National Cancer Institute, National Institutes of Health, Building 37 Room 3E24, Bethesda, MD 20892, USA,
Nucleic Acids Res. 2001 Apr 1;29(7):1443-52. doi: 10.1093/nar/29.7.1443.
Defects in the XPG DNA repair endonuclease gene can result in the cancer-prone disorders xeroderma pigmentosum (XP) or the XP-Cockayne syndrome complex. While the XPG cDNA sequence was known, determination of the genomic sequence was required to understand its different functions. In cells from normal donors, we found that the genomic sequence of the human XPG gene spans 30 kb, contains 15 exons that range from 61 to 1074 bp and 14 introns that range from 250 to 5763 bp. Analysis of the splice donor and acceptor sites using an information theory-based approach revealed three splice sites with low information content, which are components of the minor (U12) spliceosome. We identified six alternatively spliced XPG mRNA isoforms in cells from normal donors and from XPG patients: partial deletion of exon 8, partial retention of intron 8, two with alternative exons (in introns 1 and 6) and two that retained complete introns (introns 3 and 9). The amount of alternatively spliced XPG mRNA isoforms varied in different tissues. Most alternative splice donor and acceptor sites had a relatively high information content, but one has the U12 spliceosome sequence. A single nucleotide polymorphism has allele frequencies of 0.74 for 3507G and 0.26 for 3507C in 91 donors. The human XPG gene contains multiple splice sites with low information content in association with multiple alternatively spliced isoforms of XPG mRNA.
XPG DNA修复内切核酸酶基因的缺陷可导致易患癌症的疾病,如着色性干皮病(XP)或XP-科凯恩综合征复合体。虽然XPG cDNA序列已知,但需要确定基因组序列以了解其不同功能。在正常供体的细胞中,我们发现人类XPG基因的基因组序列跨度为30 kb,包含15个外显子,长度从61到1074 bp不等,以及14个内含子,长度从250到5763 bp不等。使用基于信息论的方法分析剪接供体和受体位点,发现了三个信息含量低的剪接位点,它们是次要(U12)剪接体的组成部分。我们在正常供体和XPG患者的细胞中鉴定出六种可变剪接的XPG mRNA异构体:外显子8部分缺失、内含子8部分保留、两种带有可变外显子(在内含子1和6中)以及两种保留完整内含子(内含子3和9)。可变剪接的XPG mRNA异构体的数量在不同组织中有所不同。大多数可变剪接供体和受体位点具有相对较高的信息含量,但有一个具有U12剪接体序列。在91名供体中,单核苷酸多态性3507G的等位基因频率为0.74,3507C的等位基因频率为0.26。人类XPG基因包含多个信息含量低的剪接位点,并与XPG mRNA的多个可变剪接异构体相关。