Zhang Theresa, Haws Peter, Wu Qiang
Department of Bioinformatics, Merck Research Labs, Rahway, New Jersey 07065, USA.
Genome Res. 2004 Jan;14(1):79-89. doi: 10.1101/gr.1225204. Epub 2003 Dec 12.
A large family of neural protocadherin (Pcdh) proteins is encoded by three closely linked mammalian gene clusters (alpha, beta, and gamma). Pcdh alpha and gamma clusters have a striking genomic organization. Specifically, each "variable" exon is spliced to a common set of downstream "constant" exons within each cluster. Recent studies demonstrated that the cell-specific expression of each Pcdh gene is determined bya combination of variable-exon promoter activation and cis-splicing of the corresponding variable exon to the first constant exon. To determine whether there are other similarly organized gene clusters in mammalian genomes, we performed a genome-wide search and identified a large number of mammalian genes containing multiple variable first exons. Here we describe several clusters that contain about a dozen variable exons arrayed in tandem, including UDP glucuronosyltransferase (UGT1), plectin, neuronal nitric oxide synthase (NOS1), and glucocorticoid receptor (GR) genes. In all these cases, multiple variable first exons are each spliced to a common set of downstream constant exons to generate diverse functional mRNAs. As an example, we analyzed the tissue-specific expression profile of the mouse UGT1 repertoire and found that multiple isoforms are expressed in a tissue-specific manner. Therefore, this variable and constant genomic organization provides a genetic mechanism for directing distinct cell- and tissue-specific patterns of gene expression.
一大类神经原钙黏蛋白(Pcdh)由三个紧密相连的哺乳动物基因簇(α、β和γ)编码。Pcdhα和γ基因簇具有显著的基因组组织形式。具体而言,每个“可变”外显子都与每个基因簇内一组共同的下游“恒定”外显子进行剪接。最近的研究表明,每个Pcdh基因的细胞特异性表达是由可变外显子启动子激活以及相应可变外显子与第一个恒定外显子的顺式剪接共同决定的。为了确定哺乳动物基因组中是否存在其他类似组织形式的基因簇,我们进行了全基因组搜索,并鉴定出大量包含多个可变首个外显子的哺乳动物基因。在此,我们描述了几个包含约十二个串联排列的可变外显子的基因簇,包括尿苷二磷酸葡萄糖醛酸基转移酶(UGT1)、网蛋白、神经元型一氧化氮合酶(NOS1)和糖皮质激素受体(GR)基因。在所有这些情况下,多个可变首个外显子各自与一组共同的下游恒定外显子进行剪接,以产生多样的功能性mRNA。例如,我们分析了小鼠UGT1基因库的组织特异性表达谱,发现多种异构体以组织特异性方式表达。因此,这种可变和恒定的基因组组织形式为指导不同的细胞和组织特异性基因表达模式提供了一种遗传机制。