Tauchi H, Matsuura S, Isomura M, Kinjo T, Nakamura A, Sakamoto S, Kondo N, Endo S, Komatsu K, Nakamura Y
Institute of Medical Science, The University of Tokyo, 4-6-1 Shirokanedai, Minato-ku, Tokyo, 108-8639, Japan.
Genomics. 1999 Jan 15;55(2):242-7. doi: 10.1006/geno.1998.5657.
An 800-kb region on chromosome 8q21, which complements the phenotype of cells from Nijmegen breakage syndrome patients, is a candidate for the locus of the underlying gene, termed NBS1. The sequence of this 800-kb region of DNA indicated that the size of this segment is 755,832 bp with an additional 36-kb gap. From this region, we identified four genes including NBS1, a gene coding for a 27-kDa vitamin D-dependent calcium-binding protein (27-kDa calbindin), the mitochondrial 2,4-dienoyl-CoA reductase gene, and a novel gene, C8orf1/hT41. All four genes were aligned in a 250-kb centromeric portion of the region, and no gene was found in the remaining telomeric portion containing 500 kb. The genomic organization of the C8orf1/hT41 and NBS1 genes has been analyzed using the computer programs GRAIL 2 and GENSCAN. They predicted and successfully found more than 93% of the exons, even a small 54-bp exon, indicating that one or more exons in any gene can be identified by these programs. GENSCAN was more efficient at locating the four genes than GRAIL 2 and identified 15 of the 16 exons of the NBS1 gene. This 800-kb region contained repetitive sequences, including 179 copies of the Alu sequence (1 copy/4.2 kb), 123 copies of the L1 sequence (1 copy/6.1 kb), 107 copies of the LTR sequence (1 copy/7.1 kb), and 63 copies of the MER sequence (1 copy/12 kb). There was a slight but not significant difference in the repetitive content of the gene-rich region and the remaining noncoding region. Our results indicate that computer-assisted methods are useful and powerful for identifying exons of both known and novel genes.
8号染色体8q21上一个800kb的区域可补充奈梅亨断裂综合征患者细胞的表型,是潜在基因(称为NBS1)位点的候选区域。该800kb DNA区域的序列表明,该片段大小为755,832bp,还有一个36kb的缺口。从该区域我们鉴定出四个基因,包括NBS1、一个编码27kDa维生素D依赖性钙结合蛋白(27kDa钙结合蛋白)的基因、线粒体2,4-二烯酰辅酶A还原酶基因以及一个新基因C8orf1/hT41。所有四个基因都排列在该区域250kb的着丝粒部分,在剩余的包含500kb的端粒部分未发现基因。使用计算机程序GRAIL 2和GENSCAN分析了C8orf1/hT41和NBS1基因的基因组结构。它们预测并成功找到了超过93%的外显子,甚至一个小的54bp外显子,这表明这些程序可以识别任何基因中的一个或多个外显子。GENSCAN在定位这四个基因方面比GRAIL 2更有效,并且鉴定出了NBS1基因16个外显子中的15个。这个800kb的区域包含重复序列,包括179个Alu序列拷贝(1个拷贝/4.2kb)、123个L1序列拷贝(1个拷贝/6.1kb)、107个LTR序列拷贝(1个拷贝/7.1kb)和63个MER序列拷贝(1个拷贝/12kb)。富含基因区域和其余非编码区域的重复含量存在轻微但不显著的差异。我们的结果表明,计算机辅助方法对于识别已知和新基因的外显子是有用且强大的。