Mancuso D J, Tuley E A, Westfield L A, Lester-Mancuso T L, Le Beau M M, Sorace J M, Sadler J E
Howard Hughes Medical Institute Laboratories, Department of Medicine, Washington University School of Medicine, St. Louis, Missouri 63110.
Biochemistry. 1991 Jan 8;30(1):253-69. doi: 10.1021/bi00215a036.
Structural analysis of the von Willebrand factor gene located on chromosome 12 is complicated by the presence of a partial unprocessed pseudogene on chromosome 22q11-13. The structures of the von Willebrand factor pseudogene and corresponding segment of the gene were determined, and methods were developed for the rapid differentiation of von Willebrand factor gene and pseudogene sequences. The pseudogene is 21-29 kilobases in length and corresponds to 12 exons (exons 23-34) of the von Willebrand factor gene. Approximately 21 kilobases of the gene and pseudogene were sequenced, including the 5' boundary of the pseudogene. The 3' boundary of the pseudogene lies within an 8-kb region corresponding to intron 34 of the gene. The presence of splice site and nonsense mutations suggests that the pseudogene cannot yield functional transcripts. The pseudogene has diverged approximately 3.1% in nucleotide sequence from the gene. This suggests a recent evolutionary origin approximately 19-29 million years ago, near the time of divergence of humans and apes from monkeys. Several repetitive sequences were identified, including 4 Alu, one Line-1, and several short simple sequence repeats. Several of these simple repeats differ in length between the gene and pseudogene and provide useful markers for distinguishing these loci. Sequence differences between the gene and pseudogene were exploited to design oligonucleotide primers for use in the polymerase chain reaction to selectivity amplify sequences corresponding to exons 23-34 from either the von Willebrand factor gene or the pseudogene. This method is useful for the analysis of gene defects in patients with von Willebrand disease, without interference from homologous sequences in the pseudogene.
位于12号染色体上的血管性血友病因子基因的结构分析因22q11 - 13号染色体上存在部分未加工的假基因而变得复杂。确定了血管性血友病因子假基因和相应基因片段的结构,并开发了快速区分血管性血友病因子基因和假基因序列的方法。该假基因长度为21 - 29千碱基,对应于血管性血友病因子基因的12个外显子(外显子23 - 34)。对该基因和假基因约21千碱基进行了测序,包括假基因的5'边界。假基因的3'边界位于对应于该基因内含子34的8千碱基区域内。剪接位点和无义突变的存在表明该假基因无法产生功能性转录本。该假基因与基因的核苷酸序列差异约为3.1%。这表明其在大约1900 - 2900万年前有一个较近的进化起源,接近人类和猿类从猴子分化的时间。鉴定出了几个重复序列,包括4个Alu、1个Line - 1以及几个短串联重复序列。这些简单重复序列中的几个在基因和假基因之间长度不同,为区分这些基因座提供了有用的标记。利用基因和假基因之间的序列差异设计寡核苷酸引物,用于聚合酶链反应,以选择性扩增来自血管性血友病因子基因或假基因的对应于外显子23 - 34的序列。该方法对于分析血管性血友病患者的基因缺陷很有用,不会受到假基因中同源序列的干扰。