Ibrahim El Chérif, Hims Matthew M, Shomron Noam, Burge Christopher B, Slaugenhaupt Susan A, Reed Robin
Department of Cell Biology, Harvard Medical School, Boston, Massachusetts 02115, USA.
Hum Mutat. 2007 Jan;28(1):41-53. doi: 10.1002/humu.20401.
Splicing mutations that lead to devastating genetic diseases are often located in nonconserved or weakly conserved sequences that normally do not affect splicing. Thus, the underlying reason for the splicing defect is not immediately obvious. An example of this phenomenon is observed in the neurodevelopmental disease familial dysautonomia (FD), which is caused by a single-base change in the 5' splice site (5'ss) of intron 20 in the IKBKAP gene (c.2204+6T>C). This mutation, which is in the sixth position of the intron and results in exon 20 skipping, has no phenotype in many other introns. To determine why the position 6 mutation causes aberrant splicing only in certain cases, we first used an in silico approach to identify potential sequences involved in exon 20 skipping. Computational analyses of the exon 20 5'ss itself predicted that this nine-nucleotide splicing signal, even when it contains the T>C mutation, is not sufficiently weak to explain the FD phenotype. However, the computational analysis predicted that both the upstream 3' splice site (3'ss) and exon 20 contain weak splicing signals, indicating that the FD 5'ss, together with the surrounding splicing signals, are not adequate for defining exon 20. These in silico predictions were corroborated using IKBKAP minigenes in a new rapid and simple in vitro coupled RNA polymerase (RNAP) II transcription/splicing assay. Finally, the weak splicing signals that flank the T>C mutation were validated as the underlying cause of familial dysautonomia in vivo using transient transfection assays. Together, our study demonstrates the general utility of combining in silico data with an in vitro RNAP II transcription/splicing system for rapidly identifying critical sequences that underlie the numerous splicing diseases caused by otherwise silent mutations.
导致毁灭性遗传疾病的剪接突变通常位于非保守或弱保守序列中,这些序列通常不会影响剪接。因此,剪接缺陷的根本原因并不立即明显。在神经发育疾病家族性自主神经功能异常(FD)中观察到了这种现象,它由IKBKAP基因第20内含子5'剪接位点(5'ss)的单碱基变化(c.2204+6T>C)引起。该突变位于内含子的第六位,导致外显子20跳跃,在许多其他内含子中没有表型。为了确定6号位突变为何仅在某些情况下导致异常剪接,我们首先使用计算机方法来识别与外显子20跳跃相关的潜在序列。对外显子20自身5'ss的计算分析预测,即使这个九核苷酸剪接信号包含T>C突变,它也不够弱,无法解释FD表型。然而,计算分析预测上游3'剪接位点(3'ss)和外显子20都包含弱剪接信号,这表明FD的5'ss与周围的剪接信号一起,不足以界定外显子20。在一种新的快速简单的体外偶联RNA聚合酶(RNAP)II转录/剪接试验中,使用IKBKAP小基因证实了这些计算机预测。最后,通过瞬时转染试验在体内验证了T>C突变侧翼的弱剪接信号是家族性自主神经功能异常的根本原因。我们的研究共同证明了将计算机数据与体外RNAP II转录/剪接系统相结合,用于快速识别由其他沉默突变引起的众多剪接疾病潜在关键序列的普遍实用性。