Sahashi Kentaro, Masuda Akio, Matsuura Tohru, Shinmi Jun, Zhang Zhujun, Takeshima Yasuhiro, Matsuo Masafumi, Sobue Gen, Ohno Kinji
Division of Neurogenetics and Bioinformatics, Center for Neurological Diseases and Cancer, Nagoya University Graduate School of Medicine, Nagoya, Japan.
Nucleic Acids Res. 2007;35(18):5995-6003. doi: 10.1093/nar/gkm647. Epub 2007 Aug 28.
We have found that two previously reported exonic mutations in the PINK1 and PARK7 genes affect pre-mRNA splicing. To develop an algorithm to predict underestimated splicing consequences of exonic mutations at the 5' splice site, we constructed and analyzed 31 minigenes carrying exonic splicing mutations and their derivatives. We also examined 189,249 U2-dependent 5' splice sites of the entire human genome and found that a new variable, the SD-Score, which represents a common logarithm of the frequency of a specific 5' splice site, efficiently predicts the splicing consequences of these minigenes. We also employed the information contents (R(i)) to improve the prediction accuracy. We validated our algorithm by analyzing 32 additional minigenes as well as 179 previously reported splicing mutations. The SD-Score algorithm predicted aberrant splicings in 198 of 204 sites (sensitivity = 97.1%) and normal splicings in 36 of 38 sites (specificity = 94.7%). Simulation of all possible exonic mutations at positions -3, -2 and -1 of the 189 249 sites predicts that 37.8, 88.8 and 96.8% of these mutations would affect pre-mRNA splicing, respectively. We propose that the SD-Score algorithm is a practical tool to predict splicing consequences of mutations affecting the 5' splice site.
我们发现,先前报道的PINK1和PARK7基因中的两个外显子突变会影响前体mRNA剪接。为了开发一种算法来预测5'剪接位点外显子突变被低估的剪接后果,我们构建并分析了31个携带外显子剪接突变及其衍生物的微型基因。我们还检查了整个人类基因组中189,249个依赖U2的5'剪接位点,发现一个新的变量SD-Score(代表特定5'剪接位点频率的常用对数)能够有效地预测这些微型基因的剪接后果。我们还利用信息含量(R(i))来提高预测准确性。我们通过分析另外32个微型基因以及179个先前报道的剪接突变来验证我们的算法。SD-Score算法在204个位点中的198个预测到异常剪接(敏感性 = 97.1%),在38个位点中的36个预测到正常剪接(特异性 = 94.7%)。对189,249个位点的-3、-2和-1位置处所有可能的外显子突变进行模拟预测,这些突变中分别有37.8%、88.8%和96.8%会影响前体mRNA剪接。我们提出,SD-Score算法是预测影响5'剪接位点突变剪接后果的实用工具。