Desmet François-Olivier, Hamroun Dalil, Lalande Marine, Collod-Béroud Gwenaëlle, Claustres Mireille, Béroud Christophe
INSERM, U827, Montpellier, France.
Nucleic Acids Res. 2009 May;37(9):e67. doi: 10.1093/nar/gkp215. Epub 2009 Apr 1.
Thousands of mutations are identified yearly. Although many directly affect protein expression, an increasing proportion of mutations is now believed to influence mRNA splicing. They mostly affect existing splice sites, but synonymous, non-synonymous or nonsense mutations can also create or disrupt splice sites or auxiliary cis-splicing sequences. To facilitate the analysis of the different mutations, we designed Human Splicing Finder (HSF), a tool to predict the effects of mutations on splicing signals or to identify splicing motifs in any human sequence. It contains all available matrices for auxiliary sequence prediction as well as new ones for binding sites of the 9G8 and Tra2-beta Serine-Arginine proteins and the hnRNP A1 ribonucleoprotein. We also developed new Position Weight Matrices to assess the strength of 5' and 3' splice sites and branch points. We evaluated HSF efficiency using a set of 83 intronic and 35 exonic mutations known to result in splicing defects. We showed that the mutation effect was correctly predicted in almost all cases. HSF could thus represent a valuable resource for research, diagnostic and therapeutic (e.g. therapeutic exon skipping) purposes as well as for global studies, such as the GEN2PHEN European Project or the Human Variome Project.
每年都会鉴定出数千种突变。尽管许多突变直接影响蛋白质表达,但现在人们认为越来越多的突变会影响mRNA剪接。它们大多影响现有的剪接位点,但同义突变、非同义突变或无义突变也可能产生或破坏剪接位点或辅助顺式剪接序列。为便于分析不同的突变,我们设计了人类剪接预测器(HSF),这是一种用于预测突变对剪接信号的影响或识别任何人源序列中的剪接基序的工具。它包含用于辅助序列预测的所有可用矩阵,以及针对9G8和Tra2-β丝氨酸-精氨酸蛋白及hnRNP A1核糖核蛋白结合位点的新矩阵。我们还开发了新的位置权重矩阵来评估5'和3'剪接位点及分支点的强度。我们使用一组已知会导致剪接缺陷的83个内含子突变和35个外显子突变评估了HSF的效率。我们表明,几乎在所有情况下都能正确预测突变效应。因此,HSF可成为用于研究、诊断和治疗(如治疗性外显子跳跃)目的以及全球研究(如欧洲GEN2PHEN项目或人类变异组项目)的宝贵资源。