Institute of Clinical Medicine, National Yang-Ming University, Taipei, Taiwan.
Institute of Biomedical Sciences, Academia Sinica, Taipei, Taiwan.
Hum Genomics. 2017 Nov 9;11(1):27. doi: 10.1186/s40246-017-0122-6.
The human genome contains millions of single nucleotide polymorphisms (SNPs); many of these SNPs are intronic and have unknown functional significance. SNPs occurring within intron branchpoint sites, especially at the adenine (A), would presumably affect splicing; however, this has not been systematically studied. We employed a splicing prediction tool to identify human intron branchpoint sites and screened dbSNP for identifying SNPs located in the predicted sites to generate a genome-wide branchpoint site SNP database.
We identified 600 SNPs located within branchpoint sites; among which, 216 showed a change in A. After scoring the SNPs by counting the As in the ± 10 nucleotide region, only four SNPs were identified without additional As (rs13296170, rs12769205, rs75434223, and rs67785924). Using minigene constructs, we examined the effects of these SNPs on splicing. The three SNPs (rs13296170, rs12769205, and rs75434223) with nucleotide substitution at the A position resulted in abnormal splicing (exon skipping and/or intron inclusion). However, rs67785924, a 5-bp deletion that abolished the branchpoint A nucleotide, exhibited normal RNA splicing pattern, presumably using two of the downstream As as alternative branchpoints. The influence of additional As on splicing was further confirmed by studying rs2733532, which contains three additional As in the ± 10 nucleotide region.
We generated a high-confidence genome-wide branchpoint site SNP database, experimentally verified the importance of A in the branchpoint, and suggested that other nearby As can protect branchpoint A substitution from abnormal splicing.
人类基因组包含数百万个单核苷酸多态性(SNPs);其中许多 SNPs 是内含子的,具有未知的功能意义。发生在内含子分支点位点的 SNPs,特别是腺嘌呤(A),可能会影响剪接;然而,这尚未得到系统研究。我们使用剪接预测工具来识别人类内含子分支点位点,并在 dbSNP 中筛选出位于预测位点的 SNPs,以生成全基因组分支点 SNP 数据库。
我们确定了 600 个位于分支点位点的 SNPs;其中,216 个显示 A 发生变化。通过计算±10 个核苷酸区域内的 As 来对 SNPs 进行评分后,只有四个 SNPs 没有额外的 As(rs13296170、rs12769205、rs75434223 和 rs67785924)。使用小基因构建体,我们研究了这些 SNPs 对剪接的影响。三个 SNPs(rs13296170、rs12769205 和 rs75434223)在 A 位置发生核苷酸取代导致异常剪接(外显子跳过和/或内含子包含)。然而,rs67785924 是一个 5 个碱基的缺失,消除了分支点 A 核苷酸,表现出正常的 RNA 剪接模式,可能使用下游的两个 A 作为替代分支点。通过研究包含±10 个核苷酸区域内的三个额外 As 的 rs2733532,进一步证实了额外 As 对剪接的影响。
我们生成了一个高可信度的全基因组分支点 SNP 数据库,实验验证了 A 在分支点中的重要性,并表明附近的其他 As 可以保护分支点 A 取代免受异常剪接的影响。