Zhang Mengwen, Guo Jing, Li Bin, Liu Kang, Zhao Jiayuan, Zhang Jiayuan, Lin Xuqing, Tang Bin, Wang Jie, Liao Weiping, He Na
Department of Neurology, Institute of Neuroscience, Key Laboratory of Neurogenetics and Channelopathies of Guangdong Province and the Ministry of Education of China, The Second Affiliated Hospital, Guangzhou Medical University, Guangzhou, 510260, China.
Department of Neurology, The Guangdong 999 Brain Hospital, Guangzhou, 510510, China.
Acta Epileptol. 2024 Nov 11;6(1):37. doi: 10.1186/s42494-024-00178-z.
SCN1A is the most well-recognized and commonly mutated gene related to epilepsy. This study analyzed the characteristic spatial and frequency distributions of SCN1A mutations, aiming to provide important insight into the mutagenesis etiopathology of SCN1A-associated epilepsy.
Epilepsy-associated SCN1A variants were retrieved from the SCN1A mutation database, the HGMD database, and literature reviews. The base substitutions, mutation frequencies in CpG dinucleotides, and spatial distributions of mutations in terms of exons and structural domains were analyzed.
A total of 2621 SCN1A variants were identified in 5106 unrelated cases. The most common type was missense mutation, followed by frameshift mutations and splice site mutations. Among the missense mutations, transitions within CpG dinucleotides were much more recurrently identified than transitions within non-CpG dinucleotides, and the most common type was the G > A transition. Among the nonsense mutations, the most predominant type of single-base substitution was the C > T transition, among which 75.3% (235/312) were within CpG sites. The most common "hotspot" codons for missense mutations were codons 101, 946, and 1783; while for nonsense mutations it was codon 712. One-base deletion or insertion was the most common type of frameshift mutation, causing protein truncation. The three most common frameshift mutations were c.5536_5539delAAAC, c.4554dupA, and c.5010_5013delGTTT. Splice mutations were the most frequently identified in exon 4 with a hotspot site c.602 + 1G > A. The spatial distribution of missense mutations showed that exons 22 and 4 had the highest mutation density (111 and 84 mutations per 100 bp, respectively), and exon 12 had the lowest mutation density, with 4 mutations per 100 bp. Further distribution analysis of the protein domains revealed that missense mutations were more common in the pore region and voltage sensor (231 mutations per 100 amino acids, respectively), and the protein truncation mutations were distributed evenly among the domains.
SCN1A mutations tend to cluster at distinct sites, depending on the characteristic CpG dinucleotides, exons, and functional domains. Higher mutation density in particular regions, such as exon 22 and exon 4, offers promising targets for therapeutic genetic interventions.
SCN1A是与癫痫相关的最广为人知且最常发生突变的基因。本研究分析了SCN1A突变的特征性空间和频率分布,旨在为SCN1A相关癫痫的诱变病因学提供重要见解。
从SCN1A突变数据库、HGMD数据库及文献综述中检索与癫痫相关的SCN1A变异。分析碱基替换、CpG二核苷酸中的突变频率以及外显子和结构域方面的突变空间分布。
在5106例无关病例中共鉴定出2621个SCN1A变异。最常见的类型是错义突变,其次是移码突变和剪接位点突变。在错义突变中,CpG二核苷酸内的转换比非CpG二核苷酸内的转换更常被鉴定到,最常见的类型是G>A转换。在无义突变中,单碱基替换最主要的类型是C>T转换,其中75.3%(235/312)位于CpG位点。错义突变最常见的“热点”密码子是101、946和1783密码子;而无义突变的是712密码子。单碱基缺失或插入是移码突变最常见的类型,导致蛋白质截短。三种最常见的移码突变是c.5536_5539delAAAC、c.4554dupA和c.5010_5013delGTTT。剪接突变在第4外显子中最常被鉴定到,热点位点为c.602+1G>A。错义突变的空间分布显示,第22和4外显子的突变密度最高(分别为每100 bp有111和84个突变),第12外显子的突变密度最低,每100 bp有4个突变。对蛋白质结构域的进一步分布分析表明,错义突变在孔区和电压感受器中更常见(分别为每100个氨基酸有231个突变),蛋白质截短突变在各结构域中分布均匀。
SCN1A突变倾向于聚集在不同位点,这取决于特征性的CpG二核苷酸、外显子和功能结构域。特定区域(如第22和4外显子)较高的突变密度为治疗性基因干预提供了有前景的靶点。