Suroliya Varun, Uppili Bharathram, Kumar Manish, Jha Vineet, Srivastava Achal K, Faruq Mohammed
Department of Neurology, All India Institute of Medical Sciences, Ansari Nagar, Delhi, 110020, India.
Genomics and Molecular Medicine, CSIR-Institute of Genomics and Integrative Biology, Mall Road, Delhi, 110007, India.
Hum Genome Var. 2024 Jun 13;11(1):25. doi: 10.1038/s41439-024-00281-0.
Tandem nucleotide repeat (TNR) expansions, particularly the CNG nucleotide configuration, are associated with a variety of neurodegenerative disorders. In this study, we aimed to identify novel unstable CNG repeat loci associated with the neurogenetic disorder spinocerebellar ataxia (SCA). Using a computational approach, 15,069 CNG repeat loci in the coding and noncoding regions of the human genome were identified. Based on the feature selection criteria (repeat length >10 and functional location of repeats), we selected 52 repeats for further analysis and evaluated the repeat length variability in 100 control subjects. A subset of 19 CNG loci observed to be highly variable in control subjects was selected for subsequent analysis in 100 individuals with SCA. The genes with these highly variable repeats also exhibited higher gene expression levels in the brain according to the tissue expression dataset (GTEx). No pathogenic expansion events were identified in patient samples, which is a limitation given the size of the patient group examined; however, these loci contain potential risk alleles for expandability. Recent studies have implicated GLS, RAI1, GIPC1, MED15, EP400, MEF2A, and CNKSR2 in neurological diseases, with GLS, GIPC1, MED15, RAI1, and MEF2A sharing the same repeat loci reported in this study. This finding validates the approach of evaluating repeat loci in different populations and their possible implications for human pathologies.
串联核苷酸重复序列(TNR)扩增,尤其是CNG核苷酸构型,与多种神经退行性疾病相关。在本研究中,我们旨在鉴定与神经遗传性疾病脊髓小脑共济失调(SCA)相关的新型不稳定CNG重复位点。采用计算方法,在人类基因组的编码区和非编码区鉴定出15,069个CNG重复位点。根据特征选择标准(重复长度>10且重复序列的功能位置),我们选择了52个重复序列进行进一步分析,并评估了100名对照受试者的重复长度变异性。在100名SCA患者中,选择了19个在对照受试者中观察到高度可变的CNG位点进行后续分析。根据组织表达数据集(GTEx),具有这些高度可变重复序列的基因在大脑中也表现出较高的基因表达水平。在患者样本中未发现致病性扩增事件,鉴于所检查患者组的规模,这是一个局限性;然而,这些位点包含可扩展性的潜在风险等位基因。最近的研究表明,GLS、RAI1、GIPC1、MED15、EP400、MEF2A和CNKSR2与神经疾病有关,其中GLS、GIPC1、MED15、RAI1和MEF2A共享本研究中报道的相同重复位点。这一发现验证了评估不同人群中重复位点及其对人类病理学可能影响的方法。