School of Computer Science and Engineering, Yulin Normal University, Yulin, 537000, China.
Sci Rep. 2019 Jul 22;9(1):10543. doi: 10.1038/s41598-019-47076-w.
Genes are the basic functional units of heredity. Differences in genes can lead to various congenital physical conditions. One kind of these differences is caused by genetic variations named single nucleotide polymorphisms (SNPs). An SNP is a variation in a single nucleotide that occurs at a specific position in the genome. Some SNPs can affect splice sites and protein structures and cause gene abnormalities. SNPs on paired chromosomes may lead to fatal diseases so that a fertilized embryo cannot develop into a normal fetus or the people born with these abnormalities die in childhood. The distributions of genotypes on these SNP sites are different from those on other sites. Based on this idea, we present a novel statistical method to detect the abnormal distributions of genotypes and locate the potentially lethal genes. The test was performed on HapMap data and 74 suspicious SNPs were found. Ten SNP maps "reviewed" genes in the NCBI database. Among them, 5 genes were related to fatal childhood diseases or embryonic development, 1 gene can cause spermatogenic failure, and the other 4 genes were associated with many genetic diseases. The results validated our method. The method is very simple and is guaranteed by a statistical test. It is an inexpensive way to discover potentially lethal genes and the mutation sites. The mined genes deserve further study.
基因是遗传的基本功能单位。基因的差异可能导致各种先天性身体状况。这些差异之一是由称为单核苷酸多态性 (SNP) 的遗传变异引起的。SNP 是基因组特定位置上单个核苷酸的变异。一些 SNP 会影响剪接位点和蛋白质结构,导致基因异常。成对染色体上的 SNP 可能导致致命疾病,使受精卵无法发育成正常胎儿,或者带有这些异常的人在童年时死亡。这些 SNP 位点上的基因型分布与其他位点不同。基于这个想法,我们提出了一种新的统计方法来检测基因型的异常分布并定位潜在的致死基因。该测试是在 HapMap 数据上进行的,发现了 74 个可疑 SNP。对 NCBI 数据库中的 10 个 SNP 图谱“审查”了基因。其中,5 个基因与致命的儿童疾病或胚胎发育有关,1 个基因可导致精子发生失败,另外 4 个基因与许多遗传疾病有关。结果验证了我们的方法。该方法非常简单,并通过统计检验得到保证。这是一种发现潜在致死基因和突变位点的廉价方法。挖掘出的基因值得进一步研究。