Goertzel Benjamin N, Pennachin Cassio, de Souza Coelho Lucio, Gurbaxani Brian, Maloney Elizabeth M, Jones James F
Virginia Tech, National Capital Region, Arlington, VA, USA.
Pharmacogenomics. 2006 Apr;7(3):475-83. doi: 10.2217/14622416.7.3.475.
This paper asks whether the presence of chronic fatigue syndrome (CFS) can be more accurately predicted from single nucleotide polymorphism (SNP) profiles than would occur by chance.
Specifically, given SNP profiles for 43 CFS patients, together with 58 controls, we used an enumerative search to identify an ensemble of conjunctive rules that predict whether a patient has CFS.
The accuracy of the rules reached 76.3%, with the highest accuracy rules yielding 49 true negatives, 15 false negatives, 28 true positives and nine false positives (odds ratio [OR] 8.94, p < 0.0001). Analysis of the SNPs used most frequently in the overall ensemble of rules gave rise to a list of 'most important SNPs', which was not identical to the list of 'most differentiating SNPs' that one would calculate via studying each SNP independently. The top three genes containing the SNPs accounting for the highest accumulated importances were neuronal tryptophan hydroxylase (TPH2), catechol-O-methyltransferase (COMT) and nuclear receptor subfamily 3, group C, member 1 glucocorticoid receptor (NR3C1).
The fact that only 28 out of several million possible SNPs predict whether a person has CFS with 76% accuracy indicates that CFS has a genetic component that may help to explain some aspects of the illness.
本文探讨通过单核苷酸多态性(SNP)谱预测慢性疲劳综合征(CFS)的存在是否比随机预测更准确。
具体而言,给定43例CFS患者和58例对照的SNP谱,我们使用枚举搜索来识别预测患者是否患有CFS的联合规则集合。
这些规则的准确率达到76.3%,准确率最高的规则产生49个真阴性、15个假阴性、28个真阳性和9个假阳性(优势比[OR]8.94,p<0.0001)。对整个规则集合中最常使用的SNP进行分析,得出了一份“最重要SNP”列表,该列表与通过独立研究每个SNP计算得出的“最具区分性SNP”列表不同。包含累积重要性最高的SNP的前三个基因是神经元色氨酸羟化酶(TPH2)、儿茶酚-O-甲基转移酶(COMT)和核受体亚家族3、C组成员1糖皮质激素受体(NR3C1)。
在数百万个可能的SNP中,只有28个能以76%的准确率预测一个人是否患有CFS,这一事实表明CFS具有遗传成分,这可能有助于解释该疾病的某些方面。