Chang Jeffrey S, Yeh Ru-Fang, Wiencke John K, Wiemels Joseph L, Smirnov Ivan, Pico Alexander R, Tihan Tarik, Patoka Joe, Miike Rei, Sison Jennette D, Rice Terri, Wrensch Margaret R
Department of Epidemiology and Biostatistics, University of California, San Francisco, 44 Page Street, Suite 503, San Francisco, CA 94143-1215, USA.
Cancer Epidemiol Biomarkers Prev. 2008 Jun;17(6):1368-73. doi: 10.1158/1055-9965.EPI-07-2830.
Glioma is a complex disease that is unlikely to result from the effect of a single gene. Genetic analysis at the pathway level involving multiple genes may be more likely to capture gene-disease associations than analyzing genes one at a time. The current pilot study included 112 Caucasians with glioblastoma multiforme and 112 Caucasian healthy controls frequency matched to cases by age and gender. Subjects were genotyped using a commercially available (ParAllele/Affymetrix) assay panel of 10,177 nonsynonymous coding single-nucleotide polymorphisms (SNP) spanning the genome known at the time the panel was constructed. For this analysis, we selected 10 pathways potentially involved in gliomagenesis that had SNPs represented on the panel. We performed random forests (RF) analyses of SNPs within each pathway group and logistic regression to assess interaction among genes in the one pathway for which the RF prediction error was better than chance and the permutation P < 0.10. Only the DNA repair pathway had a better than chance classification of case-control status with a prediction error of 45.5% and P = 0.09. Three SNPs (rs1047840 of EXO1, rs12450550 of EME1, and rs799917 of BRCA1) of the DNA repair pathway were identified as promising candidates for further replication. In addition, statistically significant interactions (P < 0.05) between rs1047840 of EXO1 and rs799917 or rs1799966 of BRCA1 were observed. Despite less than complete inclusion of genes and SNPs relevant to glioma and a small sample size, RF analysis identified one important biological pathway and several SNPs potentially associated with the development of glioblastoma.
胶质瘤是一种复杂的疾病,不太可能由单个基因的作用导致。与一次分析一个基因相比,在涉及多个基因的通路水平上进行遗传分析可能更有可能捕捉到基因与疾病的关联。当前的试点研究纳入了112名患有多形性胶质母细胞瘤的白种人以及112名在年龄和性别上与病例频率匹配的白种人健康对照。使用当时构建的包含10177个非同义编码单核苷酸多态性(SNP)的市售(ParAllele/Affymetrix)检测面板对受试者进行基因分型,这些SNP覆盖了当时已知的整个基因组。对于该分析,我们选择了10条可能参与胶质瘤发生的通路,检测面板上有这些通路的SNP代表。我们对每个通路组内的SNP进行随机森林(RF)分析,并进行逻辑回归,以评估其中一个通路中基因之间的相互作用,该通路的RF预测误差优于随机水平且置换P<0.10。只有DNA修复通路对病例对照状态的分类优于随机水平,预测误差为45.5%,P = 0.09。DNA修复通路的三个SNP(EXO1的rs1047840、EME1的rs12450550和BRCA1的rs799917)被确定为有希望进一步验证的候选基因。此外,观察到EXO1的rs1047840与BRCA1的rs799917或rs1799966之间存在统计学显著的相互作用(P<0.05)。尽管与胶质瘤相关的基因和SNP没有完全纳入且样本量较小,但RF分析确定了一条重要的生物学通路以及几个可能与胶质母细胞瘤发生相关的SNP。