Weissfeld Joel L, Lin Yan, Lin Hui-Min, Kurland Brenda F, Wilson David O, Fuhrman Carl R, Pennathur Arjun, Romkes Marjorie, Nukui Tomoko, Yuan Jian-Min, Siegfried Jill M, Diergaarde Brenda
Departments of *Epidemiology and †Biostatistics, Graduate School of Public Health, Pittsburgh, Pennsylvania; Departments of ‡Medicine, §Radiology, and ‖Cardiothoracic Surgery, School of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania; and ¶Department of Pharmacology, School of Medicine, University of Minnesota, Minneapolis, Minnesota.
J Thorac Oncol. 2015 Nov;10(11):1538-45. doi: 10.1097/JTO.0000000000000666.
Genome-wide association studies (GWAS) have consistently identified specific lung cancer susceptibility regions. We evaluated the lung cancer-predictive performance of single-nucleotide polymorphisms (SNPs) in these regions.
Lung cancer cases (N = 778) and controls (N = 1166) were genotyped for 77 SNPs located in GWAS-identified lung cancer susceptibility regions. Variable selection and model development used stepwise logistic regression and decision-tree analyses. In a subset nested in the Pittsburgh Lung Screening Study, change in area under the receiver operator characteristic curve and net reclassification improvement were used to compare predictions made by risk factor models with and without genetic variables.
Variable selection and model development kept two SNPs in each of three GWAS regions, rs2736100 and rs7727912 in 5p15.33, rs805297 and rs1802127 in 6p21.33, and rs8034191 and rs12440014 in 15q25.1. The ratio of cases to controls was three times higher among subjects with a high-risk genotype in every one as opposed to none of the three GWAS regions (odds ratio, 3.14; 95% confidence interval, 2.02-4.88; adjusted for sex, age, and pack-years). Adding a three-level classified count of GWAS regions with high-risk genotypes to an age and smoking risk factor-only model improved lung cancer prediction by a small amount: area under the receiver operator characteristic curve, 0.725 versus 0.717 (p = 0.056); overall net reclassification improvement was 0.052 across low-, intermediate-, and high- 6-year lung cancer risk categories (<3.0%, 3.0%-4.9%, ≥ 5.0%).
Specifying genotypes for SNPs in three GWAS-identified susceptibility regions improved lung cancer prediction, but probably by an extent too small to affect disease control practice.
全基因组关联研究(GWAS)已持续鉴定出特定的肺癌易感区域。我们评估了这些区域中单核苷酸多态性(SNP)对肺癌的预测性能。
对位于GWAS鉴定的肺癌易感区域的77个SNP进行基因分型,纳入肺癌病例(N = 778)和对照(N = 1166)。变量选择和模型构建采用逐步逻辑回归和决策树分析。在匹兹堡肺癌筛查研究的一个子集中,利用受试者工作特征曲线下面积的变化和净重新分类改善,比较有和没有遗传变量的风险因素模型所做的预测。
变量选择和模型构建在三个GWAS区域各保留了两个SNP,分别为5p15.33区域的rs2736100和rs7727912、6p21.33区域的rs805297和rs1802127,以及15q25.1区域的rs8034191和rs12440014。在三个GWAS区域中,有一个高风险基因型的受试者与无高风险基因型的受试者相比,病例与对照的比例高出三倍(比值比,3.14;95%置信区间,2.02 - 4.88;按性别、年龄和吸烟包年数调整)。将具有高风险基因型的GWAS区域的三级分类计数添加到仅包含年龄和吸烟风险因素的模型中,对肺癌预测有小幅改善:受试者工作特征曲线下面积,0.725对0.717(p = 0.056);在低、中、高6年肺癌风险类别(<3.0%、3.0% - 4.9%、≥5.0%)中,总体净重新分类改善为0.052。
指定三个GWAS鉴定的易感区域中SNP的基因型可改善肺癌预测,但改善程度可能太小,无法影响疾病控制实践。