Environmental Genomics Section, Laboratory of Molecular Genetics, National Institute of Environmental Health Sciences, National Institutes of Health, Research Triangle Park, North Carolina, United States of America.
PLoS One. 2010 Aug 3;5(8):e11934. doi: 10.1371/journal.pone.0011934.
Prior microarray studies of smokers at high risk for lung cancer have demonstrated that heterogeneity in bronchial airway epithelial cell gene expression response to smoking can serve as an early diagnostic biomarker for lung cancer. As a first step in applying functional genomic analysis to population studies, we have examined the relationship between gene expression variation and genetic variation in a central molecular pathway (NRF2-mediated antioxidant response) associated with smoking exposure and lung cancer. We assessed global gene expression in histologically normal airway epithelial cells obtained at bronchoscopy from smokers who developed lung cancer (SC, n = 20), smokers without lung cancer (SNC, n = 24), and never smokers (NS, n = 8). Functional enrichment analysis showed that the NRF2-mediated, antioxidant response element (ARE)-regulated genes, were significantly lower in SC, when compared with expression levels in SNC. Importantly, we found that the expression of MAFG (a binding partner of NRF2) was correlated with the expression of ARE genes, suggesting MAFG levels may limit target gene induction. Bioinformatically we identified single nucleotide polymorphisms (SNPs) in putative ARE genes and to test the impact of genetic variation, we genotyped these putative regulatory SNPs and other tag SNPs in selected NRF2 pathway genes. Sequencing MAFG locus, we identified 30 novel SNPs and two were associated with either gene expression or lung cancer status among smokers. This work demonstrates an analysis approach that integrates bioinformatics pathway and transcription factor binding site analysis with genotype, gene expression and disease status to identify SNPs that may be associated with individual differences in gene expression and/or cancer status in smokers. These polymorphisms might ultimately contribute to lung cancer risk via their effect on the airway gene expression response to tobacco-smoke exposure.
先前针对肺癌高危吸烟者的微阵列研究表明,支气管气道上皮细胞基因表达对吸烟的异质性可作为肺癌的早期诊断生物标志物。作为将功能基因组分析应用于人群研究的第一步,我们研究了与吸烟暴露和肺癌相关的中央分子途径(NRF2 介导的抗氧化反应)中的基因表达变异与遗传变异之间的关系。我们评估了在支气管镜检查中从患有肺癌的吸烟者(SC,n=20)、没有肺癌的吸烟者(SNC,n=24)和从不吸烟者(NS,n=8)中获得的组织学正常气道上皮细胞的全基因表达。功能富集分析表明,与 SNC 中的表达水平相比,SC 中 NRF2 介导的抗氧化反应元件(ARE)调节基因显着降低。重要的是,我们发现 MAFG(NRF2 的结合伴侣)的表达与 ARE 基因的表达相关,这表明 MAFG 水平可能限制了靶基因的诱导。通过生物信息学,我们鉴定了潜在的 ARE 基因中的单核苷酸多态性(SNP),并为了测试遗传变异的影响,我们对这些潜在的调节 SNP 以及 NRF2 途径基因中的其他标记 SNP 进行了基因分型。对 MAFG 基因座进行测序,我们确定了 30 个新的 SNP,其中两个 SNP 与吸烟者的基因表达或肺癌状态有关。这项工作展示了一种分析方法,该方法将生物信息学途径和转录因子结合位点分析与基因型、基因表达和疾病状态相结合,以鉴定可能与吸烟者基因表达和/或癌症状态的个体差异相关的 SNP。这些多态性可能最终通过其对气道基因对烟草烟雾暴露的反应的影响而导致肺癌风险增加。