Xu Xiaohang, Xu Luopiao, Lang Zeyong, Sun Gege, Pan Junlong, Li Xue, Bian Zilong, Wu Xifeng
Center of Clinical Big Data and Analytics of the Second Affiliated Hospital and School of Public Health, Zhejiang University School of Medicine, Hangzhou, 310058, China.
Zhejiang Key Laboratory of Intelligent Preventive Medicine, Hangzhou, 310058, China.
Sci Rep. 2025 Mar 6;15(1):7825. doi: 10.1038/s41598-025-92083-9.
Lung cancer is a malignant tumor with a high morbidity and mortality rate worldwide, causing an increasing disease burden. Of these, the most common type is non-small cell lung cancer (NSCLC), which accounts for 80-85% of all lung cancer cases. Genetic research is crucial for continuously discovering susceptibility genes related to lung cancer for in-depth study. The role of genetic predisposition in the development of NSCLC, particularly within circadian rhythm pathways known to govern various physiological processes, is increasingly acknowledged. Yet, the association between genetic variants of circadian rhythm-related genes and NSCLC susceptibility among Chinese populations is not fully understood. This study carried out a two-phase (discovery and validation stages) research design to identify genetic variants associated with NSCLC risk within the circadian rhythm pathway. We employed extensive whole-genome sequencing (WGS) for 1,104 NSCLC cases and 9,635 controls. FastGWA-GLMM was used for single-locus risk association analysis of NSCLC, and we screened candidate SNPs in the validation set that comprised 4,444 cases and 174,282 controls from the Biobank Japan Project (BBJ). Furthermore, GCTA-COJO conditional analysis was utilized to confirm SNPs related to NSCLC risk. Finally, potential genetic variations that may regulate gene expression were explored in GTEx and QTLbase. RNA sequencing data were utilized for transcriptomic verification. Our study identified eight candidate SNPs associated with NSCLC susceptibility within the circadian rhythm pathway that met the requirement with P < 0.05 in both the discovery and validation populations. After conditional analysis, five of these SNPs remained. The A allele of CUL1 rs78524436 (OR = 1.18, 95%CI: 1.09-1.29, P = 7.99e-5) and the A allele of TEF rs9611588 (OR = 1.06, 95%CI: 1.02-1.10, P = 1.28e-3) were associated with an increased risk of NSCLC. The A allele of FBXL21 rs2069868 (OR = 0.86, 95%CI: 0.80-0.96, P = 4.78e-4), the T allele of CSNK1D rs147316973 (OR = 0.76, 95%CI: 0.65-0.88, P = 5.93e-4), and the A allele of RORA rs1589701 (OR = 0.94, 95%CI: 0.91-0.98, P = 3.40e-3) were associated with a lower risk of NSCLC, separately. The eQTL results revealed an association between RORA rs1589701 and TEF rs9611588 with the expression levels of RORA and TEF, respectively. Transcriptome data indicated that RORA and TEF showed lower expression levels in tumor tissues compared to normal tissues (P < 0.001). Moreover, poorer survival was observed in patients with lower RORA and TEF expressions (log-rank P < 0.05). Our findings spotlight potential susceptibility loci within circadian rhythm pathway genes that modulate NSCLC carcinogenesis, which enriches the understanding of the genetic susceptibility of NSCLC in the Chinese population and provides a more solid basis for exploring the biological mechanism of circadian rhythm genes in NSCLC.
肺癌是一种在全球范围内发病率和死亡率都很高的恶性肿瘤,导致疾病负担日益加重。其中,最常见的类型是非小细胞肺癌(NSCLC),占所有肺癌病例的80-85%。基因研究对于持续发现与肺癌相关的易感基因以进行深入研究至关重要。人们越来越认识到遗传易感性在NSCLC发生发展中的作用,尤其是在已知调控各种生理过程的昼夜节律途径中。然而,中国人群中昼夜节律相关基因的遗传变异与NSCLC易感性之间的关联尚未完全明确。本研究进行了两阶段(发现和验证阶段)的研究设计,以确定昼夜节律途径中与NSCLC风险相关的遗传变异。我们对1104例NSCLC病例和9635例对照进行了广泛的全基因组测序(WGS)。使用FastGWA-GLMM对NSCLC进行单基因座风险关联分析,并在验证集中筛选候选单核苷酸多态性(SNP),该验证集包括来自日本生物银行项目(BBJ)的4444例病例和174282例对照。此外,利用GCTA-COJO条件分析来确认与NSCLC风险相关的SNP。最后,在GTEx和QTLbase中探索可能调节基因表达的潜在遗传变异。利用RNA测序数据进行转录组验证。我们的研究在昼夜节律途径中鉴定出8个与NSCLC易感性相关的候选SNP,在发现和验证人群中均满足P<0.05的要求。经过条件分析后,其中5个SNP仍然显著。CUL1基因rs78524436的A等位基因(OR = 1.18,95%CI:1.09-1.29,P = 7.99e-5)和TEF基因rs9611588的A等位基因(OR = 1.06,95%CI:1.02-1.10,P = 1.28e-3)与NSCLC风险增加相关。FBXL21基因rs2069868的A等位基因(OR = 0.86,95%CI:0.80-0.96,P = 4.78e-4)、CSNK1D基因rs147316973的T等位基因(OR = 0.76,95%CI:0.65-0.88,P = 5.93e-4)和RORA基因rs1589701的A等位基因(OR = 0.94,95%CI:0.91-0.98,P = 3.40e-3)分别与NSCLC风险降低相关。表达数量性状基因座(eQTL)结果显示,RORA基因rs1589701和TEF基因rs9611588分别与RORA和TEF的表达水平相关。转录组数据表明,与正常组织相比,RORA和TEF在肿瘤组织中的表达水平较低(P < 0.001)。此外,RORA和TEF表达水平较低的患者生存率较差(对数秩检验P < 0.05)。我们的研究结果揭示了昼夜节律途径基因中潜在的易感位点,这些位点调节NSCLC的致癌作用,丰富了对中国人群中NSCLC遗传易感性的认识,并为探索昼夜节律基因在NSCLC中的生物学机制提供了更坚实的基础。