Rosenthal Elisabeth A, Wei Wei-Qi, Luo Yuan, Namjou-Khales Bahram, Schaid Daniel J, Esplin Edward D, Lape Michael, Kottyan Leah, Pacheco Jennifer Allen, Weng Chunhua, Gordon Adam Samuel, Kullo Iftikhar J, Crosslin David R, Grady William M, Hsu Li, Peters Ulrike, Jarvik Gail P
Department of Medicine, Division of Medical Genetics, University of Washington Medical Center, Seattle, WA, USA.
Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, USA.
Hum Genomics. 2025 Jul 9;19(1):77. doi: 10.1186/s40246-025-00791-0.
Many factors, including environmental and genetic variables, contribute to Colorectal Cancer (CRC) risk. The genetic components of risk can be divided into monogenic and polygenic factors. Just as monogenic factors can increase risk for more than one condition, polygenic factors may also underlie multiple phenotypes, including behavioral traits. In order to understand the biology of CRC risk better, it is important to understand the shared polygenic genetic architecture contributing to CRC risk and other phenotypes, including CRC associated risk factors.
We investigated potential shared genetics by performing a Phenome-wide association study (PheWAS) with a multi-ancestry CRC polygenic risk score (PRS). The discovery cohort (N = 426,464) consisted of ancestrally diverse participants from the United Kingdom Biobank. The replication cohort (N = 87,271) consisted of ancestrally diverse participants from the electronic Medical Records and Genomics Network phase 3. We used a mixed-effects model to adjust for the presence of related individuals. To preserve power, we limited the number of tests by restricting analysis to ancestor phecodes derived from the electronic health record (EHR) that were not likely to be a result of CRC or its treatment.
We discovered and replicated associations between the CRC PRS and breast cancer, prostate cancer, obesity, smoking and alcohol use (discovery p < 1.1e-4; replication p < 0.0019). The association between CRC risk and prostate cancer may be a novel finding, whereas the association with breast cancer has been previously observed using orthogonal methods. The association between CRC risk and behavioral risk factors corroborate previous studies, also using orthogonal methods, and may reveal potential prevention or treatment strategies.
As these results corroborate findings from other studies using orthogonal methods, we demonstrate that a CRC PRS can be used as a proxy for genetic risk for CRC when investigating shared genetics between CRC and other phenotypes. Further study of the relationship between PRS from multiple traits with EHR data may reveal additional shared genetic factors. Ultimately, understanding these underlying genetic correlations may identify prevention and treatment strategies for CRC.
包括环境和基因变量在内的许多因素都会影响结直肠癌(CRC)风险。风险的遗传成分可分为单基因因素和多基因因素。正如单基因因素可增加多种疾病的风险一样,多基因因素也可能是包括行为特征在内的多种表型的基础。为了更好地理解CRC风险的生物学机制,了解导致CRC风险和其他表型(包括CRC相关风险因素)的共享多基因遗传结构非常重要。
我们通过对多血统CRC多基因风险评分(PRS)进行全表型组关联研究(PheWAS)来调查潜在的共享遗传学。发现队列(N = 426,464)由来自英国生物银行的不同血统参与者组成。复制队列(N = 87,271)由来自电子病历和基因组学网络第3阶段的不同血统参与者组成。我们使用混合效应模型来调整相关个体的存在。为了保持检验效能,我们通过将分析限制在源自电子健康记录(EHR)且不太可能是CRC或其治疗结果的祖先疾病编码来限制检验数量。
我们发现并重复验证了CRC PRS与乳腺癌、前列腺癌、肥胖、吸烟和饮酒之间的关联(发现阶段p < 1.1e - 4;复制阶段p < 0.0019)。CRC风险与前列腺癌之间的关联可能是一项新发现,而与乳腺癌的关联此前已通过正交方法观察到。CRC风险与行为风险因素之间的关联也证实了先前使用正交方法的研究结果,并且可能揭示潜在的预防或治疗策略。
由于这些结果证实了使用正交方法的其他研究结果,我们证明在研究CRC与其他表型之间的共享遗传学特征时,CRC PRS可作为CRC遗传风险的替代指标。进一步研究多个性状的PRS与EHR数据之间的关系可能会揭示更多共享遗传因素。最终,了解这些潜在的遗传相关性可能会确定CRC的预防和治疗策略。