Key Laboratory for Bio-Electromagnetic Environment and Advanced Medical Theranostics, School of Biomedical Engineering and Informatics, Nanjing Medical University, Nanjing, 211166, P. R. China.
Center for Data Science, Zhejiang University, Hangzhou, 310058, P. R. China.
Adv Sci (Weinh). 2024 Sep;11(36):e2400815. doi: 10.1002/advs.202400815. Epub 2024 Aug 5.
Cistrome-wide association studies (CWAS) are pivotal for identifying genetic determinants of diseases by correlating genetically regulated cistrome states with phenotypes. Traditional CWAS typically develops a model based on cistrome and genotype data to associate predicted cistrome states with phenotypes. The random effect cistrome-wide association study (RECWAS), reevaluates the necessity of cistrome state prediction in CWAS. RECWAS utilizes either a linear model or marginal effect for initial feature selection, followed by kernel-based feature aggregation for association testing is introduced. Through simulations and analysis of prostate cancer data, a thorough evaluation of CWAS and RECWAS is conducted. The results suggest that RECWAS offers improved power compared to traditional CWAS, identifying additional genomic regions associated with prostate cancer. CWAS identified 102 significant regions, while RECWAS found 50 additional significant regions compared to CWAS, many of which are validated. Validation encompassed a range of biological evidence, including risk signals from the GWAS catalog, susceptibility genes from the DisGeNET database, and enhancer-domain scores. RECWAS consistently demonstrated improved performance over traditional CWAS in identifying genomic regions associated with prostate cancer. These findings demonstrate the benefits of incorporating kernel methods into CWAS and provide new insights for genetic discovery in complex diseases.
全基因组调控元件关联研究(Cistrome-wide association studies,CWAS)通过将遗传调控的调控元件状态与表型相关联,对于识别疾病的遗传决定因素至关重要。传统的 CWAS 通常基于调控元件和基因型数据开发模型,以关联预测的调控元件状态与表型。随机效应全基因组调控元件关联研究(Random effect cistrome-wide association study,RECWAS)重新评估了 CWAS 中调控元件状态预测的必要性。RECWAS 采用线性模型或边际效应进行初始特征选择,然后引入基于核的特征聚合进行关联测试。通过对前列腺癌数据的模拟和分析,对 CWAS 和 RECWAS 进行了全面评估。结果表明,与传统的 CWAS 相比,RECWAS 提供了更高的功效,能够识别与前列腺癌相关的额外基因组区域。CWAS 鉴定出 102 个显著区域,而 RECWAS 则发现了 50 个与 CWAS 相比额外的显著区域,其中许多区域得到了验证。验证涵盖了一系列生物学证据,包括来自 GWAS 目录的风险信号、来自 DisGeNET 数据库的易感性基因以及增强子域评分。RECWAS 在识别与前列腺癌相关的基因组区域方面始终表现出优于传统 CWAS 的性能。这些发现表明将核方法纳入 CWAS 的益处,并为复杂疾病的遗传发现提供了新的见解。