Department of Public Health Sciences, University of Chicago, Chicago, IL 60637, USA.
Section of Genetic Medicine, Department of Medicine, University of Chicago, Chicago, IL 60637, USA.
Am J Hum Genet. 2023 Jun 1;110(6):950-962. doi: 10.1016/j.ajhg.2023.04.005. Epub 2023 May 9.
Genome-wide association studies (GWASs) have identified more than 200 genomic loci for breast cancer risk, but specific causal genes in most of these loci have not been identified. In fact, transcriptome-wide association studies (TWASs) of breast cancer performed using gene expression prediction models trained in breast tissue have yet to clearly identify most target genes. To identify candidate genes, we performed a GWAS analysis in a breast cancer dataset from UK Biobank (UKB) and combined the results with the GWAS results of the Breast Cancer Association Consortium (BCAC) by a meta-analysis. Using the summary statistics from the meta-analysis, we performed a joint TWAS analysis that combined TWAS signals from multiple tissues. We used expression prediction models trained in 11 tissues that are potentially relevant to breast cancer from the Genotype-Tissue Expression (GTEx) data. In the GWAS analysis, we identified eight loci distinct from those reported previously. In the TWAS analysis, we identified 309 genes at 108 genomic loci to be significantly associated with breast cancer at the Bonferroni threshold. Of these, 17 genes were located in eight regions that were at least 1 Mb away from published GWAS hits. The remaining TWAS-significant genes were located in 100 known genomic loci from previous GWASs of breast cancer. We found that 21 genes located in known GWAS loci remained statistically significant after conditioning on previous GWAS index variants. Our study provides insights into breast cancer genetics through mapping candidate target genes in a large proportion of known GWAS loci and discovering multiple new loci.
全基因组关联研究(GWAS)已经确定了 200 多个与乳腺癌风险相关的基因组位点,但这些位点中的大多数特定因果基因尚未确定。事实上,使用在乳腺组织中训练的基因表达预测模型进行的全转录组关联研究(TWAS)尚未明确确定大多数靶基因。为了鉴定候选基因,我们在英国生物库(UKB)的乳腺癌数据集上进行了 GWAS 分析,并通过荟萃分析将结果与乳腺癌协会联盟(BCAC)的 GWAS 结果相结合。使用荟萃分析的汇总统计数据,我们进行了联合 TWAS 分析,该分析结合了来自多个组织的 TWAS 信号。我们使用了来自 Genotype-Tissue Expression (GTEx) 数据中 11 个可能与乳腺癌相关的组织的表达预测模型进行训练。在 GWAS 分析中,我们确定了 8 个与以前报道的不同的位点。在 TWAS 分析中,我们在 108 个基因组位点确定了 309 个基因与乳腺癌在 Bonferroni 阈值下显著相关。其中,17 个基因位于至少距离已发表 GWAS 命中 1Mb 远的八个区域。其余 TWAS 显著的基因位于先前乳腺癌 GWAS 中 100 个已知的基因组位点。我们发现,在对先前 GWAS 索引变体进行条件化后,位于已知 GWAS 位点的 21 个基因仍然具有统计学意义。我们的研究通过在大部分已知 GWAS 位点中映射候选靶基因并发现多个新的位点,为乳腺癌遗传学提供了深入的了解。