Wilcox Naomi, Tyrer Jonathan P, Dennis Joe, Yang Xin, Perry John R B, Gardner Eugene J, Easton Douglas F
Centre for Cancer Genetic Epidemiology, Department of Public Health and Primary Care, University of Cambridge, Cambridge, UK.
Metabolic Research Laboratory, Wellcome-MRC Institute of Metabolic Science, University of Cambridge, Cambridge, UK.
Genet Epidemiol. 2025 Jan;49(1):e22609. doi: 10.1002/gepi.22609.
In large cohort studies the number of unaffected individuals outnumbers the number of affected individuals, and the power can be low to detect associations for outcomes with low prevalence. We consider how including recorded family history in regression models increases the power to detect associations between genetic variants and disease risk. We show theoretically and using Monte-Carlo simulations that including a family history of the disease, with a weighting of 0.5 compared with true cases, increases the power to detect associations. This is a powerful approach for detecting variants with moderate effects, but for larger effect sizes a weighting of > 0.5 can be more powerful. We illustrate this both for common variants and for exome sequencing data for over 400,000 individuals in UK Biobank to evaluate the association between the burden of protein-truncating variants in genes and risk for four cancer types.
在大型队列研究中,未受影响个体的数量超过受影响个体的数量,对于低患病率的结局,检测关联的效能可能较低。我们考虑在回归模型中纳入记录的家族史如何提高检测基因变异与疾病风险之间关联的效能。我们通过理论分析和蒙特卡洛模拟表明,纳入疾病家族史(与真实病例相比权重为0.5)可提高检测关联的效能。这是检测具有中等效应变异的有效方法,但对于效应量较大的情况,权重>0.5可能更有效。我们针对常见变异以及英国生物银行中超过40万个体的外显子组测序数据进行了说明,以评估基因中蛋白质截短变异的负担与四种癌症类型风险之间的关联。