Institute of Health and Environment, Seoul National University, 08826, Republic of Korea.
Interdisciplinary Program of Bioinformatics, Seoul National University, 08826, Republic of Korea.
Genetics. 2017 Nov;207(3):1147-1155. doi: 10.1534/genetics.117.300283. Epub 2017 Sep 12.
Despite the many successes of genome-wide association studies (GWAS), the known susceptibility variants identified by GWAS have modest effect sizes, leading to notable skepticism about the effectiveness of building a risk prediction model from large-scale genetic data. However, in contrast to genetic variants, the family history of diseases has been largely accepted as an important risk factor in clinical diagnosis and risk prediction. Nevertheless, the complicated structures of the family history of diseases have limited their application in clinical practice. Here, we developed a new method that enables incorporation of the general family history of diseases with a liability threshold model, and propose a new analysis strategy for risk prediction with penalized regression analysis that incorporates both large numbers of genetic variants and clinical risk factors. Application of our model to type 2 diabetes in the Korean population (1846 cases and 1846 controls) demonstrated that single-nucleotide polymorphisms accounted for 32.5% of the variation explained by the predicted risk scores in the test data set, and incorporation of family history led to an additional 6.3% improvement in prediction. Our results illustrate that family medical history provides valuable information on the variation of complex diseases and improves prediction performance.
尽管全基因组关联研究(GWAS)取得了许多成功,但 GWAS 确定的已知易感性变异具有适度的效应大小,这导致人们对从大规模遗传数据构建风险预测模型的有效性产生了明显的怀疑。然而,与遗传变异不同,疾病家族史在临床诊断和风险预测中被广泛认为是一个重要的风险因素。尽管如此,疾病家族史的复杂结构限制了它们在临床实践中的应用。在这里,我们开发了一种新方法,能够将一般疾病家族史与易感性阈值模型结合起来,并提出了一种新的分析策略,用于通过纳入大量遗传变异和临床风险因素的惩罚回归分析进行风险预测。我们的模型在韩国人群中的 2 型糖尿病(1846 例病例和 1846 例对照)中的应用表明,单核苷酸多态性解释了预测风险评分在测试数据集变异的 32.5%,而家族史的纳入则使预测性能提高了 6.3%。我们的研究结果表明,家族医疗史提供了有关复杂疾病变异的有价值信息,并提高了预测性能。