Lange Ethan M, Sun Jielin, Lange Leslie A, Zheng S Lilly, Duggan David, Carpten John D, Gronberg Henrik, Isaacs William B, Xu Jianfeng, Chang Bao-Li
Department of Genetics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA.
Cancer Epidemiol Biomarkers Prev. 2008 Sep;17(9):2208-14. doi: 10.1158/1055-9965.EPI-08-0183.
Over the past 2 decades, DNA samples from thousands of families have been collected and genotyped for linkage studies of common complex diseases, such as type 2 diabetes, asthma, and prostate cancer. Unfortunately, little success has been achieved in identifying genetic susceptibility risk factors through these considerable efforts. However, significant success in identifying common disease risk-associated variants has been recently achieved from genome-wide association studies using unrelated case-control samples. These genome-wide association studies are typically done using population-based cases and controls that are ascertained irrespective of their family history for the disease of interest. Few genetic association studies have taken full advantage of the considerable resources that are available from the linkage-based family collections despite evidence showing cases that have a positive family history of disease are more likely to carry common genetic variants associated with disease susceptibility. Herein, we argue that population stratification is still a concern in case-control genetic association studies, despite the development of analytic methods designed to account for this source of confounding, for a subset of single nucleotide polymorphisms in the genome, most notably those single nucleotide polymorphisms in regions involved with natural selection. We note that current analytic approaches designed to address the issue of population stratification in case-control studies cannot definitively distinguish between true and false associations, and we argue that family-based samples can still serve an invaluable role in following up findings from case-control studies.
在过去20年里,已收集了来自数千个家庭的DNA样本,并对其进行基因分型,用于诸如2型糖尿病、哮喘和前列腺癌等常见复杂疾病的连锁研究。遗憾的是,通过这些大量的努力,在识别遗传易感性风险因素方面几乎没有取得成功。然而,最近使用非亲属病例对照样本的全基因组关联研究在识别常见疾病风险相关变异方面取得了显著成功。这些全基因组关联研究通常使用基于人群的病例和对照,这些病例和对照是不论其对感兴趣疾病的家族史而确定的。尽管有证据表明有疾病家族史阳性的病例更有可能携带与疾病易感性相关的常见遗传变异,但很少有遗传关联研究充分利用基于连锁的家系样本中可用的大量资源。在此,我们认为,尽管已经开发出旨在解释这种混杂来源的分析方法,但对于基因组中的一部分单核苷酸多态性,尤其是那些与自然选择相关区域的单核苷酸多态性,群体分层在病例对照遗传关联研究中仍然是一个问题。我们注意到,目前旨在解决病例对照研究中群体分层问题的分析方法无法明确区分真实关联和虚假关联,并且我们认为基于家系的样本在跟进病例对照研究的结果方面仍然可以发挥宝贵的作用。