The University of Queensland, Queensland Brain Institute, Brisbane, QLD 4072, Australia.
Hum Mol Genet. 2013 Feb 15;22(4):832-41. doi: 10.1093/hmg/dds491. Epub 2012 Nov 28.
Common diseases such as endometriosis (ED), Alzheimer's disease (AD) and multiple sclerosis (MS) account for a significant proportion of the health care burden in many countries. Genome-wide association studies (GWASs) for these diseases have identified a number of individual genetic variants contributing to the risk of those diseases. However, the effect size for most variants is small and collectively the known variants explain only a small proportion of the estimated heritability. We used a linear mixed model to fit all single nucleotide polymorphisms (SNPs) simultaneously, and estimated genetic variances on the liability scale using SNPs from GWASs in unrelated individuals for these three diseases. For each of the three diseases, case and control samples were not all genotyped in the same laboratory. We demonstrate that a careful analysis can obtain robust estimates, but also that insufficient quality control (QC) of SNPs can lead to spurious results and that too stringent QC is likely to remove real genetic signals. Our estimates show that common SNPs on commercially available genotyping chips capture significant variation contributing to liability for all three diseases. The estimated proportion of total variation tagged by all SNPs was 0.26 (SE 0.04) for ED, 0.24 (SE 0.03) for AD and 0.30 (SE 0.03) for MS. Further, we partitioned the genetic variance explained into five categories by a minor allele frequency (MAF), by chromosomes and gene annotation. We provide strong evidence that a substantial proportion of variation in liability is explained by common SNPs, and thereby give insights into the genetic architecture of the diseases.
常见疾病,如子宫内膜异位症 (ED)、阿尔茨海默病 (AD) 和多发性硬化症 (MS),在许多国家的医疗保健负担中占有相当大的比例。针对这些疾病的全基因组关联研究 (GWAS) 已经确定了许多个体遗传变异,这些变异会增加患病风险。然而,大多数变异的效应大小较小,而且已知的变异共同解释了遗传率的一小部分。我们使用线性混合模型同时拟合所有单核苷酸多态性 (SNP),并使用来自无关个体 GWAS 的 SNP 估计遗传方差在易感性尺度上。对于这三种疾病,每种疾病的病例和对照样本都不是在同一个实验室进行全基因分型。我们证明,仔细的分析可以获得稳健的估计,但 SNP 的质量控制 (QC) 不足也会导致虚假结果,而 QC 过于严格则可能会去除真实的遗传信号。我们的估计表明,商业可用基因分型芯片上的常见 SNP 可以捕获对所有三种疾病易感性有贡献的显著变异。所有 SNP 标记的总变异比例估计值分别为 ED 的 0.26(SE 0.04)、AD 的 0.24(SE 0.03)和 MS 的 0.30(SE 0.03)。此外,我们根据次要等位基因频率 (MAF)、染色体和基因注释将遗传方差解释分为五类。我们提供了强有力的证据,证明常见 SNP 可以解释易感性变异的很大一部分,从而深入了解疾病的遗传结构。