Department of Genomics of Common Disease, Imperial College London, London, United Kingdom.
PLoS One. 2013;8(3):e58048. doi: 10.1371/journal.pone.0058048. Epub 2013 Mar 12.
The limited ability of common variants to account for the genetic contribution to complex disease has prompted searches for rare variants of large effect, to partly explain the 'missing heritability'. Analyses of genome-wide genotyping data have identified genomic structural variants (GSVs) as a source of such rare causal variants. Recent studies have reported multiple GSV loci associated with risk of obesity. We attempted to replicate these associations by similar analysis of two familial-obesity case-control cohorts and a population cohort, and detected GSVs at 11 out of 18 loci, at frequencies similar to those previously reported. Based on their reported frequencies and effect sizes (OR≥25), we had sufficient statistical power to detect the large majority (80%) of genuine associations at these loci. However, only one obesity association was replicated. Deletion of a 220 kb region on chromosome 16p11.2 has a carrier population frequency of 2×10(-4) (95% confidence interval [9.6×10(-5)-3.1×10(-4)]); accounts overall for 0.5% [0.19%-0.82%] of severe childhood obesity cases (P = 3.8×10(-10); odds ratio = 25.0 [9.9-60.6]); and results in a mean body mass index (BMI) increase of 5.8 kg.m(-2) [1.8-10.3] in adults from the general population. We also attempted replication using BMI as a quantitative trait in our population cohort; associations with BMI at or near nominal significance were detected at two further loci near KIF2B and within FOXP2, but these did not survive correction for multiple testing. These findings emphasise several issues of importance when conducting rare GSV association, including the need for careful cohort selection and replication strategy, accurate GSV identification, and appropriate correction for multiple testing and/or control of false discovery rate. Moreover, they highlight the potential difficulty in replicating rare CNV associations across different populations. Nevertheless, we show that such studies are potentially valuable for the identification of variants making an appreciable contribution to complex disease.
常见变异对复杂疾病遗传贡献的有限解释促使人们寻找具有较大效应的罕见变异,以部分解释“遗传缺失”。对全基因组基因分型数据的分析已经将基因组结构变异(GSV)确定为这种罕见因果变异的来源。最近的研究报告了多个与肥胖风险相关的 GSV 位点。我们试图通过对两个家族性肥胖病例对照队列和一个人群队列进行类似的分析来复制这些关联,并在 18 个位点中的 11 个位点检测到 GSV,其频率与之前报道的相似。根据它们报告的频率和效应大小(OR≥25),我们有足够的统计能力在这些位点检测到绝大多数(80%)真正的关联。然而,只有一个肥胖关联得到了复制。16p11.2 染色体上 220kb 区域缺失的携带者群体频率为 2×10(-4)(95%置信区间[9.6×10(-5)-3.1×10(-4)]);总体上占严重儿童肥胖病例的 0.5%[0.19%-0.82%](P=3.8×10(-10);优势比=25.0[9.9-60.6]);导致普通人群中成年人平均体重指数(BMI)增加 5.8kg.m(-2)[1.8-10.3]。我们还尝试使用人群队列中的 BMI 作为定量特征进行复制;在 KIF2B 附近和 FOXP2 内的两个进一步位点检测到与 BMI 接近显著水平的关联,但这些关联在进行多次测试校正后并未存活。这些发现强调了在进行罕见 GSV 关联时需要注意的几个重要问题,包括仔细选择队列和复制策略、准确识别 GSV,以及适当校正多重检验和/或控制假发现率。此外,它们突出了在不同人群中复制罕见 CNV 关联的潜在困难。尽管如此,我们表明,此类研究对于识别对复杂疾病有明显贡献的变异具有潜在价值。