1 Channing Division of Network Medicine, Brigham and Women's Hospital and Harvard Medical School, Boston, Massachusetts.
2 Working Group of Genomic Mathematics, University of Bonn, Bonn, Germany.
Am J Respir Cell Mol Biol. 2018 Nov;59(5):614-622. doi: 10.1165/rcmb.2018-0088OC.
Genome-wide association studies have identified common variants associated with chronic obstructive pulmonary disease (COPD). Whole-genome sequencing (WGS) offers comprehensive coverage of the entire genome, as compared with genotyping arrays or exome sequencing. We hypothesized that WGS in subjects with severe COPD and smoking control subjects with normal pulmonary function would allow us to identify novel genetic determinants of COPD. We sequenced 821 patients with severe COPD and 973 control subjects from the COPDGene and Boston Early-Onset COPD studies, including both non-Hispanic white and African American individuals. We performed single-variant and grouped-variant analyses, and in addition, we assessed the overlap of variants between sequencing- and array-based imputation. Our most significantly associated variant was in a known region near HHIP (combined P = 1.6 × 10); additional variants approaching genome-wide significance included previously described regions in CHRNA5, TNS1, and SERPINA6/SERPINA1 (the latter in African American individuals). None of our associations were clearly driven by rare variants, and we found minimal evidence of replication of genes identified by previously reported smaller sequencing studies. With WGS, we identified more than 20 million new variants, not seen with imputation, including more than 10,000 of potential importance in previously identified COPD genome-wide association study regions. WGS in severe COPD identifies a large number of potentially important functional variants, with the strongest associations being in known COPD risk loci, including HHIP and SERPINA1. Larger sample sizes will be needed to identify associated variants in novel regions of the genome.
全基因组关联研究已经确定了与慢性阻塞性肺疾病(COPD)相关的常见变体。与基因分型阵列或外显子测序相比,全基因组测序(WGS)提供了对整个基因组的全面覆盖。我们假设,对严重 COPD 患者和具有正常肺功能的吸烟对照者进行 WGS 测序,将使我们能够确定 COPD 的新遗传决定因素。我们对 COPDGene 和波士顿早发 COPD 研究中的 821 名严重 COPD 患者和 973 名对照者进行了测序,这些患者和对照者包括非西班牙裔白人和非裔美国人。我们进行了单变体和分组变体分析,此外,我们还评估了测序和基于阵列的插补之间变体的重叠。我们最显著相关的变体位于 HHIP 附近的已知区域(合并 P 值=1.6×10);另外,在 CHRNA5、TNS1 和 SERPINA6/SERPINA1 中,接近全基因组显著水平的区域包括先前描述的区域(在非裔美国人中)。我们的关联都不是由稀有变体明显驱动的,我们发现以前报道的较小测序研究确定的基因的复制证据很少。通过 WGS,我们鉴定了 2000 多万个新变体,这些变体无法通过插补获得,包括在先前确定的 COPD 全基因组关联研究区域中具有潜在重要性的超过 10000 个变体。在严重 COPD 中进行 WGS 鉴定了大量潜在重要的功能变体,最强的关联存在于已知的 COPD 风险基因座中,包括 HHIP 和 SERPINA1。需要更大的样本量来确定基因组新区域中的相关变体。