Morrison Alanna C, Huang Zhuoyi, Yu Bing, Metcalf Ginger, Liu Xiaoming, Ballantyne Christie, Coresh Josef, Yu Fuli, Muzny Donna, Feofanova Elena, Rustagi Navin, Gibbs Richard, Boerwinkle Eric
Human Genetics Center, University of Texas School of Public Health, Houston, TX 77030, USA.
Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA.
Am J Hum Genet. 2017 Feb 2;100(2):205-215. doi: 10.1016/j.ajhg.2016.12.009. Epub 2017 Jan 12.
Whole-genome sequencing (WGS) allows for a comprehensive view of the sequence of the human genome. We present and apply integrated methodologic steps for interrogating WGS data to characterize the genetic architecture of 10 heart- and blood-related traits in a sample of 1,860 African Americans. In order to evaluate the contribution of regulatory and non-protein coding regions of the genome, we conducted aggregate tests of rare variation across the entire genomic landscape using a sliding window, complemented by an annotation-based assessment of the genome using predefined regulatory elements and within the first intron of all genes. These tests were performed treating all variants equally as well as with individual variants weighted by a measure of predicted functional consequence. Significant findings were assessed in 1,705 individuals of European ancestry. After these steps, we identified and replicated components of the genomic landscape significantly associated with heart- and blood-related traits. For two traits, lipoprotein(a) levels and neutrophil count, aggregate tests of low-frequency and rare variation were significantly associated across multiple motifs. For a third trait, cardiac troponin T, investigation of regulatory domains identified a locus on chromosome 9. These practical approaches for WGS analysis led to the identification of informative genomic regions and also showed that defined non-coding regions, such as first introns of genes and regulatory domains, are associated with important risk factor phenotypes. This study illustrates the tractable nature of WGS data and outlines an approach for characterizing the genetic architecture of complex traits.
全基因组测序(WGS)能够全面了解人类基因组序列。我们提出并应用综合方法步骤来分析WGS数据,以刻画1860名非裔美国人样本中10种心脏和血液相关性状的遗传结构。为了评估基因组调控区域和非蛋白质编码区域的作用,我们使用滑动窗口对整个基因组范围内的罕见变异进行汇总测试,并通过基于预定义调控元件以及所有基因的第一个内含子内的基因组注释评估进行补充。这些测试在同等对待所有变异以及对单个变异按预测功能后果的衡量指标进行加权的情况下进行。在1705名欧洲血统个体中评估显著发现。经过这些步骤,我们识别并重复验证了与心脏和血液相关性状显著相关的基因组景观成分。对于脂蛋白(a)水平和中性粒细胞计数这两个性状,低频和罕见变异的汇总测试在多个基序上显著相关。对于第三个性状心肌肌钙蛋白T,对调控区域的研究在9号染色体上确定了一个位点。这些用于WGS分析的实用方法导致识别出信息丰富的基因组区域,还表明特定的非编码区域,如基因的第一个内含子和调控区域,与重要的风险因素表型相关。这项研究说明了WGS数据的易处理性,并概述了一种刻画复杂性状遗传结构的方法。