Schadt Eric E, Molony Cliona, Chudin Eugene, Hao Ke, Yang Xia, Lum Pek Y, Kasarskis Andrew, Zhang Bin, Wang Susanna, Suver Christine, Zhu Jun, Millstein Joshua, Sieberts Solveig, Lamb John, GuhaThakurta Debraj, Derry Jonathan, Storey John D, Avila-Campillo Iliana, Kruger Mark J, Johnson Jason M, Rohl Carol A, van Nas Atila, Mehrabian Margarete, Drake Thomas A, Lusis Aldons J, Smith Ryan C, Guengerich F Peter, Strom Stephen C, Schuetz Erin, Rushmore Thomas H, Ulrich Roger
Rosetta Inpharmatics, Seattle, Washington, United States of America.
PLoS Biol. 2008 May 6;6(5):e107. doi: 10.1371/journal.pbio.0060107.
Genetic variants that are associated with common human diseases do not lead directly to disease, but instead act on intermediate, molecular phenotypes that in turn induce changes in higher-order disease traits. Therefore, identifying the molecular phenotypes that vary in response to changes in DNA and that also associate with changes in disease traits has the potential to provide the functional information required to not only identify and validate the susceptibility genes that are directly affected by changes in DNA, but also to understand the molecular networks in which such genes operate and how changes in these networks lead to changes in disease traits. Toward that end, we profiled more than 39,000 transcripts and we genotyped 782,476 unique single nucleotide polymorphisms (SNPs) in more than 400 human liver samples to characterize the genetic architecture of gene expression in the human liver, a metabolically active tissue that is important in a number of common human diseases, including obesity, diabetes, and atherosclerosis. This genome-wide association study of gene expression resulted in the detection of more than 6,000 associations between SNP genotypes and liver gene expression traits, where many of the corresponding genes identified have already been implicated in a number of human diseases. The utility of these data for elucidating the causes of common human diseases is demonstrated by integrating them with genotypic and expression data from other human and mouse populations. This provides much-needed functional support for the candidate susceptibility genes being identified at a growing number of genetic loci that have been identified as key drivers of disease from genome-wide association studies of disease. By using an integrative genomics approach, we highlight how the gene RPS26 and not ERBB3 is supported by our data as the most likely susceptibility gene for a novel type 1 diabetes locus recently identified in a large-scale, genome-wide association study. We also identify SORT1 and CELSR2 as candidate susceptibility genes for a locus recently associated with coronary artery disease and plasma low-density lipoprotein cholesterol levels in the process.
与常见人类疾病相关的基因变异并非直接导致疾病,而是作用于中间分子表型,这些表型进而引发高阶疾病性状的变化。因此,识别那些随DNA变化而改变且与疾病性状变化相关的分子表型,不仅有可能提供所需的功能信息,以识别和验证直接受DNA变化影响的易感基因,还有助于理解这些基因所处的分子网络,以及这些网络的变化如何导致疾病性状的改变。为此,我们对400多个人类肝脏样本中的39000多个转录本进行了分析,并对782476个独特的单核苷酸多态性(SNP)进行了基因分型,以描绘人类肝脏中基因表达的遗传结构。肝脏是一个代谢活跃的组织,在包括肥胖症、糖尿病和动脉粥样硬化在内的多种常见人类疾病中起着重要作用。这项全基因组基因表达关联研究检测到SNP基因型与肝脏基因表达性状之间存在6000多个关联,其中许多已鉴定出的相应基因与多种人类疾病有关。通过将这些数据与来自其他人类和小鼠群体的基因型和表达数据相结合,证明了这些数据在阐明常见人类疾病病因方面的效用。这为越来越多从疾病全基因组关联研究中被确定为疾病关键驱动因素的基因座上正在识别的候选易感基因提供了急需的功能支持。通过使用综合基因组学方法,我们强调了我们的数据如何支持基因RPS26而非ERBB3作为最近在大规模全基因组关联研究中发现的一个新型1型糖尿病基因座最可能的易感基因。在此过程中,我们还将SORT1和CELSR2确定为一个最近与冠状动脉疾病和血浆低密度脂蛋白胆固醇水平相关的基因座的候选易感基因。