Pal Lipika R, Moult John
Institute for Bioscience and Biotechnology Research, University of Maryland, 9600 Gudelsky Drive, Rockville, MD 20850, USA.
Institute for Bioscience and Biotechnology Research, University of Maryland, 9600 Gudelsky Drive, Rockville, MD 20850, USA; Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD 20742, USA.
J Mol Biol. 2015 Jul 3;427(13):2271-89. doi: 10.1016/j.jmb.2015.04.014. Epub 2015 May 1.
Recent genome-wide association studies (GWAS) have led to the reliable identification of single nucleotide polymorphisms (SNPs) at a number of loci associated with increased risk of specific common human diseases. Each such locus implicates multiple possible candidate SNPs for involvement in disease mechanism. A variety of mechanisms may link the presence of an SNP to altered in vivo gene product function and hence contribute to disease risk. Here, we report an analysis of the role of one of these mechanisms, missense SNPs (msSNPs) in proteins in seven complex trait diseases. Linkage disequilibrium information was used to identify possible candidate msSNPs associated with increased disease risk at each of 356 loci for the seven diseases. Two computational methods were used to estimate which of these SNPs has a significant impact on in vivo protein function. 69% of the loci have at least one candidate msSNP and 33% have at least one predicted high-impact msSNP. In some cases, these SNPs are in well-established disease-related proteins, such as MST1 (macrophage stimulating 1) for Crohn's disease. In others, they are in proteins identified by GWAS as likely candidates for disease relevance, but previously without known mechanism, such as ADAMTS13 (ADAM metallopeptidase with thrombospondin type 1 motif, 13) for coronary artery disease. In still other cases, the missense SNPs are in proteins not previously suggested as disease candidates, such as TUBB1 (tubulin, beta 1, class VI) for hypertension. Together, these data support a substantial role for this class of SNPs in susceptibility to common human disease.
近期的全基因组关联研究(GWAS)已可靠地鉴定出多个位点的单核苷酸多态性(SNP),这些位点与特定常见人类疾病风险增加相关。每个此类位点都涉及多个可能参与疾病机制的候选SNP。多种机制可能将SNP的存在与体内基因产物功能改变联系起来,从而导致疾病风险增加。在此,我们报告了对其中一种机制(即错义SNP,msSNP)在七种复杂性状疾病蛋白质中的作用的分析。利用连锁不平衡信息来识别与七种疾病的356个位点中每个位点疾病风险增加相关的可能候选msSNP。使用两种计算方法来估计这些SNP中哪些对体内蛋白质功能有显著影响。69%的位点至少有一个候选msSNP,33%的位点至少有一个预测具有高影响的msSNP。在某些情况下,这些SNP存在于已明确的与疾病相关的蛋白质中,如克罗恩病中的MST1(巨噬细胞刺激蛋白1)。在其他情况下,它们存在于GWAS鉴定为可能与疾病相关但此前机制不明的蛋白质中,如冠状动脉疾病中的ADAMTS13(含血小板反应蛋白基序的解聚素样金属蛋白酶13)。在另外一些情况下,错义SNP存在于此前未被认为是疾病候选蛋白的蛋白质中,如高血压中的TUBB1(微管蛋白,β1,VI类)。总之,这些数据支持了这类SNP在常见人类疾病易感性中起重要作用。