Department of Genetics, Merck Research Laboratories, Pasteur, Boston, MA 02115, USA.
Am J Epidemiol. 2012 Sep 1;176(5):423-30. doi: 10.1093/aje/kws123. Epub 2012 Aug 2.
Large-scale genome-wide association studies (GWAS) have identified over 40 genomic regions significantly associated with type 2 diabetes mellitus. However, GWAS results are not always straightforward to interpret, and linking these loci to meaningful disease etiology is often difficult without extensive follow-up studies. The authors expanded on previously reported type 2 diabetes mellitus GWAS from the nested case-control studies of 2 prospective US cohorts by incorporating expression single nucleotide polymorphism (SNP) information and applying SNP set enrichment analysis to identify sets of SNPs associated with genes that could provide further biologic insight to traditional genome-wide analysis. Using data collected between 1989 and 1994 in these previous studies to form a nested case-control study, the authors found that 3 of the most significantly associated SNPs to type 2 diabetes mellitus in their study are expression SNPs to the lymphocyte antigen 75 gene (LY75), the ubiquitin-specific peptidase 36 gene (USP36), and the phosphatidylinositol transfer protein, cytoplasmic 1 gene (PITPNC1). SNP set enrichment analysis of the GWAS results identified enrichment for expression SNPs to the macrophage-enriched module and the Gene Ontology (GO) biologic process fat cell differentiation human, which includes the transcription factor 7-like 2 gene (TCF7L2), as well as other type 2 diabetes mellitus-associated genes. Integrating genome-wide association, gene expression, and gene set analysis may provide valuable biologic support for potential type 2 diabetes mellitus susceptibility loci and may be useful in identifying new targets or pathways of interest for the treatment and prevention of type 2 diabetes mellitus.
大规模全基因组关联研究(GWAS)已经确定了 40 多个与 2 型糖尿病显著相关的基因组区域。然而,GWAS 结果并不总是易于解释,如果没有广泛的后续研究,将这些位点与有意义的疾病病因联系起来通常是困难的。作者通过纳入表达单核苷酸多态性(SNP)信息并应用 SNP 集富集分析来识别与基因相关的 SNP 集,为传统的全基因组分析提供进一步的生物学见解,从而扩展了先前报道的来自 2 个前瞻性美国队列的嵌套病例对照研究的 2 型糖尿病 GWAS。利用之前研究中在 1989 年至 1994 年之间收集的数据形成嵌套病例对照研究,作者发现他们的研究中与 2 型糖尿病最显著相关的 3 个 SNP 是淋巴细胞抗原 75 基因(LY75)、泛素特异性肽酶 36 基因(USP36)和磷酸肌醇转移蛋白、细胞质 1 基因(PITPNC1)的表达 SNP。GWAS 结果的 SNP 集富集分析确定了富含表达 SNP 的巨噬细胞富集模块和基因本体论(GO)生物学过程人类脂肪细胞分化的富集,其中包括转录因子 7 样 2 基因(TCF7L2)以及其他 2 型糖尿病相关基因。整合全基因组关联、基因表达和基因集分析可能为潜在的 2 型糖尿病易感基因座提供有价值的生物学支持,并可能有助于确定 2 型糖尿病治疗和预防的新靶点或感兴趣的途径。