Division of Genetics, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115, USA.
Am J Hum Genet. 2011 Oct 7;89(4):496-506. doi: 10.1016/j.ajhg.2011.09.002. Epub 2011 Sep 29.
Although genome-wide association studies have implicated many individual loci in complex diseases, identifying the exact causal alleles and the cell types within which they act remains greatly challenging. To ultimately understand disease mechanism, researchers must carefully conceive functional studies in relevant pathogenic cell types to demonstrate the cellular impact of disease-associated genetic variants. This challenge is highlighted in autoimmune diseases, such as rheumatoid arthritis, where any of a broad range of immunological cell types might potentially be impacted by genetic variation to cause disease. To this end, we developed a statistical approach to identify potentially pathogenic cell types in autoimmune diseases by using a gene-expression data set of 223 murine-sorted immune cells from the Immunological Genome Consortium. We found enrichment of transitional B cell genes in systemic lupus erythematosus (p = 5.9 × 10(-6)) and epithelial-associated stimulated dendritic cell genes in Crohn disease (p = 1.6 × 10(-5)). Finally, we demonstrated enrichment of CD4+ effector memory T cell genes within rheumatoid arthritis loci (p < 10(-6)). To further validate the role of CD4+ effector memory T cells within rheumatoid arthritis, we identified 436 loci that were not yet known to be associated with the disease but that had a statistically suggestive association in a recent genome-wide association study (GWAS) meta-analysis (p(GWAS) < 0.001). Even among these putative loci, we noted a significant enrichment for genes specifically expressed in CD4+ effector memory T cells (p = 1.25 × 10(-4)). These cell types are primary candidates for future functional studies to reveal the role of risk alleles in autoimmunity. Our approach has application in other phenotypes, outside of autoimmunity, where many loci have been discovered and high-quality cell-type-specific gene expression is available.
尽管全基因组关联研究已经发现许多个体基因座与复杂疾病有关,但确定确切的因果等位基因和其作用的细胞类型仍然极具挑战性。为了最终理解疾病机制,研究人员必须在相关致病细胞类型中精心构思功能研究,以证明与疾病相关的遗传变异对细胞的影响。这一挑战在自身免疫性疾病中尤为突出,如类风湿关节炎,其中广泛的免疫细胞类型都可能受到遗传变异的影响而导致疾病。为此,我们开发了一种统计方法,通过使用免疫基因组联盟(Immunological Genome Consortium)的 223 种来自小鼠的免疫细胞的基因表达数据集,来识别自身免疫性疾病中潜在的致病细胞类型。我们发现系统性红斑狼疮(systemic lupus erythematosus)中转录活跃的 B 细胞基因富集(p = 5.9×10(-6)),克罗恩病(Crohn disease)中上皮相关刺激树突状细胞基因富集(p = 1.6×10(-5))。最后,我们在类风湿关节炎(rheumatoid arthritis)的基因座中证实了 CD4+效应记忆 T 细胞基因的富集(p < 10(-6))。为了进一步验证 CD4+效应记忆 T 细胞在类风湿关节炎中的作用,我们鉴定了 436 个先前未知与该疾病相关,但在最近的全基因组关联研究(GWAS)荟萃分析中具有统计学意义的关联(p(GWAS) < 0.001)的基因座。即使在这些假定的基因座中,我们也注意到 CD4+效应记忆 T 细胞中特异表达的基因显著富集(p = 1.25×10(-4))。这些细胞类型是未来功能研究揭示风险等位基因在自身免疫中的作用的主要候选者。我们的方法适用于其他表型,如自身免疫性疾病以外的表型,这些表型中已经发现了许多基因座,并且有高质量的细胞类型特异性基因表达数据。