Department of Electrical Engineering and Computer Science, Case Western Reserve University, 10900 Euclid Ave, Cleveland, OH, 44106, USA.
Department of Electrical Engineering and Computer Science, Center for Proteomics and Bioinformatics, Case Western Reserve University, 10900 Euclid Ave, Cleveland, OH, 44106, USA.
BMC Bioinformatics. 2019 Jun 20;20(Suppl 12):320. doi: 10.1186/s12859-019-2835-0.
As Genome-Wide Association Studies (GWAS) have been increasingly used with data from various populations, it has been observed that data from different populations reveal different sets of Single Nucleotide Polymorphisms (SNPs) that are associated with the same disease. Using Type II Diabetes (T2D) as a test case, we develop measures and methods to characterize the functional overlap of SNPs associated with the same disease across populations.
We introduce the notion of an Overlap Matrix as a general means of characterizing the functional overlap between different SNP sets at different genomic and functional granularities. Using SNP-to-gene mapping, functional annotation databases, and functional association networks, we assess the degree of functional overlap across nine populations from Asian and European ethnic origins. We further assess the generalizability of the method by applying it to a dataset for another complex disease - Prostate Cancer. Our results show that more overlap is captured as more functional data is incorporated as we go through the pipeline, starting from SNPs and ending at network overlap analyses. We hypothesize that these observed differences in the functional mechanisms of T2D across populations can also explain the common use of different prescription drugs in different populations. We show that this hypothesis is concordant with the literature on the functional mechanisms of prescription drugs.
Our results show that although the etiology of a complex disease can be associated with distinct processes that are affected in different populations, network-based annotations can capture more functional overlap across populations. These results support the notion that it can be useful to take ethnicity into account in making personalized treatment decisions for complex diseases.
随着全基因组关联研究(GWAS)越来越多地应用于来自不同人群的数据,人们观察到来自不同人群的数据揭示了与同一疾病相关的不同单核苷酸多态性(SNP)集。以 2 型糖尿病(T2D)为例,我们开发了用于描述跨人群与同一疾病相关的 SNP 功能重叠的度量和方法。
我们提出了重叠矩阵的概念,作为在不同基因组和功能粒度下描述不同 SNP 集之间功能重叠的一般方法。我们使用 SNP 到基因映射、功能注释数据库和功能关联网络,评估了来自亚洲和欧洲血统的九个人群中不同 SNP 集之间的功能重叠程度。我们通过将其应用于另一种复杂疾病——前列腺癌的数据集来进一步评估该方法的通用性。我们的结果表明,随着功能数据的不断纳入,从 SNP 到网络重叠分析,捕获的重叠程度越来越高。我们假设,这些在不同人群中 T2D 的功能机制中的差异也可以解释不同人群中使用不同处方药的常见现象。我们表明,这一假设与处方药功能机制的文献是一致的。
尽管复杂疾病的病因可能与不同人群中受影响的不同过程有关,但基于网络的注释可以在人群之间捕获更多的功能重叠。这些结果支持这样一种观点,即考虑种族因素对于复杂疾病的个体化治疗决策可能是有用的。