Department of Mathematical Sciences, Michigan Technological University, Houghton, Michigan, United States of America.
PLoS Genet. 2024 May 10;20(5):e1011245. doi: 10.1371/journal.pgen.1011245. eCollection 2024 May.
Joint analysis of multiple correlated phenotypes for genome-wide association studies (GWAS) can identify and interpret pleiotropic loci which are essential to understand pleiotropy in diseases and complex traits. Meanwhile, constructing a network based on associations between phenotypes and genotypes provides a new insight to analyze multiple phenotypes, which can explore whether phenotypes and genotypes might be related to each other at a higher level of cellular and organismal organization. In this paper, we first develop a bipartite signed network by linking phenotypes and genotypes into a Genotype and Phenotype Network (GPN). The GPN can be constructed by a mixture of quantitative and qualitative phenotypes and is applicable to binary phenotypes with extremely unbalanced case-control ratios in large-scale biobank datasets. We then apply a powerful community detection method to partition phenotypes into disjoint network modules based on GPN. Finally, we jointly test the association between multiple phenotypes in a network module and a single nucleotide polymorphism (SNP). Simulations and analyses of 72 complex traits in the UK Biobank show that multiple phenotype association tests based on network modules detected by GPN are much more powerful than those without considering network modules. The newly proposed GPN provides a new insight to investigate the genetic architecture among different types of phenotypes. Multiple phenotypes association studies based on GPN are improved by incorporating the genetic information into the phenotype clustering. Notably, it might broaden the understanding of genetic architecture that exists between diagnoses, genes, and pleiotropy.
联合分析多个相关表型的全基因组关联研究(GWAS)可以识别和解释多效性位点,这对于理解疾病和复杂性状中的多效性至关重要。同时,基于表型和基因型之间的关联构建网络为分析多个表型提供了新的视角,可以探索表型和基因型是否在更高的细胞和生物体组织水平上相关。在本文中,我们首先通过将表型和基因型链接到一个基因型和表型网络(GPN)中来构建一个二分有向网络。GPN 可以通过混合定量和定性表型构建,适用于大规模生物库数据集中具有极不平衡病例对照比的二项表型。然后,我们应用一种强大的社区检测方法,根据 GPN 将表型划分为不相交的网络模块。最后,我们联合测试网络模块中多个表型与单个核苷酸多态性(SNP)之间的关联。对英国生物库中的 72 种复杂性状的模拟和分析表明,基于 GPN 检测到的网络模块的多表型关联检验比不考虑网络模块的检验更有效。新提出的 GPN 为研究不同类型表型之间的遗传结构提供了新的视角。通过将遗传信息纳入表型聚类,基于 GPN 的多表型关联研究得到了改进。值得注意的是,它可能拓宽了对诊断、基因和多效性之间存在的遗传结构的理解。