Wang Juehan, Gazal Steven
Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA.
Center for Genetic Epidemiology, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA.
medRxiv. 2023 Oct 21:2023.10.20.23297214. doi: 10.1101/2023.10.20.23297214.
Multi-ancestry genome-wide association studies (GWAS) have highlighted the existence of variants with ancestry-specific effect sizes. Understanding where and why these ancestry-specific effects occur is fundamental to understanding the genetic basis of human diseases and complex traits. Here, we characterized genes differentially expressed across ancestries (ancDE genes) at the cell-type level by leveraging single-cell RNA-seq data in peripheral blood mononuclear cells for 21 individuals with East Asian (EAS) ancestry and 23 individuals with European (EUR) ancestry (172K cells); then, we tested if variants surrounding those genes were enriched in disease variants with ancestry-specific effect sizes by leveraging ancestry-matched GWAS of 31 diseases and complex traits (average = 90K and 267K in EAS and EUR, respectively). We observed that ancDE genes tend to be cell-type-specific, to be enriched in genes interacting with the environment, and in variants with ancestry-specific disease effect sizes, suggesting the impact of shared cell-type-specific gene-by-environment (GxE) interactions between regulatory and disease architectures. Finally, we illustrated how GxE interactions might have led to ancestry-specific expression in B cells, and ancestry-specific allele effect sizes in lymphocyte count GWAS for variants surrounding . Our results imply that large single-cell and GWAS datasets in diverse populations are required to improve our understanding on the effect of genetic variants on human diseases.
多祖先全基因组关联研究(GWAS)突出了具有特定祖先效应大小的变异的存在。了解这些特定祖先效应发生的位置和原因对于理解人类疾病和复杂性状的遗传基础至关重要。在这里,我们通过利用21名东亚(EAS)血统个体和23名欧洲(EUR)血统个体(172K个细胞)外周血单核细胞中的单细胞RNA测序数据,在细胞类型水平上对跨祖先差异表达的基因(ancDE基因)进行了表征;然后,我们通过利用31种疾病和复杂性状的祖先匹配GWAS(EAS和EUR中分别平均为90K和267K),测试了这些基因周围的变异是否在具有特定祖先效应大小的疾病变异中富集。我们观察到ancDE基因倾向于具有细胞类型特异性,在与环境相互作用的基因以及具有特定祖先疾病效应大小的变异中富集,这表明调控结构和疾病结构之间共享的细胞类型特异性基因-环境(GxE)相互作用的影响。最后,我们说明了GxE相互作用可能如何导致B细胞中的特定祖先表达,以及围绕的变异在淋巴细胞计数GWAS中的特定祖先等位基因效应大小。我们的结果表明,需要不同人群中的大型单细胞和GWAS数据集来增进我们对遗传变异对人类疾病影响的理解。