Betti Michael J, Lin Phillip, Aldrich Melinda C, Gamazon Eric R
Department of Medicine, Division of Genetic Medicine, Vanderbilt University Medical Center, 2525 West End Avenue, Suite 700, Nashville, TN, 37203, USA.
Clare Hall, University of Cambridge, Herschel Rd, Cambridge, CB3 9AL, UK.
Nat Commun. 2025 Apr 3;16(1):3193. doi: 10.1038/s41467-025-58023-x.
The biological functions of extragenic enhancer RNAs and their impact on disease risk remain relatively underexplored. In this work, we develop in silico models of genetically regulated expression of enhancer RNAs across 49 cell and tissue types, characterizing their degree of genetic control. Leveraging the estimated genetically regulated expression for enhancer RNAs and canonical genes in a large-scale DNA biobank (N > 70,000) and high-resolution Hi-C contact data, we train a deep learning-based model of pairwise three-dimensional chromatin contact frequency for enhancer-enhancer and enhancer-gene pairs in cerebellum and whole blood. Notably, the use of genetically regulated expression of enhancer RNAs provides substantial tissue-specific predictive power, supporting a role for these transcripts in modulating spatial chromatin organization. We identify schizophrenia-associated enhancer RNAs independent of GWAS loci using enhancer RNA-based TWAS and determine the causal effects of these enhancer RNAs using Mendelian randomization. Using enhancer RNA-based TWAS, we generate a comprehensive resource of tissue-specific enhancer associations with complex traits in the UK Biobank. Finally, we show that a substantially greater proportion (63%) of GWAS associations colocalize with causal regulatory variation when enhancer RNAs are included.
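As a rough illustration of the TWAS idea mentioned in the abstract, the sketch below predicts genetically regulated expression as a weighted sum of genotype dosages and then tests it for association with a trait. It is not the authors' model: the SNP weights, sample sizes, effect size, and the function names `predict_grex` and `association_z` are all hypothetical, and the data are simulated.

```python
import numpy as np

def predict_grex(genotypes, weights):
    """Predicted genetically regulated expression: dosage matrix times SNP weights."""
    return genotypes @ weights

def association_z(grex, trait):
    """Slope and t-statistic for the simple linear regression trait ~ grex."""
    x = grex - grex.mean()
    y = trait - trait.mean()
    beta = (x @ y) / (x @ x)
    resid = y - beta * x
    # Standard error of the slope with n - 2 residual degrees of freedom.
    se = np.sqrt((resid @ resid) / (len(x) - 2) / (x @ x))
    return beta, beta / se

# Simulated data: all values below are assumptions for illustration only.
rng = np.random.default_rng(0)
n_samples, n_snps = 500, 5
genotypes = rng.integers(0, 3, size=(n_samples, n_snps)).astype(float)  # dosages 0/1/2
weights = np.array([0.3, -0.2, 0.0, 0.5, 0.1])  # hypothetical eRNA prediction weights

grex = predict_grex(genotypes, weights)
trait = 0.4 * grex + rng.normal(size=n_samples)  # simulated trait with a true effect

beta, z = association_z(grex, trait)
print(f"effect estimate: {beta:.3f}, z-score: {z:.2f}")
```

In practice the weights would come from a model trained on a reference panel with matched genotype and enhancer RNA expression data, and the association would be run per tissue and per transcript with multiple-testing correction.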