Institute for Computational Biomedicine, Weill Cornell Medicine, New York, New York, United States of America.
Department of Physiology and Biophysics, Weill Cornell Medicine, New York, New York, United States of America.
PLoS Genet. 2020 Apr 3;16(4):e1008663. doi: 10.1371/journal.pgen.1008663. eCollection 2020 Apr.
Previous studies have surveyed the potential impact of loss-of-function (LoF) variants and identified LoF-tolerant protein-coding genes. However, the tolerance of human genomes to losing enhancers has not yet been evaluated. Here we present the catalog of LoF-tolerant enhancers using structural variants from whole-genome sequences. Using a conservative approach, we estimate that individual human genomes possess at least 28 LoF-tolerant enhancers on average. We assessed the properties of LoF-tolerant enhancers in a unified regulatory network constructed by integrating tissue-specific enhancers and gene-gene interactions. We find that LoF-tolerant enhancers tend to be more tissue-specific and regulate fewer and more dispensable genes relative to other enhancers. They are enriched in immune-related cells while enhancers with low LoF-tolerance are enriched in kidney and brain/neuronal stem cells. We developed a supervised learning approach to predict the LoF-tolerance of all enhancers, which achieved an area under the receiver operating characteristics curve (AUROC) of 98%. We predict 3,519 more enhancers would be likely tolerant to LoF and 129 enhancers that would have low LoF-tolerance. Our predictions are supported by a known set of disease enhancers and novel deletions from PacBio sequencing. The LoF-tolerance scores provided here will serve as an important reference for disease studies.
先前的研究调查了功能丧失 (LoF) 变异体的潜在影响,并确定了 LoF 耐受的蛋白质编码基因。然而,人类基因组对失去增强子的耐受性尚未得到评估。在这里,我们使用全基因组序列中的结构变体呈现了 LoF 耐受增强子目录。使用保守的方法,我们估计单个人类基因组平均至少具有 28 个 LoF 耐受增强子。我们在通过整合组织特异性增强子和基因-基因相互作用构建的统一调控网络中评估了 LoF 耐受增强子的特性。我们发现,与其他增强子相比,LoF 耐受增强子往往更具组织特异性,调控的基因更少且更不重要。它们在免疫相关细胞中富集,而 LoF 耐受性低的增强子在肾脏和脑/神经元干细胞中富集。我们开发了一种监督学习方法来预测所有增强子的 LoF 耐受性,其接收者操作特征曲线 (AUROC) 的面积为 98%。我们预测有 3519 个增强子可能耐受 LoF,而 129 个增强子 LoF 耐受性低。我们的预测得到了一组已知的疾病增强子和来自 PacBio 测序的新型缺失的支持。这里提供的 LoF 耐受性评分将作为疾病研究的重要参考。