Suppr超能文献

基于图注意力网络的染色质接触预测基因共表达。

Prediction of gene co-expression from chromatin contacts with graph attention network.

机构信息

School of Information Science and Technology, ShanghaiTech University, Shanghai 201210, China.

Shanghai Institute of Microsystem and Information Technology, Chinese Academy of Sciences, Shanghai 200050, China.

出版信息

Bioinformatics. 2022 Sep 30;38(19):4457-4465. doi: 10.1093/bioinformatics/btac535.

Abstract

MOTIVATION

The technology of high-throughput chromatin conformation capture (Hi-C) allows genome-wide measurement of chromatin interactions. Several studies have shown statistically significant relationships between gene-gene spatial contacts and their co-expression. It is desirable to uncover epigenetic mechanisms of transcriptional regulation behind such relationships using computational modeling. Existing methods for predicting gene co-expression from Hi-C data use manual feature engineering or unsupervised learning, which either limits the prediction accuracy or lacks interpretability.

RESULTS

To address these issues, we propose HiCoEx (Hi-C predicts gene co-expression), a novel end-to-end framework for explainable prediction of gene co-expression from Hi-C data based on graph neural network. We apply graph attention mechanism to a gene contact network inferred from Hi-C data to distinguish the importance among different neighboring genes of each gene, and learn the gene representation to predict co-expression in a supervised and task-specific manner. Then, from the trained model, we extract the learned gene embeddings as a model interpretation to distill biological insights. Experimental results show that HiCoEx can learn gene representation from 3D genomics signals automatically to improve prediction accuracy, and make the black box model explainable by capturing some biologically meaningful patterns, e.g., in a gene contact network, the common neighbors of two central genes might contribute to the co-expression of the two central genes through sharing enhancers.

AVAILABILITY AND IMPLEMENTATION

The source code is freely available at https://github.com/JieZheng-ShanghaiTech/HiCoEx.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

高通量染色质构象捕获(Hi-C)技术可实现全基因组染色质相互作用的测量。已有多项研究表明基因间空间接触与其共表达之间存在统计学显著关系。使用计算建模揭示这种关系背后的转录调控的表观遗传机制是很可取的。从 Hi-C 数据预测基因共表达的现有方法使用手动特征工程或无监督学习,这要么限制了预测准确性,要么缺乏可解释性。

结果

为了解决这些问题,我们提出了 HiCoEx(Hi-C 预测基因共表达),这是一种基于图神经网络的从 Hi-C 数据中进行可解释的基因共表达预测的端到端框架。我们将图注意机制应用于从 Hi-C 数据推断出的基因接触网络,以区分每个基因的不同相邻基因之间的重要性,并以监督和特定于任务的方式学习基因表示来预测共表达。然后,我们从训练好的模型中提取学习到的基因嵌入作为模型解释,以提取生物学见解。实验结果表明,HiCoEx 可以从 3D 基因组学信号中自动学习基因表示,以提高预测准确性,并通过捕获一些有意义的生物学模式来使黑盒模型具有可解释性,例如,在基因接触网络中,两个中心基因的共同邻居可能通过共享增强子而促进两个中心基因的共表达。

可用性和实现

源代码可在 https://github.com/JieZheng-ShanghaiTech/HiCoEx 上免费获得。

补充信息

补充数据可在生物信息学在线获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ab09/9525008/1d4cdc7f017a/btac535f1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验