• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过密集连接卷积神经网络整合远端和近端信息来预测基因表达。

Integrating distal and proximal information to predict gene expression via a densely connected convolutional neural network.

机构信息

MOE Key Laboratory of Bioinformatics, Beijing National Research Center for Information Science and Technology, Department of Automation, Tsinghua University, Beijing 100084, China.

CEMS, NCMIS, MDIS, Academy of Mathematics and Systems Science, National Center for Mathematics and Interdisciplinary Sciences, Chinese Academy of Sciences, Beijing 100080, China.

出版信息

Bioinformatics. 2020 Jan 15;36(2):496-503. doi: 10.1093/bioinformatics/btz562.

DOI:10.1093/bioinformatics/btz562
PMID:31318408
Abstract

MOTIVATION

Interactions among cis-regulatory elements such as enhancers and promoters are main driving forces shaping context-specific chromatin structure and gene expression. Although there have been computational methods for predicting gene expression from genomic and epigenomic information, most of them neglect long-range enhancer-promoter interactions, due to the difficulty in precisely linking regulatory enhancers to target genes. Recently, HiChIP, a novel high-throughput experimental approach, has generated comprehensive data on high-resolution interactions between promoters and distal enhancers. Moreover, plenty of studies suggest that deep learning achieves state-of-the-art performance in epigenomic signal prediction, and thus promoting the understanding of regulatory elements. In consideration of these two factors, we integrate proximal promoter sequences and HiChIP distal enhancer-promoter interactions to accurately predict gene expression.

RESULTS

We propose DeepExpression, a densely connected convolutional neural network, to predict gene expression using both promoter sequences and enhancer-promoter interactions. We demonstrate that our model consistently outperforms baseline methods, not only in the classification of binary gene expression status but also in regression of continuous gene expression levels, in both cross-validation experiments and cross-cell line predictions. We show that the sequential promoter information is more informative than the experimental enhancer information; meanwhile, the enhancer-promoter interactions within ±100 kbp around the TSS of a gene are most beneficial. We finally visualize motifs in both promoter and enhancer regions and show the match of identified sequence signatures with known motifs. We expect to see a wide spectrum of applications using HiChIP data in deciphering the mechanism of gene regulation.

AVAILABILITY AND IMPLEMENTATION

DeepExpression is freely available at https://github.com/wanwenzeng/DeepExpression.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

顺式调控元件(如增强子和启动子)之间的相互作用是塑造特定于上下文的染色质结构和基因表达的主要驱动力。尽管已经有一些从基因组和表观基因组信息预测基因表达的计算方法,但由于难以精确地将调节增强子与靶基因联系起来,大多数方法都忽略了长程增强子-启动子相互作用。最近,一种新型高通量实验方法 HiChIP,已经生成了关于启动子和远端增强子之间高分辨率相互作用的全面数据。此外,大量研究表明,深度学习在表观基因组信号预测方面达到了最先进的性能,从而促进了对调节元件的理解。考虑到这两个因素,我们整合了近端启动子序列和 HiChIP 远端增强子-启动子相互作用,以准确预测基因表达。

结果

我们提出了 DeepExpression,这是一种密集连接的卷积神经网络,用于使用启动子序列和增强子-启动子相互作用来预测基因表达。我们证明,我们的模型不仅在二元基因表达状态的分类中,而且在连续基因表达水平的回归中,都始终优于基线方法,无论是在交叉验证实验还是跨细胞系预测中。我们表明,序列启动子信息比实验增强子信息更具信息量;同时,在基因 TSS 周围 ±100 kbp 内的增强子-启动子相互作用最有益。我们最后在启动子和增强子区域可视化了基序,并显示了识别的序列特征与已知基序的匹配。我们希望看到使用 HiChIP 数据在破译基因调控机制方面的广泛应用。

可用性和实现

DeepExpression 可在 https://github.com/wanwenzeng/DeepExpression 上免费获得。

补充信息

补充数据可在生物信息学在线获得。

相似文献

1
Integrating distal and proximal information to predict gene expression via a densely connected convolutional neural network.通过密集连接卷积神经网络整合远端和近端信息来预测基因表达。
Bioinformatics. 2020 Jan 15;36(2):496-503. doi: 10.1093/bioinformatics/btz562.
2
DeepCAPE: A Deep Convolutional Neural Network for the Accurate Prediction of Enhancers.深度CAPE:用于准确预测增强子的深度卷积神经网络
Genomics Proteomics Bioinformatics. 2021 Aug;19(4):565-577. doi: 10.1016/j.gpb.2019.04.006. Epub 2021 Feb 11.
3
Chromatin accessibility prediction via a hybrid deep convolutional neural network.基于混合深度卷积神经网络的染色质可及性预测。
Bioinformatics. 2018 Mar 1;34(5):732-738. doi: 10.1093/bioinformatics/btx679.
4
EPIHC: Improving Enhancer-Promoter Interaction Prediction by Using Hybrid Features and Communicative Learning.EPIHC:利用混合特征和交流学习改进增强子-启动子相互作用预测
IEEE/ACM Trans Comput Biol Bioinform. 2022 Nov-Dec;19(6):3435-3443. doi: 10.1109/TCBB.2021.3109488. Epub 2022 Dec 8.
5
Opening up the blackbox: an interpretable deep neural network-based classifier for cell-type specific enhancer predictions.打开黑箱:一种基于可解释深度神经网络的细胞类型特异性增强子预测分类器。
BMC Syst Biol. 2016 Aug 1;10 Suppl 2(Suppl 2):54. doi: 10.1186/s12918-016-0302-3.
6
Local Epigenomic Data are more Informative than Local Genome Sequence Data in Predicting Enhancer-Promoter Interactions Using Neural Networks.利用神经网络进行增强子-启动子相互作用预测时,局部表观基因组数据比局部基因组序列数据更具信息量。
Genes (Basel). 2019 Dec 29;11(1):41. doi: 10.3390/genes11010041.
7
Exploiting sequence-based features for predicting enhancer-promoter interactions.利用基于序列的特征预测增强子-启动子相互作用。
Bioinformatics. 2017 Jul 15;33(14):i252-i260. doi: 10.1093/bioinformatics/btx257.
8
A simple convolutional neural network for prediction of enhancer-promoter interactions with DNA sequence data.基于 DNA 序列数据的增强子-启动子相互作用预测的简单卷积神经网络。
Bioinformatics. 2019 Sep 1;35(17):2899-2906. doi: 10.1093/bioinformatics/bty1050.
9
AIKYATAN: mapping distal regulatory elements using convolutional learning on GPU.AIKYATAN:使用 GPU 上的卷积学习进行远端调控元件的作图。
BMC Bioinformatics. 2019 Oct 7;20(1):488. doi: 10.1186/s12859-019-3049-1.
10
DeepPHiC: predicting promoter-centered chromatin interactions using a novel deep learning approach.DeepPHiC:使用新型深度学习方法预测以启动子为中心的染色质相互作用。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac801.

引用本文的文献

1
Single-cell analysis of chromatin and expression reveals age- and sex-associated alterations in the human heart.单细胞分析染色质和表达揭示了人类心脏中与年龄和性别相关的改变。
Commun Biol. 2024 Aug 26;7(1):1052. doi: 10.1038/s42003-024-06582-y.
2
A computational approach for deciphering the interactions between proximal and distal gene regulators in GC B-cell response.一种用于解析生发中心B细胞应答中近端和远端基因调控因子之间相互作用的计算方法。
NAR Genom Bioinform. 2024 May 6;6(2):lqae050. doi: 10.1093/nargab/lqae050. eCollection 2024 Jun.
3
A neural network model to screen feature genes for pancreatic cancer.
用于筛选胰腺癌特征基因的神经网络模型。
BMC Bioinformatics. 2023 May 11;24(1):193. doi: 10.1186/s12859-023-05322-z.
4
A survey on algorithms to characterize transcription factor binding sites.一种用于刻画转录因子结合位点的算法研究综述。
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad156.
5
Predicting gene expression from histone modifications with self-attention based neural networks and transfer learning.基于自注意力神经网络和迁移学习从组蛋白修饰预测基因表达
Front Genet. 2022 Dec 14;13:1081842. doi: 10.3389/fgene.2022.1081842. eCollection 2022.
6
Deciphering the impact of genetic variation on human polyadenylation using APARENT2.利用 APARENT2 破译遗传变异对人类多聚腺苷酸化的影响。
Genome Biol. 2022 Nov 5;23(1):232. doi: 10.1186/s13059-022-02799-4.
7
HiChIPdb: a comprehensive database of HiChIP regulatory interactions.HiChIPdb:一个综合的 HiChIP 调控相互作用数据库。
Nucleic Acids Res. 2023 Jan 6;51(D1):D159-D166. doi: 10.1093/nar/gkac859.
8
Interpreting Neural Networks for Biological Sequences by Learning Stochastic Masks.通过学习随机掩码来解释生物序列的神经网络。
Nat Mach Intell. 2022 Jan;4(1):41-54. doi: 10.1038/s42256-021-00428-6. Epub 2022 Jan 25.
9
Assessing comparative importance of DNA sequence and epigenetic modifications on gene expression using a deep convolutional neural network.使用深度卷积神经网络评估DNA序列和表观遗传修饰对基因表达的相对重要性。
Comput Struct Biotechnol J. 2022 Jul 13;20:3814-3823. doi: 10.1016/j.csbj.2022.07.014. eCollection 2022.
10
A review of deep learning applications in human genomics using next-generation sequencing data.深度学习在人类基因组学中应用的研究进展:利用下一代测序数据
Hum Genomics. 2022 Jul 25;16(1):26. doi: 10.1186/s40246-022-00396-x.