• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

深度铬:用于从组蛋白修饰预测基因表达的深度学习

DeepChrome: deep-learning for predicting gene expression from histone modifications.

作者信息

Singh Ritambhara, Lanchantin Jack, Robins Gabriel, Qi Yanjun

机构信息

Department of Computer Science, University of Virginia, Charlottesville, VA 22904, USA.

出版信息

Bioinformatics. 2016 Sep 1;32(17):i639-i648. doi: 10.1093/bioinformatics/btw427.

DOI:10.1093/bioinformatics/btw427
PMID:27587684
Abstract

MOTIVATION

Histone modifications are among the most important factors that control gene regulation. Computational methods that predict gene expression from histone modification signals are highly desirable for understanding their combinatorial effects in gene regulation. This knowledge can help in developing 'epigenetic drugs' for diseases like cancer. Previous studies for quantifying the relationship between histone modifications and gene expression levels either failed to capture combinatorial effects or relied on multiple methods that separate predictions and combinatorial analysis. This paper develops a unified discriminative framework using a deep convolutional neural network to classify gene expression using histone modification data as input. Our system, called DeepChrome, allows automatic extraction of complex interactions among important features. To simultaneously visualize the combinatorial interactions among histone modifications, we propose a novel optimization-based technique that generates feature pattern maps from the learnt deep model. This provides an intuitive description of underlying epigenetic mechanisms that regulate genes.

RESULTS

We show that DeepChrome outperforms state-of-the-art models like Support Vector Machines and Random Forests for gene expression classification task on 56 different cell-types from REMC database. The output of our visualization technique not only validates the previous observations but also allows novel insights about combinatorial interactions among histone modification marks, some of which have recently been observed by experimental studies.

AVAILABILITY AND IMPLEMENTATION

Codes and results are available at www.deepchrome.org

CONTACT

yanjun@virginia.edu

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

组蛋白修饰是控制基因调控的最重要因素之一。从组蛋白修饰信号预测基因表达的计算方法对于理解它们在基因调控中的组合效应非常有必要。这些知识有助于开发针对癌症等疾病的“表观遗传药物”。先前用于量化组蛋白修饰与基因表达水平之间关系的研究要么未能捕捉到组合效应,要么依赖于多种将预测和组合分析分开的方法。本文开发了一个统一的判别框架,使用深度卷积神经网络以组蛋白修饰数据作为输入来对基因表达进行分类。我们的系统名为DeepChrome,能够自动提取重要特征之间的复杂相互作用。为了同时可视化组蛋白修饰之间的组合相互作用,我们提出了一种基于优化的新技术,该技术从学习到的深度模型生成特征模式图。这提供了对调控基因的潜在表观遗传机制的直观描述。

结果

我们表明,在来自REMC数据库的56种不同细胞类型上,对于基因表达分类任务,DeepChrome优于支持向量机和随机森林等现有模型。我们可视化技术的输出不仅验证了先前的观察结果,还提供了关于组蛋白修饰标记之间组合相互作用的新见解,其中一些最近已通过实验研究观察到。

可用性和实现

代码和结果可在www.deepchrome.org获取。

联系方式

yanjun@virginia.edu

补充信息

补充数据可在《生物信息学》在线获取。

相似文献

1
DeepChrome: deep-learning for predicting gene expression from histone modifications.深度铬:用于从组蛋白修饰预测基因表达的深度学习
Bioinformatics. 2016 Sep 1;32(17):i639-i648. doi: 10.1093/bioinformatics/btw427.
2
DeepDiff: DEEP-learning for predicting DIFFerential gene expression from histone modifications.DeepDiff:基于深度学习的组蛋白修饰差异基因表达预测方法。
Bioinformatics. 2018 Sep 1;34(17):i891-i900. doi: 10.1093/bioinformatics/bty612.
3
Finding combinatorial histone code by semi-supervised biclustering.通过半监督双聚类寻找组合组蛋白密码。
BMC Genomics. 2012 Jul 3;13:301. doi: 10.1186/1471-2164-13-301.
4
Opening up the blackbox: an interpretable deep neural network-based classifier for cell-type specific enhancer predictions.打开黑箱:一种基于可解释深度神经网络的细胞类型特异性增强子预测分类器。
BMC Syst Biol. 2016 Aug 1;10 Suppl 2(Suppl 2):54. doi: 10.1186/s12918-016-0302-3.
5
Deciphering histone code of transcriptional regulation in malaria parasites by large-scale data mining.通过大规模数据挖掘破译疟原虫转录调控的组蛋白密码
Comput Biol Chem. 2014 Jun;50:3-10. doi: 10.1016/j.compbiolchem.2014.01.002. Epub 2014 Jan 23.
6
CatLearning: highly accurate gene expression prediction from histone mark.CatLearning:基于组蛋白标记的高精度基因表达预测
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae373.
7
Enhancer prediction with histone modification marks using a hybrid neural network model.基于组蛋白修饰标记的增强子预测的混合神经网络模型。
Methods. 2019 Aug 15;166:48-56. doi: 10.1016/j.ymeth.2019.03.014. Epub 2019 Mar 21.
8
Accurate and highly interpretable prediction of gene expression from histone modifications.从组蛋白修饰中准确且高度可解释地预测基因表达。
BMC Bioinformatics. 2022 Apr 26;23(1):151. doi: 10.1186/s12859-022-04687-x.
9
Differential Gene Expression Prediction by Ensemble Deep Networks on Histone Modification Data.基于组蛋白修饰数据的集成深度网络进行差异基因表达预测
IEEE/ACM Trans Comput Biol Bioinform. 2023 Jan-Feb;20(1):340-351. doi: 10.1109/TCBB.2021.3139634. Epub 2023 Feb 3.
10
Combinations of Histone Modifications for Pattern Genes.模式基因的组蛋白修饰组合
Acta Biotheor. 2016 Jun;64(2):121-32. doi: 10.1007/s10441-016-9276-1. Epub 2016 Feb 5.

引用本文的文献

1
Energy entropy vector: a novel approach for efficient microbial genomic sequence analysis and classification.能量熵向量:一种用于高效微生物基因组序列分析和分类的新方法。
Brief Bioinform. 2025 Sep 6;26(5). doi: 10.1093/bib/bbaf459.
2
Towards universal modeling of transcript isoform expression levels.迈向转录本异构体表达水平的通用建模
bioRxiv. 2025 Jul 25:2025.07.21.665977. doi: 10.1101/2025.07.21.665977.
3
Machine learning on multiple epigenetic features reveals H3K27Ac as a driver of gene expression prediction across patients with glioblastoma.
基于多种表观遗传特征的机器学习揭示了H3K27Ac是胶质母细胞瘤患者基因表达预测的驱动因素。
PLoS Comput Biol. 2025 Aug 7;21(8):e1012272. doi: 10.1371/journal.pcbi.1012272. eCollection 2025 Aug.
4
Multi-omics based and AI-driven drug repositioning for epigenetic therapy in female malignancies.基于多组学和人工智能驱动的女性恶性肿瘤表观遗传治疗药物重新定位
J Transl Med. 2025 Jul 25;23(1):837. doi: 10.1186/s12967-025-06856-x.
5
Metabolic reprogramming and functional crosstalk within the tumor microenvironment (TME) and A Multi-omics anticancer approach.肿瘤微环境(TME)中的代谢重编程与功能串扰以及多组学抗癌方法。
Med Oncol. 2025 Jul 24;42(9):373. doi: 10.1007/s12032-025-02945-5.
6
Integrating Artificial Intelligence in Next-Generation Sequencing: Advances, Challenges, and Future Directions.将人工智能整合到下一代测序中:进展、挑战与未来方向。
Curr Issues Mol Biol. 2025 Jun 19;47(6):470. doi: 10.3390/cimb47060470.
7
GPS: Harnessing data fusion strategies to improve the accuracy of machine learning-based genomic and phenotypic selection.GPS:利用数据融合策略提高基于机器学习的基因组和表型选择的准确性。
Plant Commun. 2025 Aug 11;6(8):101416. doi: 10.1016/j.xplc.2025.101416. Epub 2025 Jun 11.
8
Predicting gene expression changes upon epigenomic drug treatment.预测表观基因组药物治疗后的基因表达变化。
F1000Res. 2025 May 2;12:1089. doi: 10.12688/f1000research.140273.3. eCollection 2023.
9
Quantifying the influence of genetic context on duplicated mammalian genes.量化基因背景对哺乳动物重复基因的影响。
bioRxiv. 2025 May 2:2025.04.03.647042. doi: 10.1101/2025.04.03.647042.
10
TimeFlies: an snRNA-seq aging clock for the fruit fly head sheds light on sex-biased aging.时光飞逝:一种用于果蝇头部的单链核糖核酸测序衰老时钟揭示了性别差异衰老现象。
bioRxiv. 2025 Jan 25:2024.11.25.625273. doi: 10.1101/2024.11.25.625273.