• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一个可推广的框架,全面预测表观基因组、染色质组织和转录组。

A generalizable framework to comprehensively predict epigenome, chromatin organization, and transcriptome.

机构信息

Department of Computational Medicine and Bioinformatics, University of Michigan, 500 S. State St, Ann Arbor, MI 48109, USA.

Department of Computer Science and Engineering, University of Michigan, 500 S. State St, Ann Arbor, MI 48109, USA.

出版信息

Nucleic Acids Res. 2023 Jul 7;51(12):5931-5947. doi: 10.1093/nar/gkad436.

DOI:10.1093/nar/gkad436
PMID:37224527
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10325920/
Abstract

Many deep learning approaches have been proposed to predict epigenetic profiles, chromatin organization, and transcription activity. While these approaches achieve satisfactory performance in predicting one modality from another, the learned representations are not generalizable across predictive tasks or across cell types. In this paper, we propose a deep learning approach named EPCOT which employs a pre-training and fine-tuning framework, and is able to accurately and comprehensively predict multiple modalities including epigenome, chromatin organization, transcriptome, and enhancer activity for new cell types, by only requiring cell-type specific chromatin accessibility profiles. Many of these predicted modalities, such as Micro-C and ChIA-PET, are quite expensive to get in practice, and the in silico prediction from EPCOT should be quite helpful. Furthermore, this pre-training and fine-tuning framework allows EPCOT to identify generic representations generalizable across different predictive tasks. Interpreting EPCOT models also provides biological insights including mapping between different genomic modalities, identifying TF sequence binding patterns, and analyzing cell-type specific TF impacts on enhancer activity.

摘要

许多深度学习方法已经被提出用于预测表观基因组图谱、染色质组织和转录活性。虽然这些方法在从一种模态预测另一种模态方面取得了令人满意的性能,但学习到的表示形式并不能在不同的预测任务或不同的细胞类型之间通用。在本文中,我们提出了一种名为 EPCOT 的深度学习方法,它采用了预训练和微调框架,仅需要细胞类型特异性染色质可及性图谱,就能够准确全面地预测多种模态,包括表观基因组、染色质组织、转录组和增强子活性,对于新的细胞类型也是如此。其中许多预测模态,如 Micro-C 和 ChIA-PET,在实践中都非常昂贵,而 EPCOT 的计算预测应该非常有帮助。此外,这种预训练和微调框架允许 EPCOT 识别可在不同预测任务中通用的通用表示。解释 EPCOT 模型还提供了生物学见解,包括不同基因组模态之间的映射、识别 TF 序列结合模式,以及分析细胞类型特异性 TF 对增强子活性的影响。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d2d/10325920/0f861303c6c8/gkad436fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d2d/10325920/c1a5a1096407/gkad436figgra1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d2d/10325920/153a5bdbd277/gkad436fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d2d/10325920/ed337529067c/gkad436fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d2d/10325920/58ea9f93c789/gkad436fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d2d/10325920/9ff56c77d8c5/gkad436fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d2d/10325920/5cbb1793c390/gkad436fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d2d/10325920/0f861303c6c8/gkad436fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d2d/10325920/c1a5a1096407/gkad436figgra1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d2d/10325920/153a5bdbd277/gkad436fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d2d/10325920/ed337529067c/gkad436fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d2d/10325920/58ea9f93c789/gkad436fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d2d/10325920/9ff56c77d8c5/gkad436fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d2d/10325920/5cbb1793c390/gkad436fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d2d/10325920/0f861303c6c8/gkad436fig6.jpg

相似文献

1
A generalizable framework to comprehensively predict epigenome, chromatin organization, and transcriptome.一个可推广的框架,全面预测表观基因组、染色质组织和转录组。
Nucleic Acids Res. 2023 Jul 7;51(12):5931-5947. doi: 10.1093/nar/gkad436.
2
ProbC: joint modeling of epigenome and transcriptome effects in 3D genome.ProBC:3D 基因组中表观基因组和转录组效应的联合建模。
BMC Genomics. 2022 Apr 9;23(1):287. doi: 10.1186/s12864-022-08498-5.
3
A multi-modal transformer for cell type-agnostic regulatory predictions.一种用于细胞类型无关调节预测的多模态变压器。
Cell Genom. 2025 Feb 12;5(2):100762. doi: 10.1016/j.xgen.2025.100762. Epub 2025 Jan 29.
4
Graph embedding and unsupervised learning predict genomic sub-compartments from HiC chromatin interaction data.图嵌入和无监督学习可从 HiC 染色质相互作用数据预测基因组亚区室。
Nat Commun. 2020 Mar 3;11(1):1173. doi: 10.1038/s41467-020-14974-x.
5
Multi-omic single-cell velocity models epigenome-transcriptome interactions and improves cell fate prediction.多组学单细胞速度模型揭示了表观基因组-转录组相互作用,并提高了细胞命运预测。
Nat Biotechnol. 2023 Mar;41(3):387-398. doi: 10.1038/s41587-022-01476-y. Epub 2022 Oct 13.
6
Predicting cell type-specific epigenomic profiles accounting for distal genetic effects.预测细胞类型特异性表观基因组图谱,同时考虑远端遗传效应。
Nat Commun. 2024 Nov 16;15(1):9951. doi: 10.1038/s41467-024-54441-5.
7
A computational approach for the functional classification of the epigenome.一种用于表观基因组功能分类的计算方法。
Epigenetics Chromatin. 2017 May 15;10:26. doi: 10.1186/s13072-017-0131-7. eCollection 2017.
8
Assessing the model transferability for prediction of transcription factor binding sites based on chromatin accessibility.基于染色质可及性评估预测转录因子结合位点的模型可转移性。
BMC Bioinformatics. 2017 Jul 27;18(1):355. doi: 10.1186/s12859-017-1769-7.
9
Genome-wide chromatin accessibility and transcriptome profiling show minimal epigenome changes and coordinated transcriptional dysregulation of hedgehog signaling in Danforth's short tail mice.全基因组染色质可及性和转录组谱分析显示,Danforth 短尾鼠 Hedgehog 信号的表观基因组变化极小,转录调控失调协调。
Hum Mol Genet. 2019 Mar 1;28(5):736-750. doi: 10.1093/hmg/ddy378.
10
Prediction of Enhancer-Gene Interactions Using Chromatin-Conformation Capture and Epigenome Data Using STARE.使用 STARE 进行染色质构象捕获和表观基因组数据预测增强子-基因相互作用
Methods Mol Biol. 2025;2856:327-339. doi: 10.1007/978-1-0716-4136-1_20.

引用本文的文献

1
Developing a general AI model for integrating diverse genomic modalities and comprehensive genomic knowledge.开发一个用于整合多种基因组模式和全面基因组知识的通用人工智能模型。
bioRxiv. 2025 May 14:2025.05.08.652986. doi: 10.1101/2025.05.08.652986.
2
ChromoGen: Diffusion model predicts single-cell chromatin conformations.染色体生成器:扩散模型预测单细胞染色质构象。
Sci Adv. 2025 Jan 31;11(5):eadr8265. doi: 10.1126/sciadv.adr8265.
3
Recipes and ingredients for deep learning models of 3D genome folding.三维基因组折叠深度学习模型的方法和要素

本文引用的文献

1
Epiphany: predicting Hi-C contact maps from 1D epigenomic signals.顿悟:从一维表观基因组信号预测 Hi-C 接触图谱。
Genome Biol. 2023 Jun 6;24(1):134. doi: 10.1186/s13059-023-02934-9.
2
Region Capture Micro-C reveals coalescence of enhancers and promoters into nested microcompartments.区域捕获微区揭示了增强子和启动子的合并成嵌套的微区。
Nat Genet. 2023 Jun;55(6):1048-1056. doi: 10.1038/s41588-023-01391-1. Epub 2023 May 8.
3
The ENCODE Imputation Challenge: a critical assessment of methods for cross-cell type imputation of epigenomic profiles.
Curr Opin Genet Dev. 2025 Apr;91:102308. doi: 10.1016/j.gde.2024.102308. Epub 2025 Jan 24.
4
A review of deep learning models for the prediction of chromatin interactions with DNA and epigenomic profiles.用于预测染色质与DNA相互作用及表观基因组图谱的深度学习模型综述。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae651.
5
ChromaFold predicts the 3D contact map from single-cell chromatin accessibility.ChromaFold 可从单细胞染色质可及性预测 3D 接触图谱。
Nat Commun. 2024 Nov 1;15(1):9432. doi: 10.1038/s41467-024-53628-0.
6
Context transcription factors establish cooperative environments and mediate enhancer communication.上下文转录因子建立合作环境并介导增强子通讯。
Nat Genet. 2024 Oct;56(10):2199-2212. doi: 10.1038/s41588-024-01892-7. Epub 2024 Oct 3.
7
dHICA: a deep transformer-based model enables accurate histone imputation from chromatin accessibility.dHICA:一种基于深度Transformer 的模型,可从染色质可及性中实现精确的组蛋白推断。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae459.
8
Machine and Deep Learning Methods for Predicting 3D Genome Organization.机器和深度学习方法预测三维基因组结构。
Methods Mol Biol. 2025;2856:357-400. doi: 10.1007/978-1-0716-4136-1_22.
9
EPInformer: a scalable deep learning framework for gene expression prediction by integrating promoter-enhancer sequences with multimodal epigenomic data.EPInformer:一种通过整合启动子-增强子序列与多组学表观基因组数据进行基因表达预测的可扩展深度学习框架。
bioRxiv. 2024 Aug 1:2024.08.01.606099. doi: 10.1101/2024.08.01.606099.
10
Machine and deep learning methods for predicting 3D genome organization.用于预测三维基因组组织的机器学习和深度学习方法。
ArXiv. 2024 Mar 4:arXiv:2403.03231v1.
ENCODE 插补挑战:对跨细胞类型表观基因组谱插补方法的批判性评估。
Genome Biol. 2023 Apr 18;24(1):79. doi: 10.1186/s13059-023-02915-y.
4
maxATAC: Genome-scale transcription-factor binding prediction from ATAC-seq with deep neural networks.maxATAC:基于深度神经网络的 ATAC-seq 全基因组转录因子结合预测
PLoS Comput Biol. 2023 Jan 31;19(1):e1010863. doi: 10.1371/journal.pcbi.1010863. eCollection 2023 Jan.
5
A sequence-based global map of regulatory activity for deciphering human genetics.基于序列的人类遗传学解码调控活性的全局图谱。
Nat Genet. 2022 Jul;54(7):940-949. doi: 10.1038/s41588-022-01102-2. Epub 2022 Jul 11.
6
Sequence-based modeling of three-dimensional genome architecture from kilobase to chromosome scale.基于序列的从千碱基到染色体尺度的三维基因组结构建模。
Nat Genet. 2022 May;54(5):725-734. doi: 10.1038/s41588-022-01065-4. Epub 2022 May 12.
7
DeepSTARR predicts enhancer activity from DNA sequence and enables the de novo design of synthetic enhancers.DeepSTARR 可根据 DNA 序列预测增强子活性,并能够从头设计合成增强子。
Nat Genet. 2022 May;54(5):613-624. doi: 10.1038/s41588-022-01048-5. Epub 2022 May 12.
8
Connecting high-resolution 3D chromatin organization with epigenomics.连接高分辨率 3D 染色质构象与表观基因组学。
Nat Commun. 2022 Apr 19;13(1):2054. doi: 10.1038/s41467-022-29695-6.
9
Chromatin interaction-aware gene regulatory modeling with graph attention networks.基于图注意力网络的染色质相互作用感知基因调控建模。
Genome Res. 2022 May;32(5):930-944. doi: 10.1101/gr.275870.121. Epub 2022 Apr 8.
10
Integrating Long-Range Regulatory Interactions to Predict Gene Expression Using Graph Convolutional Networks.基于图卷积网络整合长程调控相互作用以预测基因表达。
J Comput Biol. 2022 May;29(5):409-424. doi: 10.1089/cmb.2021.0316. Epub 2022 Mar 21.