• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于图注意力网络的染色质相互作用感知基因调控建模。

Chromatin interaction-aware gene regulatory modeling with graph attention networks.

机构信息

Computational and Systems Biology Program, Memorial Sloan Kettering Cancer Center, New York, New York 10065, USA.

出版信息

Genome Res. 2022 May;32(5):930-944. doi: 10.1101/gr.275870.121. Epub 2022 Apr 8.

DOI:10.1101/gr.275870.121
PMID:35396274
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9104700/
Abstract

Linking distal enhancers to genes and modeling their impact on target gene expression are longstanding unresolved problems in regulatory genomics and critical for interpreting noncoding genetic variation. Here, we present a new deep learning approach called GraphReg that exploits 3D interactions from chromosome conformation capture assays to predict gene expression from 1D epigenomic data or genomic DNA sequence. By using graph attention networks to exploit the connectivity of distal elements up to 2 Mb away in the genome, GraphReg more faithfully models gene regulation and more accurately predicts gene expression levels than the state-of-the-art deep learning methods for this task. Feature attribution used with GraphReg accurately identifies functional enhancers of genes, as validated by CRISPRi-FlowFISH and TAP-seq assays, outperforming both convolutional neural networks (CNNs) and the recently proposed activity-by-contact model. Sequence-based GraphReg also accurately predicts direct transcription factor (TF) targets as validated by CRISPRi TF knockout experiments via in silico ablation of TF binding motifs. GraphReg therefore represents an important advance in modeling the regulatory impact of epigenomic and sequence elements.

摘要

将远端增强子与基因联系起来,并对其影响靶基因表达的方式进行建模,这是调控基因组学中长期未解决的问题,对于解释非编码遗传变异至关重要。在这里,我们提出了一种新的深度学习方法,称为 GraphReg,它利用来自染色体构象捕获测定的 3D 相互作用,从 1D 表观基因组数据或基因组 DNA 序列预测基因表达。通过使用图注意网络来利用基因组中可达 2Mb 远的远端元件的连通性,GraphReg 比该任务的最先进的深度学习方法更忠实地模拟基因调控,并更准确地预测基因表达水平。与 GraphReg 一起使用的特征归因方法可以准确识别基因的功能增强子,这通过 CRISPRi-FlowFISH 和 TAP-seq 测定得到了验证,优于卷积神经网络 (CNNs) 和最近提出的活性接触模型。基于序列的 GraphReg 还可以通过 CRISPRi TF 敲除实验通过在计算机上切除 TF 结合基序来准确预测直接转录因子 (TF) 靶标。因此,GraphReg 代表了在建模表观基因组和序列元件的调控影响方面的重要进展。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd32/9104700/693336cacc8a/930f04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd32/9104700/15ed5a5903a6/930f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd32/9104700/8aaf24fd50bc/930f02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd32/9104700/7eeecf078bb0/930f03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd32/9104700/693336cacc8a/930f04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd32/9104700/15ed5a5903a6/930f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd32/9104700/8aaf24fd50bc/930f02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd32/9104700/7eeecf078bb0/930f03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dd32/9104700/693336cacc8a/930f04.jpg

相似文献

1
Chromatin interaction-aware gene regulatory modeling with graph attention networks.基于图注意力网络的染色质相互作用感知基因调控建模。
Genome Res. 2022 May;32(5):930-944. doi: 10.1101/gr.275870.121. Epub 2022 Apr 8.
2
Discovering epistatic feature interactions from neural network models of regulatory DNA sequences.从调控 DNA 序列的神经网络模型中发现上位特征相互作用。
Bioinformatics. 2018 Sep 1;34(17):i629-i637. doi: 10.1093/bioinformatics/bty575.
3
DeFine: deep convolutional neural networks accurately quantify intensities of transcription factor-DNA binding and facilitate evaluation of functional non-coding variants.DeFine:深度卷积神经网络能够准确量化转录因子-DNA 结合强度,并有助于评估功能非编码变体。
Nucleic Acids Res. 2018 Jun 20;46(11):e69. doi: 10.1093/nar/gky215.
4
Integrating distal and proximal information to predict gene expression via a densely connected convolutional neural network.通过密集连接卷积神经网络整合远端和近端信息来预测基因表达。
Bioinformatics. 2020 Jan 15;36(2):496-503. doi: 10.1093/bioinformatics/btz562.
5
Learning and interpreting the gene regulatory grammar in a deep learning framework.在深度学习框架中学习和解释基因调控语法。
PLoS Comput Biol. 2020 Nov 2;16(11):e1008334. doi: 10.1371/journal.pcbi.1008334. eCollection 2020 Nov.
6
The functional consequences of variation in transcription factor binding.转录因子结合变异的功能后果。
PLoS Genet. 2014 Mar 6;10(3):e1004226. doi: 10.1371/journal.pgen.1004226. eCollection 2014 Mar.
7
Characterizing collaborative transcription regulation with a graph-based deep learning approach.基于图的深度学习方法对协同转录调控进行特征描述。
PLoS Comput Biol. 2022 Jun 6;18(6):e1010162. doi: 10.1371/journal.pcbi.1010162. eCollection 2022 Jun.
8
Opening up the blackbox: an interpretable deep neural network-based classifier for cell-type specific enhancer predictions.打开黑箱:一种基于可解释深度神经网络的细胞类型特异性增强子预测分类器。
BMC Syst Biol. 2016 Aug 1;10 Suppl 2(Suppl 2):54. doi: 10.1186/s12918-016-0302-3.
9
Enhancing the interpretability of transcription factor binding site prediction using attention mechanism.利用注意力机制提高转录因子结合位点预测的可解释性。
Sci Rep. 2020 Aug 7;10(1):13413. doi: 10.1038/s41598-020-70218-4.
10
Predicting Chromatin Interactions from DNA Sequence Using DeepC.使用DeepC从DNA序列预测染色质相互作用。
Methods Mol Biol. 2023;2624:19-42. doi: 10.1007/978-1-0716-2962-8_3.

引用本文的文献

1
Towards universal modeling of transcript isoform expression levels.迈向转录本异构体表达水平的通用建模
bioRxiv. 2025 Jul 25:2025.07.21.665977. doi: 10.1101/2025.07.21.665977.
2
Machine learning on multiple epigenetic features reveals H3K27Ac as a driver of gene expression prediction across patients with glioblastoma.基于多种表观遗传特征的机器学习揭示了H3K27Ac是胶质母细胞瘤患者基因表达预测的驱动因素。
PLoS Comput Biol. 2025 Aug 7;21(8):e1012272. doi: 10.1371/journal.pcbi.1012272. eCollection 2025 Aug.
3
Perspective on recent developments and challenges in regulatory and systems genomics.

本文引用的文献

1
Integrating Long-Range Regulatory Interactions to Predict Gene Expression Using Graph Convolutional Networks.基于图卷积网络整合长程调控相互作用以预测基因表达。
J Comput Biol. 2022 May;29(5):409-424. doi: 10.1089/cmb.2021.0316. Epub 2022 Mar 21.
2
HiCAR is a robust and sensitive method to analyze open-chromatin-associated genome organization.HiCAR 是一种强大而敏感的方法,可用于分析开放染色质相关的基因组结构。
Mol Cell. 2022 Mar 17;82(6):1225-1238.e6. doi: 10.1016/j.molcel.2022.01.023. Epub 2022 Feb 22.
3
Effective gene expression prediction from sequence by integrating long-range interactions.
监管与系统基因组学的最新进展及挑战之展望
Bioinform Adv. 2025 May 9;5(1):vbaf106. doi: 10.1093/bioadv/vbaf106. eCollection 2025.
4
In vivo mapping of mutagenesis sensitivity of human enhancers.人类增强子诱变敏感性的体内图谱分析。
Nature. 2025 Jun 18. doi: 10.1038/s41586-025-09182-w.
5
Developing a general AI model for integrating diverse genomic modalities and comprehensive genomic knowledge.开发一个用于整合多种基因组模式和全面基因组知识的通用人工智能模型。
bioRxiv. 2025 May 14:2025.05.08.652986. doi: 10.1101/2025.05.08.652986.
6
Gauge fixing for sequence-function relationships.序列-功能关系的规范固定
PLoS Comput Biol. 2025 Mar 20;21(3):e1012818. doi: 10.1371/journal.pcbi.1012818. eCollection 2025.
7
A multi-modal transformer for cell type-agnostic regulatory predictions.一种用于细胞类型无关调节预测的多模态变压器。
Cell Genom. 2025 Feb 12;5(2):100762. doi: 10.1016/j.xgen.2025.100762. Epub 2025 Jan 29.
8
Uncertainty-aware genomic deep learning with knowledge distillation.基于知识蒸馏的不确定性感知基因组深度学习
bioRxiv. 2024 Nov 15:2024.11.13.623485. doi: 10.1101/2024.11.13.623485.
9
Genome wide clustering on integrated chromatin states and Micro-C contacts reveals chromatin interaction signatures.基于整合染色质状态和Micro-C接触的全基因组聚类揭示了染色质相互作用特征。
NAR Genom Bioinform. 2024 Oct 3;6(4):lqae136. doi: 10.1093/nargab/lqae136. eCollection 2024 Sep.
10
Context transcription factors establish cooperative environments and mediate enhancer communication.上下文转录因子建立合作环境并介导增强子通讯。
Nat Genet. 2024 Oct;56(10):2199-2212. doi: 10.1038/s41588-024-01892-7. Epub 2024 Oct 3.
通过整合长程相互作用,从序列中有效预测基因表达。
Nat Methods. 2021 Oct;18(10):1196-1203. doi: 10.1038/s41592-021-01252-x. Epub 2021 Oct 4.
4
HiC-DC+ enables systematic 3D interaction calls and differential analysis for Hi-C and HiChIP.HiC-DC+ 实现了 Hi-C 和 HiChIP 的系统三维交互调用和差异分析。
Nat Commun. 2021 Jun 7;12(1):3366. doi: 10.1038/s41467-021-23749-x.
5
Base-resolution models of transcription-factor binding reveal soft motif syntax.基于分辨率的转录因子结合模型揭示了软基序语法。
Nat Genet. 2021 Mar;53(3):354-366. doi: 10.1038/s41588-021-00782-6. Epub 2021 Feb 18.
6
Targeted Perturb-seq enables genome-scale genetic screens in single cells.靶向扰动测序可在单细胞中进行全基因组规模的遗传筛选。
Nat Methods. 2020 Jun;17(6):629-635. doi: 10.1038/s41592-020-0837-5. Epub 2020 Jun 1.
7
Predicting mRNA Abundance Directly from Genomic Sequence Using Deep Convolutional Neural Networks.利用深度卷积神经网络直接从基因组序列预测 mRNA 丰度。
Cell Rep. 2020 May 19;31(7):107663. doi: 10.1016/j.celrep.2020.107663.
8
Ultrastructural Details of Mammalian Chromosome Architecture.哺乳动物染色体结构的超微结构细节
Mol Cell. 2020 May 7;78(3):554-565.e7. doi: 10.1016/j.molcel.2020.03.003. Epub 2020 Mar 25.
9
Activity-by-contact model of enhancer-promoter regulation from thousands of CRISPR perturbations.基于数千个 CRISPR 干扰的增强子-启动子调控的活性-接触模型。
Nat Genet. 2019 Dec;51(12):1664-1669. doi: 10.1038/s41588-019-0538-0. Epub 2019 Nov 29.
10
Chromatin-informed inference of transcriptional programs in gynecologic and basal breast cancers.基于染色质信息推断妇科和基底型乳腺癌中的转录程序。
Nat Commun. 2019 Sep 25;10(1):4369. doi: 10.1038/s41467-019-12291-6.