• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

可解释深度学习揭示的近端遗传元件对 DNA-蛋白质相互作用的调控

Modulation of DNA-protein Interactions by Proximal Genetic Elements as Uncovered by Interpretable Deep Learning.

机构信息

Department of Biochemical Engineering & Biotechnology, Indian Institute of Technology (IIT) Delhi, New Delhi 110016, India.

Department of Biochemical Engineering & Biotechnology, Indian Institute of Technology (IIT) Delhi, New Delhi 110016, India; Regional Centre for Biotechnology (RCB), Faridabad, Haryana 121001, India.

出版信息

J Mol Biol. 2023 Jul 1;435(13):168121. doi: 10.1016/j.jmb.2023.168121. Epub 2023 Apr 24.

DOI:10.1016/j.jmb.2023.168121
PMID:37100167
Abstract

Transcription factors (TF) recognize specific motifs in the genome that are typically 6-12 bp long to regulate various aspects of the cellular machinery. Presence of binding motifs and favorable genome accessibility are key drivers for a consistent TF-DNA interaction. Although these pre-requisites may occur thousands of times in the genome, there seems to be a high degree of selectivity for the sites that are actually bound. Here, we present a deep-learning framework that identifies and characterizes the upstream and downstream genetic elements to the binding motif, for their role in enforcing the mentioned selectivity. The proposed framework is based on an interpretable recurrent neural network architecture that enables for the relative analysis of sequence context features. We apply the framework to model twenty-six transcription factors and score the TF-DNA binding at a base-pair resolution. We find significant differences in activations of DNA context features for bound and unbound sequences. In addition to standardized evaluation protocols, we offer outstanding interpretability that enables us to identify and annotate DNA sequence with possible elements that modulate TF-DNA binding. Also, differences in data processing have a huge influence on the overall model performance. Overall, the proposed framework allows for novel insights on the non-coding genetic elements and their role in facilitating a stable TF-DNA interaction.

摘要

转录因子 (TF) 识别基因组中特定的基序,这些基序通常长 6-12 个碱基对,用于调节细胞机制的各个方面。结合基序的存在和有利的基因组可及性是 TF-DNA 相互作用一致性的关键驱动因素。尽管这些前提条件在基因组中可能发生数千次,但实际上结合的位点似乎具有高度的选择性。在这里,我们提出了一个深度学习框架,用于识别和描述结合基序上游和下游的遗传元件,以研究它们在增强上述选择性方面的作用。所提出的框架基于可解释的递归神经网络架构,能够对序列上下文特征进行相对分析。我们将该框架应用于模拟二十六种转录因子,并以碱基对分辨率对 TF-DNA 结合进行评分。我们发现结合和未结合序列的 DNA 上下文特征的激活存在显著差异。除了标准化的评估协议外,我们还提供了出色的可解释性,使我们能够识别和注释可能调节 TF-DNA 结合的 DNA 序列元素。此外,数据处理的差异对整体模型性能有巨大影响。总的来说,该框架允许对非编码遗传元件及其在促进稳定 TF-DNA 相互作用中的作用有新的认识。

相似文献

1
Modulation of DNA-protein Interactions by Proximal Genetic Elements as Uncovered by Interpretable Deep Learning.可解释深度学习揭示的近端遗传元件对 DNA-蛋白质相互作用的调控
J Mol Biol. 2023 Jul 1;435(13):168121. doi: 10.1016/j.jmb.2023.168121. Epub 2023 Apr 24.
2
Deep neural networks identify sequence context features predictive of transcription factor binding.深度神经网络可识别预测转录因子结合的序列上下文特征。
Nat Mach Intell. 2021 Feb;3(2):172-180. doi: 10.1038/s42256-020-00282-y. Epub 2021 Jan 18.
3
Enhancing the interpretability of transcription factor binding site prediction using attention mechanism.利用注意力机制提高转录因子结合位点预测的可解释性。
Sci Rep. 2020 Aug 7;10(1):13413. doi: 10.1038/s41598-020-70218-4.
4
High-resolution transcription factor binding sites prediction improved performance and interpretability by deep learning method.深度学习方法提高了高分辨率转录因子结合位点预测的性能和可解释性。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab273.
5
Base-resolution prediction of transcription factor binding signals by a deep learning framework.基于深度学习框架的转录因子结合信号的碱基分辨率预测。
PLoS Comput Biol. 2022 Mar 9;18(3):e1009941. doi: 10.1371/journal.pcbi.1009941. eCollection 2022 Mar.
6
Discovering epistatic feature interactions from neural network models of regulatory DNA sequences.从调控 DNA 序列的神经网络模型中发现上位特征相互作用。
Bioinformatics. 2018 Sep 1;34(17):i629-i637. doi: 10.1093/bioinformatics/bty575.
7
An interpretable bimodal neural network characterizes the sequence and preexisting chromatin predictors of induced transcription factor binding.可解释的双模神经网络刻画了诱导转录因子结合的序列和预先存在的染色质预测因子。
Genome Biol. 2021 Jan 7;22(1):20. doi: 10.1186/s13059-020-02218-6.
8
DeepD2V: A Novel Deep Learning-Based Framework for Predicting Transcription Factor Binding Sites from Combined DNA Sequence.DeepD2V:一种基于深度学习的新型框架,用于从组合 DNA 序列预测转录因子结合位点。
Int J Mol Sci. 2021 May 24;22(11):5521. doi: 10.3390/ijms22115521.
9
DeepCAGE: Incorporating Transcription Factors in Genome-wide Prediction of Chromatin Accessibility.DeepCAGE:在全基因组预测染色质可及性中纳入转录因子。
Genomics Proteomics Bioinformatics. 2022 Jun;20(3):496-507. doi: 10.1016/j.gpb.2021.08.015. Epub 2022 Mar 12.
10
A novel method to identify the DNA motifs recognized by a defined transcription factor.一种鉴定特定转录因子识别的 DNA 基序的新方法。
Plant Mol Biol. 2014 Nov;86(4-5):367-80. doi: 10.1007/s11103-014-0234-5. Epub 2014 Aug 10.

引用本文的文献

1
Transfer Learning in Cancer Genetics, Mutation Detection, Gene Expression Analysis, and Syndrome Recognition.癌症遗传学、突变检测、基因表达分析和综合征识别中的迁移学习
Cancers (Basel). 2024 Jun 4;16(11):2138. doi: 10.3390/cancers16112138.