• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

MulTFBS:一种具有多通道的时空网络,用于预测转录因子结合位点。

MulTFBS: A Spatial-Temporal Network with Multichannels for Predicting Transcription Factor Binding Sites.

机构信息

The School of Science, Dalian Maritime University, Dalian 116026, China.

出版信息

J Chem Inf Model. 2024 May 27;64(10):4322-4333. doi: 10.1021/acs.jcim.3c02088. Epub 2024 May 11.

DOI:10.1021/acs.jcim.3c02088
PMID:38733561
Abstract

Revealing the mechanisms that influence transcription factor binding specificity is the key to understanding gene regulation. In previous studies, DNA double helix structure and one-hot embedding have been used successfully to design computational methods for predicting transcription factor binding sites (TFBSs). However, DNA sequence as a kind of biological language, the method of word embedding representation in natural language processing, has not been considered properly in TFBS prediction models. In our work, we integrate different types of features of DNA sequence to design a multichanneled deep learning framework, namely MulTFBS, in which independent one-hot encoding, word embedding encoding, which can incorporate contextual information and extract the global features of the sequences, and double helix three-dimensional structural features have been trained in different channels. To extract sequence high-level information effectively, in our deep learning framework, we select the spatial-temporal network by combining convolutional neural networks and bidirectional long short-term memory networks with attention mechanism. Compared with six state-of-the-art methods on 66 universal protein-binding microarray data sets of different transcription factors, MulTFBS performs best on all data sets in the regression tasks, with the average of 0.698 and the average PCC of 0.833, which are 5.4% and 3.2% higher, respectively, than the suboptimal method CRPTS. In addition, we evaluate the classification performance of MulTFBS for distinguishing bound or unbound regions on TF ChIP-seq data. The results show that our framework also performs well in the TFBS classification tasks.

摘要

揭示影响转录因子结合特异性的机制是理解基因调控的关键。在以前的研究中,已经成功地使用 DNA 双螺旋结构和独热嵌入来设计用于预测转录因子结合位点 (TFBS) 的计算方法。然而,DNA 序列作为一种生物语言,在 TFBS 预测模型中尚未得到适当考虑自然语言处理中的词嵌入表示方法。在我们的工作中,我们整合了 DNA 序列的不同类型特征来设计一个多通道深度学习框架,即 MulTFBS,其中独立的独热编码、可以合并上下文信息并提取序列全局特征的词嵌入编码以及双螺旋三维结构特征已在不同通道中进行了训练。为了有效地提取序列的高级信息,在我们的深度学习框架中,我们通过结合卷积神经网络和具有注意力机制的双向长短期记忆网络来选择时空网络。在 66 个不同转录因子的通用蛋白质结合微阵列数据集上,与六种最先进的方法进行比较,MulTFBS 在所有回归任务数据集中的表现都优于其他方法,平均 为 0.698,平均 PCC 为 0.833,分别比次优方法 CRPTS 高 5.4%和 3.2%。此外,我们评估了 MulTFBS 在区分 TF ChIP-seq 数据上的结合或未结合区域的分类性能。结果表明,我们的框架在 TFBS 分类任务中也表现良好。

相似文献

1
MulTFBS: A Spatial-Temporal Network with Multichannels for Predicting Transcription Factor Binding Sites.MulTFBS:一种具有多通道的时空网络,用于预测转录因子结合位点。
J Chem Inf Model. 2024 May 27;64(10):4322-4333. doi: 10.1021/acs.jcim.3c02088. Epub 2024 May 11.
2
BERT-TFBS: a novel BERT-based model for predicting transcription factor binding sites by transfer learning.BERT-TFBS:一种基于迁移学习的用于预测转录因子结合位点的新型基于BERT的模型。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae195.
3
DeepD2V: A Novel Deep Learning-Based Framework for Predicting Transcription Factor Binding Sites from Combined DNA Sequence.DeepD2V:一种基于深度学习的新型框架,用于从组合 DNA 序列预测转录因子结合位点。
Int J Mol Sci. 2021 May 24;22(11):5521. doi: 10.3390/ijms22115521.
4
CacPred: a cascaded convolutional neural network for TF-DNA binding prediction.CacPred:用于转录因子-脱氧核糖核酸结合预测的级联卷积神经网络
BMC Genomics. 2025 Mar 18;26(Suppl 2):264. doi: 10.1186/s12864-025-11399-y.
5
MLSNet: a deep learning model for predicting transcription factor binding sites.MLSNet:一种用于预测转录因子结合位点的深度学习模型。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae489.
6
DeepSTF: predicting transcription factor binding sites by interpretable deep neural networks combining sequence and shape.DeepSTF:通过结合序列和形状的可解释深度神经网络预测转录因子结合位点。
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad231.
7
A novel convolution attention model for predicting transcription factor binding sites by combination of sequence and shape.一种新型卷积注意力模型,通过序列和形状的结合来预测转录因子结合位点。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab525.
8
A KAN-based hybrid deep neural networks for accurate identification of transcription factor binding sites.一种基于KAN的混合深度神经网络,用于准确识别转录因子结合位点。
PLoS One. 2025 May 7;20(5):e0322978. doi: 10.1371/journal.pone.0322978. eCollection 2025.
9
Deciphering the Language of Protein-DNA Interactions: A Deep Learning Approach Combining Contextual Embeddings and Multi-Scale Sequence Modeling.解析蛋白质- DNA 相互作用的语言:结合上下文嵌入和多尺度序列建模的深度学习方法。
J Mol Biol. 2024 Nov 15;436(22):168769. doi: 10.1016/j.jmb.2024.168769. Epub 2024 Aug 29.
10
An Integrative Framework for Combining Sequence and Epigenomic Data to Predict Transcription Factor Binding Sites Using Deep Learning.一种用于结合序列和表观基因组数据以利用深度学习预测转录因子结合位点的整合框架。
IEEE/ACM Trans Comput Biol Bioinform. 2021 Jan-Feb;18(1):355-364. doi: 10.1109/TCBB.2019.2901789. Epub 2021 Feb 3.

引用本文的文献

1
Identifying transcription factors with cell-type specific DNA binding signatures.鉴定具有细胞类型特异性 DNA 结合特征的转录因子。
BMC Genomics. 2024 Oct 14;25(1):957. doi: 10.1186/s12864-024-10859-1.