• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过深度递归神经网络检测与甲基化 DNA 结合的转录因子。

Detection of transcription factors binding to methylated DNA by deep recurrent neural network.

机构信息

College of Information and Computer Engineering at Northeast Forestry University of China.

School of management at Henan Institute of Technology of China.

出版信息

Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab533.

DOI:10.1093/bib/bbab533
PMID:34962264
Abstract

Transcription factors (TFs) are proteins specifically involved in gene expression regulation. It is generally accepted in epigenetics that methylated nucleotides could prevent the TFs from binding to DNA fragments. However, recent studies have confirmed that some TFs have capability to interact with methylated DNA fragments to further regulate gene expression. Although biochemical experiments could recognize TFs binding to methylated DNA sequences, these wet experimental methods are time-consuming and expensive. Machine learning methods provide a good choice for quickly identifying these TFs without experimental materials. Thus, this study aims to design a robust predictor to detect methylated DNA-bound TFs. We firstly proposed using tripeptide word vector feature to formulate protein samples. Subsequently, based on recurrent neural network with long short-term memory, a two-step computational model was designed. The first step predictor was utilized to discriminate transcription factors from non-transcription factors. Once proteins were predicted as TFs, the second step predictor was employed to judge whether the TFs can bind to methylated DNA. Through the independent dataset test, the accuracies of the first step and the second step are 86.63% and 73.59%, respectively. In addition, the statistical analysis of the distribution of tripeptides in training samples showed that the position and number of some tripeptides in the sequence could affect the binding of TFs to methylated DNA. Finally, on the basis of our model, a free web server was established based on the proposed model, which can be available at https://bioinfor.nefu.edu.cn/TFPM/.

摘要

转录因子(TFs)是专门参与基因表达调控的蛋白质。在表观遗传学中,人们普遍认为甲基化核苷酸可以阻止 TFs 与 DNA 片段结合。然而,最近的研究证实,一些 TFs 能够与甲基化的 DNA 片段相互作用,从而进一步调节基因表达。尽管生化实验可以识别与甲基化 DNA 序列结合的 TFs,但这些湿实验方法既耗时又昂贵。机器学习方法为在没有实验材料的情况下快速识别这些 TFs 提供了一个很好的选择。因此,本研究旨在设计一个强大的预测器来检测与甲基化 DNA 结合的 TFs。我们首先提出使用三肽字向量特征来构建蛋白质样本。随后,基于具有长短期记忆的递归神经网络,设计了一个两步计算模型。第一步预测器用于区分转录因子和非转录因子。一旦蛋白质被预测为 TFs,第二步预测器就用于判断 TFs 是否可以与甲基化 DNA 结合。通过独立数据集测试,第一步和第二步的准确率分别为 86.63%和 73.59%。此外,对训练样本中三肽分布的统计分析表明,序列中某些三肽的位置和数量可能会影响 TFs 与甲基化 DNA 的结合。最后,在我们的模型基础上,建立了一个基于所提出模型的免费网络服务器,可在 https://bioinfor.nefu.edu.cn/TFPM/ 上访问。

相似文献

1
Detection of transcription factors binding to methylated DNA by deep recurrent neural network.通过深度递归神经网络检测与甲基化 DNA 结合的转录因子。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab533.
2
Identifying Transcription Factors That Prefer Binding to Methylated DNA Using Reduced -Gap Dipeptide Composition.利用减少间隙二肽组成鉴定偏好结合甲基化DNA的转录因子
ACS Omega. 2022 Aug 30;7(36):32322-32330. doi: 10.1021/acsomega.2c03696. eCollection 2022 Sep 13.
3
DeepTFactor: A deep learning-based tool for the prediction of transcription factors.DeepTFactor:一种基于深度学习的转录因子预测工具。
Proc Natl Acad Sci U S A. 2021 Jan 12;118(2). doi: 10.1073/pnas.2021171118.
4
Predicting Preference of Transcription Factors for Methylated DNA Using Sequence Information.利用序列信息预测转录因子对甲基化DNA的偏好性
Mol Ther Nucleic Acids. 2020 Jul 31;22:1043-1050. doi: 10.1016/j.omtn.2020.07.035. eCollection 2020 Dec 4.
5
Identifying the "stripe" transcription factors and cooperative binding related to DNA methylation.鉴定与 DNA 甲基化相关的“条纹”转录因子和协同结合。
Commun Biol. 2024 Oct 5;7(1):1265. doi: 10.1038/s42003-024-06992-y.
6
Toward a base-resolution panorama of the in vivo impact of cytosine methylation on transcription factor binding.朝向一个基于分辨率的全景图,展示体内胞嘧啶甲基化对转录因子结合的影响。
Genome Biol. 2022 Jul 7;23(1):151. doi: 10.1186/s13059-022-02713-y.
7
A survey on protein-DNA-binding sites in computational biology.计算生物学中蛋白质-DNA 结合位点研究综述。
Brief Funct Genomics. 2022 Sep 16;21(5):357-375. doi: 10.1093/bfgp/elac009.
8
MeDReaders: a database for transcription factors that bind to methylated DNA.MeDReaders:一个与甲基化 DNA 结合的转录因子数据库。
Nucleic Acids Res. 2018 Jan 4;46(D1):D146-D151. doi: 10.1093/nar/gkx1096.
9
Enhancing the interpretability of transcription factor binding site prediction using attention mechanism.利用注意力机制提高转录因子结合位点预测的可解释性。
Sci Rep. 2020 Aug 7;10(1):13413. doi: 10.1038/s41598-020-70218-4.
10
High-resolution transcription factor binding sites prediction improved performance and interpretability by deep learning method.深度学习方法提高了高分辨率转录因子结合位点预测的性能和可解释性。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab273.

引用本文的文献

1
TFProtBert: Detection of Transcription Factors Binding to Methylated DNA Using ProtBert Latent Space Representation.TFProtBert:利用ProtBert潜在空间表示法检测与甲基化DNA结合的转录因子
Int J Mol Sci. 2025 Apr 29;26(9):4234. doi: 10.3390/ijms26094234.
2
Identifying the DNA methylation preference of transcription factors using ProtBERT and SVM.使用ProtBERT和支持向量机识别转录因子的DNA甲基化偏好性。
PLoS Comput Biol. 2025 May 13;21(5):e1012513. doi: 10.1371/journal.pcbi.1012513. eCollection 2025 May.
3
A First Computational Frame for Recognizing Heparin-Binding Protein.
一种用于识别肝素结合蛋白的首个计算框架。
Diagnostics (Basel). 2023 Jul 24;13(14):2465. doi: 10.3390/diagnostics13142465.
4
Recall DNA methylation levels at low coverage sites using a CNN model in WGBS.使用 CNN 模型在 WGBS 中召回低覆盖位点的 DNA 甲基化水平。
PLoS Comput Biol. 2023 Jun 14;19(6):e1011205. doi: 10.1371/journal.pcbi.1011205. eCollection 2023 Jun.
5
Computational prediction of promotors in strain C58 by using the machine learning technique.利用机器学习技术对C58菌株中的启动子进行计算预测。
Front Microbiol. 2023 Apr 13;14:1170785. doi: 10.3389/fmicb.2023.1170785. eCollection 2023.
6
Predicting promoters based on novel feature descriptor and feature selection technique.基于新型特征描述符和特征选择技术预测启动子。
Front Microbiol. 2023 Mar 2;14:1141227. doi: 10.3389/fmicb.2023.1141227. eCollection 2023.
7
A capsule network-based method for identifying transcription factors.一种基于胶囊网络的转录因子识别方法。
Front Microbiol. 2022 Dec 6;13:1048478. doi: 10.3389/fmicb.2022.1048478. eCollection 2022.
8
SCMRSA: a New Approach for Identifying and Analyzing Anti-MRSA Peptides Using Estimated Propensity Scores of Dipeptides.SCMRSA:一种利用二肽估计倾向得分鉴定和分析抗耐甲氧西林金黄色葡萄球菌肽的新方法。
ACS Omega. 2022 Sep 1;7(36):32653-32664. doi: 10.1021/acsomega.2c04305. eCollection 2022 Sep 13.
9
Identifying Transcription Factors That Prefer Binding to Methylated DNA Using Reduced -Gap Dipeptide Composition.利用减少间隙二肽组成鉴定偏好结合甲基化DNA的转录因子
ACS Omega. 2022 Aug 30;7(36):32322-32330. doi: 10.1021/acsomega.2c03696. eCollection 2022 Sep 13.
10
Assessing comparative importance of DNA sequence and epigenetic modifications on gene expression using a deep convolutional neural network.使用深度卷积神经网络评估DNA序列和表观遗传修饰对基因表达的相对重要性。
Comput Struct Biotechnol J. 2022 Jul 13;20:3814-3823. doi: 10.1016/j.csbj.2022.07.014. eCollection 2022.