• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用ProtBERT和支持向量机识别转录因子的DNA甲基化偏好性。

Identifying the DNA methylation preference of transcription factors using ProtBERT and SVM.

作者信息

Li Yanchao, Zou Quan, Dai Qi, Stalin Antony, Luo Ximei

机构信息

School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu, Sichuan, China.

Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, Sichuan, China.

出版信息

PLoS Comput Biol. 2025 May 13;21(5):e1012513. doi: 10.1371/journal.pcbi.1012513. eCollection 2025 May.

DOI:10.1371/journal.pcbi.1012513
PMID:40359430
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12121914/
Abstract

Transcription factors (TFs) can affect gene expression by binding to certain specific DNA sequences. This binding process of TFs may be modulated by DNA methylation. A subset of TFs that serve as methylation readers preferentially binds to certain methylated DNA and is defined as TFPM. The identification of TFPMs enhances our understanding of DNA methylation's role in gene regulation. However, their experimental identification is resource-demanding. In this study, we propose a novel two-step computational approach to classify TFs and TFPMs. First, we employed a fine-tuned ProtBERT model to differentiate between the classes of TFs and non-TFs. Second, we combined the Reduced Amino Acid Category (RAAC) with K-mer and SVM to predict the potential of TFs to bind to methylated DNA. Comparative experiments demonstrate that our proposed methods outperform all existing approaches and emphasize the efficiency of our computational framework in classifying TFs and TFPMs. Cross-species validation on an independent mouse dataset further demonstrates the generalizability of our proposed framework In addition, we conducted predictions on all human transcription factors and found that most of the top 20 proteins belong to the Krueppel C2H2-type Zinc-finger family. So far, some studies have demonstrated a partial correlation between this family and DNA methylation and confirmed the preference of some of its members, thereby showing the robustness of our approach.

摘要

转录因子(TFs)可通过与某些特定的DNA序列结合来影响基因表达。TFs的这种结合过程可能会受到DNA甲基化的调控。作为甲基化读取器的一部分转录因子优先与某些甲基化DNA结合,并被定义为TFPM。TFPM的鉴定增强了我们对DNA甲基化在基因调控中作用的理解。然而,对它们进行实验鉴定需要大量资源。在本研究中,我们提出了一种新颖的两步计算方法来对TFs和TFPMs进行分类。首先,我们使用一个微调的ProtBERT模型来区分TFs和非TFs类别。其次,我们将简化氨基酸类别(RAAC)与K-mer和支持向量机相结合,以预测TFs与甲基化DNA结合的潜力。对比实验表明,我们提出的方法优于所有现有方法,并强调了我们的计算框架在对TFs和TFPMs进行分类方面的效率。在一个独立的小鼠数据集上进行的跨物种验证进一步证明了我们提出的框架的通用性。此外,我们对所有人类转录因子进行了预测,发现排名前20的蛋白质中大多数属于Krueppel C2H2型锌指家族。到目前为止,一些研究已经证明了这个家族与DNA甲基化之间存在部分关联,并证实了其一些成员的偏好,从而表明了我们方法的稳健性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e3b/12121914/4b6b492aae58/pcbi.1012513.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e3b/12121914/612ae4d728ec/pcbi.1012513.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e3b/12121914/cd5de9800cb4/pcbi.1012513.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e3b/12121914/875fb5b2a684/pcbi.1012513.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e3b/12121914/4b6b492aae58/pcbi.1012513.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e3b/12121914/612ae4d728ec/pcbi.1012513.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e3b/12121914/cd5de9800cb4/pcbi.1012513.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e3b/12121914/875fb5b2a684/pcbi.1012513.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e3b/12121914/4b6b492aae58/pcbi.1012513.g004.jpg

相似文献

1
Identifying the DNA methylation preference of transcription factors using ProtBERT and SVM.使用ProtBERT和支持向量机识别转录因子的DNA甲基化偏好性。
PLoS Comput Biol. 2025 May 13;21(5):e1012513. doi: 10.1371/journal.pcbi.1012513. eCollection 2025 May.
2
TFProtBert: Detection of Transcription Factors Binding to Methylated DNA Using ProtBert Latent Space Representation.TFProtBert:利用ProtBert潜在空间表示法检测与甲基化DNA结合的转录因子
Int J Mol Sci. 2025 Apr 29;26(9):4234. doi: 10.3390/ijms26094234.
3
Detection of transcription factors binding to methylated DNA by deep recurrent neural network.通过深度递归神经网络检测与甲基化 DNA 结合的转录因子。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab533.
4
Identifying Transcription Factors That Prefer Binding to Methylated DNA Using Reduced -Gap Dipeptide Composition.利用减少间隙二肽组成鉴定偏好结合甲基化DNA的转录因子
ACS Omega. 2022 Aug 30;7(36):32322-32330. doi: 10.1021/acsomega.2c03696. eCollection 2022 Sep 13.
5
Toward a base-resolution panorama of the in vivo impact of cytosine methylation on transcription factor binding.朝向一个基于分辨率的全景图,展示体内胞嘧啶甲基化对转录因子结合的影响。
Genome Biol. 2022 Jul 7;23(1):151. doi: 10.1186/s13059-022-02713-y.
6
The combination approach of SVM and ECOC for powerful identification and classification of transcription factor.支持向量机(SVM)和纠错输出编码(ECOC)相结合的方法用于转录因子的高效识别和分类。
BMC Bioinformatics. 2008 Jun 16;9:282. doi: 10.1186/1471-2105-9-282.
7
Base-resolution methylation patterns accurately predict transcription factor bindings in vivo.碱基分辨率甲基化模式可在体内准确预测转录因子结合情况。
Nucleic Acids Res. 2015 Mar 11;43(5):2757-66. doi: 10.1093/nar/gkv151. Epub 2015 Feb 26.
8
Discovering human transcription factor physical interactions with genetic variants, novel DNA motifs, and repetitive elements using enhanced yeast one-hybrid assays.利用增强型酵母单杂交实验发现人类转录因子与遗传变异、新型 DNA 基序和重复元件的物理相互作用。
Genome Res. 2019 Sep;29(9):1533-1544. doi: 10.1101/gr.248823.119.
9
Quantitative modeling of transcription factor binding specificities using DNA shape.利用DNA形状对转录因子结合特异性进行定量建模。
Proc Natl Acad Sci U S A. 2015 Apr 14;112(15):4654-9. doi: 10.1073/pnas.1422023112. Epub 2015 Mar 9.
10
Predicting DNA-binding specificities of eukaryotic transcription factors.预测真核转录因子的 DNA 结合特异性。
PLoS One. 2010 Nov 30;5(11):e13876. doi: 10.1371/journal.pone.0013876.

本文引用的文献

1
Chromosome-scale and haplotype-resolved genome assembly of .……的染色体级别和单倍型解析基因组组装 。 你提供的原文不完整,请补充完整内容以便我能更准确地翻译。
Hortic Res. 2025 Jan 15;12(4):uhaf012. doi: 10.1093/hr/uhaf012. eCollection 2025 Apr.
2
postGWAS: A web server for deciphering the causality post the genome-wide association studies.postGWAS:一个用于在全基因组关联研究后解码因果关系的网络服务器。
Comput Biol Med. 2024 Mar;171:108108. doi: 10.1016/j.compbiomed.2024.108108. Epub 2024 Feb 5.
3
PeptideBERT: A Language Model Based on Transformers for Peptide Property Prediction.
PeptideBERT:一种基于 Transformer 的用于预测肽性质的语言模型。
J Phys Chem Lett. 2023 Nov 23;14(46):10427-10434. doi: 10.1021/acs.jpclett.3c02398. Epub 2023 Nov 13.
4
Investigation of the BERT model on nucleotide sequences with non-standard pre-training and evaluation of different k-mer embeddings.研究非标准预训练的核苷酸序列上的 BERT 模型,并评估不同的 k-mer 嵌入。
Bioinformatics. 2023 Oct 3;39(10). doi: 10.1093/bioinformatics/btad617.
5
PredDRBP-MLP: Prediction of DNA-binding proteins and RNA-binding proteins by multilayer perceptron.PredDRBP-MLP:通过多层感知器预测DNA结合蛋白和RNA结合蛋白
Comput Biol Med. 2023 Sep;164:107317. doi: 10.1016/j.compbiomed.2023.107317. Epub 2023 Aug 7.
6
Advances in Ultrahigh Throughput Hit Discovery with Tandem Mass Spectrometry Encoded Libraries.串联质谱编码文库在超高通量命中发现方面的进展。
J Am Chem Soc. 2023 Aug 30;145(34):19129-19139. doi: 10.1021/jacs.3c04899. Epub 2023 Aug 9.
7
AggBERT: Best in Class Prediction of Hexapeptide Amyloidogenesis with a Semi-Supervised ProtBERT Model.AggBERT:使用半监督 ProtBERT 模型进行六肽淀粉样变预测的最佳表现。
J Chem Inf Model. 2023 Sep 25;63(18):5727-5733. doi: 10.1021/acs.jcim.3c00817. Epub 2023 Aug 8.
8
Enhanced identification of membrane transport proteins: a hybrid approach combining ProtBERT-BFD and convolutional neural networks.增强膜转运蛋白的鉴定:结合 ProtBERT-BFD 和卷积神经网络的混合方法。
J Integr Bioinform. 2023 Jul 28;20(2). doi: 10.1515/jib-2022-0055. eCollection 2023 Jun 1.
9
Transcription factors interact with RNA to regulate genes.转录因子与 RNA 相互作用以调节基因。
Mol Cell. 2023 Jul 20;83(14):2449-2463.e13. doi: 10.1016/j.molcel.2023.06.012. Epub 2023 Jul 3.
10
Prediction of cooperative homeodomain DNA binding sites from high-throughput-SELEX data.高通量 SELEX 数据预测协同同源域 DNA 结合位点。
Nucleic Acids Res. 2023 Jul 7;51(12):6055-6072. doi: 10.1093/nar/gkad318.