• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

深度TFBS:利用深度多任务和迁移学习改进转录因子结合的种内和跨物种预测。

deepTFBS: Improving within- and Cross-Species Prediction of Transcription Factor Binding Using Deep Multi-Task and Transfer Learning.

作者信息

Zhai Jingjing, Zhang Yuzhou, Zhang Chujun, Yin Xiaotong, Song Minggui, Tang Chenglong, Ding Pengjun, Li Zenglin, Ma Chuang

机构信息

State Key Laboratory for Crop Stress Resistance and High-Efficiency Production, Center of Bioinformatics, College of Life Sciences, Northwest A&F University, Yangling, Shaanxi, 712100, China.

Key Laboratory of Biology and Genetics Improvement of Maize in Arid Area of Northwest Region, Ministry of Agriculture, Northwest A&F University, Yangling, Shaanxi, 712100, China.

出版信息

Adv Sci (Weinh). 2025 Aug;12(30):e03135. doi: 10.1002/advs.202503135. Epub 2025 May 24.

DOI:10.1002/advs.202503135
PMID:40411397
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12376555/
Abstract

The precise prediction of transcription factor binding sites (TFBSs) is crucial in understanding gene regulation. In this study, deepTFBS, a comprehensive deep learning (DL) framework that builds a robust DNA language model of TF binding grammar for accurately predicting TFBSs within and across plant species is presented. Taking advantages of multi-task DL and transfer learning, deepTFBS is capable of leveraging the knowledge learned from large-scale TF binding profiles to enhance the prediction of TFBSs under small-sample training and cross-species prediction tasks. When tested using available information on 359 Arabidopsis TFs, deepTFBS outperformed previously described prediction strategies, including position weight matrix, deepSEA and DanQ, with a 244.49%, 49.15%, and 23.32% improvement of the area under the precision-recall curve (PRAUC), respectively. Further cross-species prediction of TFBS in wheat showed that deepTFBS yielded a significant PRAUC improvement of 30.6% over these three baseline models. deepTFBS can also utilize information from gene conservation and binding motifs, enabling efficient TFBS prediction in species where experimental data availability is limited. A case study, focusing on the WUSCHEL (WUS) transcription factor, illustrated the potential use of deepTFBS in cross-species applications, in our example between Arabidopsis and wheat. deepTFBS is publically available at https://github.com/cma2015/deepTFBS.

摘要

转录因子结合位点(TFBSs)的精确预测对于理解基因调控至关重要。在本研究中,我们提出了deepTFBS,这是一个全面的深度学习(DL)框架,它构建了一个强大的TF结合语法DNA语言模型,用于准确预测植物物种内和跨物种的TFBSs。利用多任务深度学习和迁移学习,deepTFBS能够利用从大规模TF结合谱中学到的知识,在小样本训练和跨物种预测任务中增强TFBSs的预测。当使用359个拟南芥TF的可用信息进行测试时,deepTFBS优于先前描述的预测策略,包括位置权重矩阵、deepSEA和DanQ,精确召回率曲线下面积(PRAUC)分别提高了244.49%、49.15%和23.32%。对小麦中TFBS的进一步跨物种预测表明,deepTFBS比这三个基线模型的PRAUC有显著提高,提高了30.6%。deepTFBS还可以利用基因保守性和结合基序的信息,在实验数据有限的物种中实现高效的TFBS预测。一个以WUSCHEL(WUS)转录因子为重点的案例研究,说明了deepTFBS在跨物种应用中的潜在用途,在我们的例子中是拟南芥和小麦之间。deepTFBS可在https://github.com/cma2015/deepTFBS上公开获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f846/12376555/53eab13b5526/ADVS-12-e03135-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f846/12376555/846b0685b0cc/ADVS-12-e03135-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f846/12376555/97b62eea9234/ADVS-12-e03135-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f846/12376555/d6a4db8ca224/ADVS-12-e03135-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f846/12376555/302801f0d21f/ADVS-12-e03135-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f846/12376555/3e209eec1528/ADVS-12-e03135-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f846/12376555/53eab13b5526/ADVS-12-e03135-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f846/12376555/846b0685b0cc/ADVS-12-e03135-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f846/12376555/97b62eea9234/ADVS-12-e03135-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f846/12376555/d6a4db8ca224/ADVS-12-e03135-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f846/12376555/302801f0d21f/ADVS-12-e03135-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f846/12376555/3e209eec1528/ADVS-12-e03135-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f846/12376555/53eab13b5526/ADVS-12-e03135-g007.jpg

相似文献

1
deepTFBS: Improving within- and Cross-Species Prediction of Transcription Factor Binding Using Deep Multi-Task and Transfer Learning.深度TFBS:利用深度多任务和迁移学习改进转录因子结合的种内和跨物种预测。
Adv Sci (Weinh). 2025 Aug;12(30):e03135. doi: 10.1002/advs.202503135. Epub 2025 May 24.
2
BERT-TFBS: a novel BERT-based model for predicting transcription factor binding sites by transfer learning.BERT-TFBS:一种基于迁移学习的用于预测转录因子结合位点的新型基于BERT的模型。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae195.
3
TFBSFootprinter: a multiomics tool for prediction of transcription factor binding sites in vertebrate species.TFBSFootprinter:一种用于预测脊椎动物物种中转录因子结合位点的多组学工具。
Transcription. 2025 Apr-Jun;16(2-3):204-223. doi: 10.1080/21541264.2025.2521764. Epub 2025 Jul 11.
4
Benchmarking transcription factor binding site prediction models: a comparative analysis on synthetic and biological data.基准测试转录因子结合位点预测模型:对合成数据和生物数据的比较分析
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf363.
5
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
6
Development and Validation of a Convolutional Neural Network Model to Predict a Pathologic Fracture in the Proximal Femur Using Abdomen and Pelvis CT Images of Patients With Advanced Cancer.利用晚期癌症患者腹部和骨盆 CT 图像建立卷积神经网络模型预测股骨近端病理性骨折的研究
Clin Orthop Relat Res. 2023 Nov 1;481(11):2247-2256. doi: 10.1097/CORR.0000000000002771. Epub 2023 Aug 23.
7
CGGBP1 from higher amniotes restricts cytosine methylation and drives a GC-bias in transcription factor-binding sites at repressed promoters.高等羊膜动物的CGGBP1可限制胞嘧啶甲基化,并在抑制性启动子的转录因子结合位点上驱动GC偏好。
Transcription. 2025 Jul 31:1-36. doi: 10.1080/21541264.2025.2533598.
8
A deep learning approach to direct immunofluorescence pattern recognition in autoimmune bullous diseases.深度学习方法在自身免疫性大疱性疾病中的直接免疫荧光模式识别。
Br J Dermatol. 2024 Jul 16;191(2):261-266. doi: 10.1093/bjd/ljae142.
9
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
10
Short-Term Memory Impairment短期记忆障碍

本文引用的文献

1
PDLLMs: A group of tailored DNA large language models for analyzing plant genomes.PDLLMs:一组用于分析植物基因组的定制化DNA大语言模型。
Mol Plant. 2025 Feb 3;18(2):175-178. doi: 10.1016/j.molp.2024.12.006. Epub 2024 Dec 9.
2
MLSNet: a deep learning model for predicting transcription factor binding sites.MLSNet:一种用于预测转录因子结合位点的深度学习模型。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae489.
3
A foundational large language model for edible plant genomes.食用植物基因组的基础大语言模型。
Commun Biol. 2024 Jul 9;7(1):835. doi: 10.1038/s42003-024-06465-2.
4
BERT-TFBS: a novel BERT-based model for predicting transcription factor binding sites by transfer learning.BERT-TFBS:一种基于迁移学习的用于预测转录因子结合位点的新型基于BERT的模型。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae195.
5
Inflammation-Targeted Nanomedicines Alleviate Oxidative Stress and Reprogram Macrophages Polarization for Myocardial Infarction Treatment.炎症靶向纳米药物减轻氧化应激并重塑巨噬细胞极化以治疗心肌梗死。
Adv Sci (Weinh). 2024 Jun;11(21):e2308910. doi: 10.1002/advs.202308910. Epub 2024 Apr 6.
6
Class imbalance should not throw you off balance: Choosing the right classifiers and performance metrics for brain decoding with imbalanced data.不要被类别不平衡问题困扰:选择合适的分类器和性能指标,对不平衡数据进行脑解码。
Neuroimage. 2023 Aug 15;277:120253. doi: 10.1016/j.neuroimage.2023.120253. Epub 2023 Jun 28.
7
Transcription factor binding sites are frequently under accelerated evolution in primates.转录因子结合位点在灵长类动物中经常处于加速进化状态。
Nat Commun. 2023 Feb 11;14(1):783. doi: 10.1038/s41467-023-36421-3.
8
Evolutionary dynamics and adaptive benefits of deleterious mutations in crop gene pools.作物基因库中有害突变的进化动态和适应优势。
Trends Plant Sci. 2023 Jun;28(6):685-697. doi: 10.1016/j.tplants.2023.01.006. Epub 2023 Feb 8.
9
Transposable elements orchestrate subgenome-convergent and -divergent transcription in common wheat.转座元件在普通小麦中调控亚基因组趋同和趋异转录。
Nat Commun. 2022 Nov 14;13(1):6940. doi: 10.1038/s41467-022-34290-w.
10
PlantBind: an attention-based multi-label neural network for predicting plant transcription factor binding sites.植物结合域预测:基于注意力的多标签神经网络方法
Brief Bioinform. 2022 Nov 19;23(6). doi: 10.1093/bib/bbac425.