• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于新型可变窗口 Z 曲线方法的原核启动子识别。

Recognition of prokaryotic promoters based on a novel variable-window Z-curve method.

机构信息

School of Chemical Engineering and Technology, Tianjin University, Tianjin, 300072, China.

出版信息

Nucleic Acids Res. 2012 Feb;40(3):963-71. doi: 10.1093/nar/gkr795. Epub 2011 Sep 27.

DOI:10.1093/nar/gkr795
PMID:21954440
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3273801/
Abstract

Transcription is the first step in gene expression, and it is the step at which most of the regulation of expression occurs. Although sequenced prokaryotic genomes provide a wealth of information, transcriptional regulatory networks are still poorly understood using the available genomic information, largely because accurate prediction of promoters is difficult. To improve promoter recognition performance, a novel variable-window Z-curve method is developed to extract general features of prokaryotic promoters. The features are used for further classification by the partial least squares technique. To verify the prediction performance, the proposed method is applied to predict promoter fragments of two representative prokaryotic model organisms (Escherichia coli and Bacillus subtilis). Depending on the feature extraction and selection power of the proposed method, the promoter prediction accuracies are improved markedly over most existing approaches: for E. coli, the accuracies are 96.05% (σ(70) promoters, coding negative samples), 90.44% (σ(70) promoters, non-coding negative samples), 92.13% (known sigma-factor promoters, coding negative samples), 92.50% (known sigma-factor promoters, non-coding negative samples), respectively; for B. subtilis, the accuracies are 95.83% (known sigma-factor promoters, coding negative samples) and 99.09% (known sigma-factor promoters, non-coding negative samples). Additionally, being a linear technique, the computational simplicity of the proposed method makes it easy to run in a matter of minutes on ordinary personal computers or even laptops. More importantly, there is no need to optimize parameters, so it is very practical for predicting other species promoters without any prior knowledge or prior information of the statistical properties of the samples.

摘要

转录是基因表达的第一步,也是表达调控发生的主要步骤。尽管已测序的原核生物基因组提供了丰富的信息,但利用现有的基因组信息,转录调控网络仍然知之甚少,这主要是因为准确预测启动子较为困难。为了提高启动子识别性能,提出了一种新的可变窗口 Z 曲线方法来提取原核生物启动子的一般特征。使用偏最小二乘法技术对这些特征进行进一步分类。为了验证预测性能,将该方法应用于两个有代表性的原核生物模型(大肠杆菌和枯草芽孢杆菌)的启动子片段预测。根据所提出方法的特征提取和选择能力,与大多数现有方法相比,预测性能有了显著提高:对于大肠杆菌,预测精度分别为 96.05%(σ(70)启动子,编码负样本)、90.44%(σ(70)启动子,非编码负样本)、92.13%(已知 σ 因子启动子,编码负样本)、92.50%(已知 σ 因子启动子,非编码负样本);对于枯草芽孢杆菌,预测精度分别为 95.83%(已知 σ 因子启动子,编码负样本)和 99.09%(已知 σ 因子启动子,非编码负样本)。此外,作为一种线性技术,所提出方法的计算简单性使得它可以在普通个人计算机甚至笔记本电脑上在几分钟内运行。更重要的是,不需要优化参数,因此对于没有样本统计特性的先验知识或先验信息的其他物种启动子的预测非常实用。

相似文献

1
Recognition of prokaryotic promoters based on a novel variable-window Z-curve method.基于新型可变窗口 Z 曲线方法的原核启动子识别。
Nucleic Acids Res. 2012 Feb;40(3):963-71. doi: 10.1093/nar/gkr795. Epub 2011 Sep 27.
2
Eukaryotic and prokaryotic promoter prediction using hybrid approach.使用混合方法进行真核和原核启动子预测。
Theory Biosci. 2011 Jun;130(2):91-100. doi: 10.1007/s12064-010-0114-8. Epub 2010 Nov 3.
3
A novel method for prokaryotic promoter prediction based on DNA stability.一种基于DNA稳定性的原核生物启动子预测新方法。
BMC Bioinformatics. 2005 Jan 5;6:1. doi: 10.1186/1471-2105-6-1.
4
Recognition of prokaryotic and eukaryotic promoters using convolutional deep learning neural networks.使用卷积深度学习神经网络识别原核生物和真核生物启动子。
PLoS One. 2017 Feb 3;12(2):e0171410. doi: 10.1371/journal.pone.0171410. eCollection 2017.
5
Promoter addresses: revelations from oligonucleotide profiling applied to the Escherichia coli genome.启动子地址:应用于大肠杆菌基因组的寡核苷酸谱分析所揭示的信息
Theor Biol Med Model. 2005 May 31;2:20. doi: 10.1186/1742-4682-2-20.
6
Promoter prediction and annotation of microbial genomes based on DNA sequence and structural responses to superhelical stress.基于DNA序列和对超螺旋应力的结构响应的微生物基因组启动子预测与注释
BMC Bioinformatics. 2006 May 5;7:248. doi: 10.1186/1471-2105-7-248.
7
Predicting essential genes in prokaryotic genomes using a linear method: ZUPLS.使用线性方法ZUPLS预测原核生物基因组中的必需基因。
Integr Biol (Camb). 2014 Apr;6(4):460-9. doi: 10.1039/c3ib40241j. Epub 2014 Mar 7.
8
Predicting strength and function for promoters of the Escherichia coli alternative sigma factor, sigmaE.预测大肠杆菌替代σ因子 sigmaE 的启动子的强度和功能。
Proc Natl Acad Sci U S A. 2010 Feb 16;107(7):2854-9. doi: 10.1073/pnas.0915066107. Epub 2010 Feb 1.
9
An integrative and applicable phylogenetic footprinting framework for cis-regulatory motifs identification in prokaryotic genomes.一种用于原核生物基因组中顺式调控基序识别的综合且适用的系统发育足迹分析框架。
BMC Genomics. 2016 Aug 9;17:578. doi: 10.1186/s12864-016-2982-x.
10
The cross-species prediction of bacterial promoters using a support vector machine.使用支持向量机对细菌启动子进行跨物种预测。
Comput Biol Chem. 2008 Oct;32(5):359-66. doi: 10.1016/j.compbiolchem.2008.07.009. Epub 2008 Jul 15.

引用本文的文献

1
CAPE: a deep learning framework with Chaos-Attention net for Promoter Evolution.CAPE:用于启动子进化的具有混沌注意力网络的深度学习框架。
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae398.
2
Predicting Promoters in Multiple Prokaryotes with Prompt.利用 Prompt 预测多种原核生物的启动子。
Interdiscip Sci. 2024 Dec;16(4):814-828. doi: 10.1007/s12539-024-00637-8. Epub 2024 Aug 7.
3
iPro2L-DG: Hybrid network based on improved densenet and global attention mechanism for identifying promoter sequences.iPro2L-DG:基于改进型密集连接网络和全局注意力机制的混合网络用于识别启动子序列。
Heliyon. 2024 Mar 6;10(6):e27364. doi: 10.1016/j.heliyon.2024.e27364. eCollection 2024 Mar 30.
4
Computational identification of promoters in by using support vector machine.利用支持向量机对[具体对象]中的启动子进行计算识别。 (原文中“in by using”表述不完整,推测应该是“in [具体对象] by using” ,这里按照推测后的完整意思翻译)
Front Microbiol. 2023 May 5;14:1200678. doi: 10.3389/fmicb.2023.1200678. eCollection 2023.
5
iProm-Sigma54: A CNN Base Prediction Tool for Promoters.iProm-Sigma54:一种基于 CNN 的启动子预测工具。
Cells. 2023 Mar 7;12(6):829. doi: 10.3390/cells12060829.
6
Sigma70Pred: A highly accurate method for predicting sigma70 promoter in K-12 strains.Sigma70Pred:一种预测K-12菌株中sigma70启动子的高精度方法。
Front Microbiol. 2022 Nov 14;13:1042127. doi: 10.3389/fmicb.2022.1042127. eCollection 2022.
7
iPro-WAEL: a comprehensive and robust framework for identifying promoters in multiple species.iPro-WAEL:一种全面而强大的多物种启动子识别框架。
Nucleic Acids Res. 2022 Oct 14;50(18):10278-10289. doi: 10.1093/nar/gkac824.
8
Circ-LocNet: A Computational Framework for Circular RNA Sub-Cellular Localization Prediction.Circ-LocNet:一种用于环状 RNA 亚细胞定位预测的计算框架。
Int J Mol Sci. 2022 Jul 26;23(15):8221. doi: 10.3390/ijms23158221.
9
Prokaryotic and eukaryotic promoters identification based on residual network transfer learning.基于残差网络迁移学习的原核生物和真核生物启动子识别
Bioprocess Biosyst Eng. 2022 May;45(5):955-967. doi: 10.1007/s00449-022-02716-w. Epub 2022 Mar 13.
10
Critical assessment of computational tools for prokaryotic and eukaryotic promoter prediction.原核生物和真核生物启动子预测的计算工具的批判性评估。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbab551.

本文引用的文献

1
RegulonDB version 7.0: transcriptional regulation of Escherichia coli K-12 integrated within genetic sensory response units (Gensor Units).RegulonDB 7.0版本:整合在遗传感应反应单元(Gensor单元)内的大肠杆菌K-12转录调控。
Nucleic Acids Res. 2011 Jan;39(Database issue):D98-105. doi: 10.1093/nar/gkq1110. Epub 2010 Nov 4.
2
Eukaryotic and prokaryotic promoter prediction using hybrid approach.使用混合方法进行真核和原核启动子预测。
Theory Biosci. 2011 Jun;130(2):91-100. doi: 10.1007/s12064-010-0114-8. Epub 2010 Nov 3.
3
Promoter prediction in E. coli based on SIDD profiles and Artificial Neural Networks.基于 SIDD 谱和人工神经网络的大肠杆菌启动子预测。
BMC Bioinformatics. 2010 Oct 7;11 Suppl 6(Suppl 6):S17. doi: 10.1186/1471-2105-11-S6-S17.
4
N4: a precise and highly sensitive promoter predictor using neural network fed by nearest neighbors.N4:一种使用由最近邻提供数据的神经网络的精确且高度灵敏的启动子预测器。
Genes Genet Syst. 2009 Dec;84(6):425-30. doi: 10.1266/ggs.84.425.
5
GenBank.GenBank。
Nucleic Acids Res. 2010 Jan;38(Database issue):D46-51. doi: 10.1093/nar/gkp1024. Epub 2009 Nov 12.
6
Mechanisms and evolution of control logic in prokaryotic transcriptional regulation.原核生物转录调控中控制逻辑的机制与进化
Microbiol Mol Biol Rev. 2009 Sep;73(3):481-509, Table of Contents. doi: 10.1128/MMBR.00037-08.
7
Analysis of n-gram based promoter recognition methods and application to whole genome promoter prediction.基于n元语法的启动子识别方法分析及其在全基因组启动子预测中的应用。
In Silico Biol. 2009;9(1-2):S1-16.
8
Transcription initiation by mix and match elements: flexibility for polymerase binding to bacterial promoters.通过混合匹配元件进行转录起始:聚合酶与细菌启动子结合的灵活性
Gene Regul Syst Bio. 2007;1:275-93.
9
Origins of replication in Cyanothece 51142.蓝细菌51142中的复制起点。
Proc Natl Acad Sci U S A. 2008 Dec 30;105(52):E125; author reply E126-7. doi: 10.1073/pnas.0809987106.
10
The regulatory network of E. coli metabolism as a Boolean dynamical system exhibits both homeostasis and flexibility of response.作为布尔动力系统的大肠杆菌代谢调控网络展现出稳态和反应灵活性。
BMC Syst Biol. 2008 Feb 29;2:21. doi: 10.1186/1752-0509-2-21.