• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

iPromoter-2L:一种双层预测器,通过基于多窗口的 PseKNC 来识别启动子及其类型。

iPromoter-2L: a two-layer predictor for identifying promoters and their types by multi-window-based PseKNC.

机构信息

School of Computer Science and Technology, Harbin Institute of Technology Shenzhen Graduate School, Shenzhen, Guangdong 518055, China.

The Gordon Life Science Institute, Boston, MA 02478, USA.

出版信息

Bioinformatics. 2018 Jan 1;34(1):33-40. doi: 10.1093/bioinformatics/btx579.

DOI:10.1093/bioinformatics/btx579
PMID:28968797
Abstract

MOTIVATION

Being responsible for initiating transaction of a particular gene in genome, promoter is a short region of DNA. Promoters have various types with different functions. Owing to their importance in biological process, it is highly desired to develop computational tools for timely identifying promoters and their types. Such a challenge has become particularly critical and urgent in facing the avalanche of DNA sequences discovered in the postgenomic age. Although some prediction methods were developed, they can only be used to discriminate a specific type of promoters from non-promoters. None of them has the ability to identify the types of promoters. This is due to the facts that different types of promoters may share quite similar consensus sequence pattern, and that the promoters of same type may have considerably different consensus sequences.

RESULTS

To overcome such difficulty, using the multi-window-based PseKNC (pseudo K-tuple nucleotide composition) approach to incorporate the short-, middle-, and long-range sequence information, we have developed a two-layer seamless predictor named as 'iPromoter-2 L'. The first layer serves to identify a query DNA sequence as a promoter or non-promoter, and the second layer to predict which of the following six types the identified promoter belongs to: σ24, σ28, σ32, σ38, σ54 and σ70.

AVAILABILITY AND IMPLEMENTATION

For the convenience of most experimental scientists, a user-friendly and publicly accessible web-server for the powerful new predictor has been established at http://bioinformatics.hitsz.edu.cn/iPromoter-2L/. It is anticipated that iPromoter-2 L will become a very useful high throughput tool for genome analysis.

CONTACT

bliu@hit.edu.cn or dshuang@tongji.edu.cn or kcchou@gordonlifescience.org.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

启动特定基因在基因组中启动子是一段短的 DNA 序列。启动子有不同的类型和不同的功能。由于它们在生物过程中的重要性,开发及时识别启动子及其类型的计算工具是非常需要的。在面对后基因组时代发现的大量 DNA 序列的情况下,这种挑战变得尤为关键和紧迫。尽管已经开发了一些预测方法,但它们只能用于区分特定类型的启动子和非启动子。没有一种方法能够识别启动子的类型。这是因为不同类型的启动子可能具有非常相似的保守序列模式,而同一类型的启动子可能具有相当不同的保守序列。

结果

为了克服这一困难,使用基于多窗口的 PseKNC(伪 K-元核苷酸组成)方法整合短、中、长距离序列信息,我们开发了一种两层无缝预测器,命名为“iPromoter-2L”。第一层用于识别查询 DNA 序列是启动子还是非启动子,第二层用于预测识别出的启动子属于以下六种类型中的哪一种:σ24、σ28、σ32、σ38、σ54 和 σ70。

可用性和实现

为了方便大多数实验科学家,我们在 http://bioinformatics.hitsz.edu.cn/iPromoter-2L/ 上建立了一个用户友好的、公共访问的、功能强大的新预测器的网络服务器。预计 iPromoter-2L 将成为基因组分析的一个非常有用的高通量工具。

联系信息

bliu@hit.edu.cn 或 dshuang@tongji.edu.cn 或 kcchou@gordonlifescience.org。

补充信息

补充数据可在 Bioinformatics 在线获得。

相似文献

1
iPromoter-2L: a two-layer predictor for identifying promoters and their types by multi-window-based PseKNC.iPromoter-2L:一种双层预测器,通过基于多窗口的 PseKNC 来识别启动子及其类型。
Bioinformatics. 2018 Jan 1;34(1):33-40. doi: 10.1093/bioinformatics/btx579.
2
iEnhancer-2L: a two-layer predictor for identifying enhancers and their strength by pseudo k-tuple nucleotide composition.iEnhancer-2L:一种通过伪 k-元核苷酸组成识别增强子及其强度的两层预测器。
Bioinformatics. 2016 Feb 1;32(3):362-9. doi: 10.1093/bioinformatics/btv604. Epub 2015 Oct 17.
3
iPSW(2L)-PseKNC: A two-layer predictor for identifying promoters and their strength by hybrid features via pseudo K-tuple nucleotide composition.iPSW(2L)-PseKNC:一种双层预测器,通过伪 K- 元核苷酸组成的混合特征来识别启动子及其强度。
Genomics. 2019 Dec;111(6):1785-1793. doi: 10.1016/j.ygeno.2018.12.001. Epub 2018 Dec 5.
4
iPromoter-BnCNN: a novel branched CNN-based predictor for identifying and classifying sigma promoters.iPromoter-BnCNN:一种基于分支卷积神经网络的新型预测器,用于识别和分类σ启动子。
Bioinformatics. 2020 Dec 8;36(19):4869-4875. doi: 10.1093/bioinformatics/btaa609.
5
iRSpot-EL: identify recombination spots with an ensemble learning approach.iRSpot-EL:基于集成学习方法识别重组热点。
Bioinformatics. 2017 Jan 1;33(1):35-41. doi: 10.1093/bioinformatics/btw539. Epub 2016 Aug 16.
6
iPro54-PseKNC: a sequence-based predictor for identifying sigma-54 promoters in prokaryote with pseudo k-tuple nucleotide composition.iPro54-PseKNC:一种基于序列的预测工具,用于通过伪k元核苷酸组成识别原核生物中的σ-54启动子。
Nucleic Acids Res. 2014 Dec 1;42(21):12961-72. doi: 10.1093/nar/gku1019. Epub 2014 Oct 31.
7
Identifying Sigma70 Promoters with Novel Pseudo Nucleotide Composition.利用新型伪核苷酸组成识别 Sigma70 启动子。
IEEE/ACM Trans Comput Biol Bioinform. 2019 Jul-Aug;16(4):1316-1321. doi: 10.1109/TCBB.2017.2666141. Epub 2017 Feb 8.
8
iDHS-EL: identifying DNase I hypersensitive sites by fusing three different modes of pseudo nucleotide composition into an ensemble learning framework.iDHS-EL:通过将三种不同模式的伪核苷酸组成融合到一个集成学习框架中,来识别 DNase I 超敏位点。
Bioinformatics. 2016 Aug 15;32(16):2411-8. doi: 10.1093/bioinformatics/btw186. Epub 2016 Apr 8.
9
iNuc-PseKNC: a sequence-based predictor for predicting nucleosome positioning in genomes with pseudo k-tuple nucleotide composition.iNuc-PseKNC:一种基于序列的预测器,用于预测基因组中具有伪 k-元核苷酸组成的核小体定位。
Bioinformatics. 2014 Jun 1;30(11):1522-9. doi: 10.1093/bioinformatics/btu083. Epub 2014 Feb 6.
10
iPromoter-2L2.0: Identifying Promoters and Their Types by Combining Smoothing Cutting Window Algorithm and Sequence-Based Features.iPromoter-2L2.0:结合平滑切割窗口算法和基于序列的特征识别启动子及其类型
Mol Ther Nucleic Acids. 2019 Dec 6;18:80-87. doi: 10.1016/j.omtn.2019.08.008. Epub 2019 Aug 14.

引用本文的文献

1
An ensemble strategy for piRNA identification through hybrid moment-based feature modeling.一种基于混合矩特征建模的piRNA识别集成策略。
Sci Rep. 2025 Aug 18;15(1):30157. doi: 10.1038/s41598-025-14194-7.
2
Regulation of the tagatose catabolic gene cluster and development of a tagatose-inducible expression system in the probiotic Escherichia coli Nissle 1917.益生菌大肠杆菌Nissle 1917中塔格糖分解代谢基因簇的调控及塔格糖诱导表达系统的开发。
Microb Cell Fact. 2025 Jul 4;24(1):158. doi: 10.1186/s12934-025-02771-4.
3
iPro-CSAF: identification of promoters based on convolutional spiking neural networks and spiking attention mechanism.
iPro-CSAF:基于卷积脉冲神经网络和脉冲注意力机制的启动子识别
PeerJ Comput Sci. 2025 Mar 26;11:e2761. doi: 10.7717/peerj-cs.2761. eCollection 2025.
4
DNA sequence analysis landscape: a comprehensive review of DNA sequence analysis task types, databases, datasets, word embedding methods, and language models.DNA序列分析全景:对DNA序列分析任务类型、数据库、数据集、词嵌入方法和语言模型的全面综述。
Front Med (Lausanne). 2025 Apr 8;12:1503229. doi: 10.3389/fmed.2025.1503229. eCollection 2025.
5
Enhancing nucleotide sequence representations in genomic analysis with contrastive optimization.通过对比优化增强基因组分析中的核苷酸序列表示。
Commun Biol. 2025 Mar 29;8(1):517. doi: 10.1038/s42003-025-07902-6.
6
Negative dataset selection impacts machine learning-based predictors for multiple bacterial species promoters.阴性数据集选择会影响基于机器学习的多种细菌物种启动子预测器。
Bioinformatics. 2025 Mar 29;41(4). doi: 10.1093/bioinformatics/btaf135.
7
Ethylene and epoxyethane metabolism in methanotrophic bacteria: comparative genomics and physiological studies using .甲烷营养型细菌中的乙烯和环氧乙烷代谢:比较基因组学和生理研究利用。
Microb Genom. 2024 Oct;10(10). doi: 10.1099/mgen.0.001306.
8
Distribution rules of 8-mer spectra and characterization of evolution state in animal genome sequences.8 元组谱的分布规律及动物基因组序列进化状态的特征。
BMC Genomics. 2024 Sep 12;25(1):855. doi: 10.1186/s12864-024-10786-1.
9
m7GRegpred: substrate prediction of N7-methylguanosine (m7G) writers and readers based on sequencing features.m7GRegpred:基于测序特征的N7-甲基鸟苷(m7G)写入器和读取器的底物预测
Front Genet. 2024 Aug 28;15:1469011. doi: 10.3389/fgene.2024.1469011. eCollection 2024.
10
Enhancing metabolic efficiency via novel constitutive promoters to produce protocatechuic acid in Escherichia coli.通过新型组成型启动子提高代谢效率,在大肠杆菌中生产原儿茶酸。
Appl Microbiol Biotechnol. 2024 Aug 17;108(1):442. doi: 10.1007/s00253-024-13256-6.