• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于离散小波变换的伪三核苷酸组成预测启动子。

Predicting promoters by pseudo-trinucleotide compositions based on discrete wavelets transform.

机构信息

School of Pharmacy, Guangdong Pharmaceutical University, Guangzhou 510006, PR China.

出版信息

J Theor Biol. 2013 Feb 21;319:1-7. doi: 10.1016/j.jtbi.2012.11.024. Epub 2012 Dec 2.

DOI:10.1016/j.jtbi.2012.11.024
PMID:23211833
Abstract

In this paper, the discrete wavelet transform was introduced into the trinucleotide compositions and a novel DNA sequence representation technique, namely pseudo-trinucleotide compositions was proposed. The pseudo-trinucleotide compositions based on discrete wavelets transform were then employed to model support vector machines (SVM) for the prediction of promoters. The model was evaluated on the genie dataset, and the overall prediction accuracy (ACC) by jackknife validation for the classification of promoters, introns and exons was 82.46%, while the ACC for the classification of promoters and unpromoters was 82.18%, which was far better than the previous results. The satisfied prediction result revealed that the pseudo-trinucleotide composition based on discrete wavelet transform was an effective representation method for DNA sequence, and plays a very important role in the prediction of DNA function.

摘要

在本文中,我们将离散小波变换引入三核苷酸组成中,提出了一种新的 DNA 序列表示技术,即伪三核苷酸组成。然后,基于离散小波变换的伪三核苷酸组成被用于支持向量机 (SVM) 建模,以预测启动子。该模型在 genie 数据集上进行了评估,通过 Jackknife 验证对启动子、内含子和外显子的分类的总体预测准确率 (ACC) 为 82.46%,而对启动子和非启动子的分类的 ACC 为 82.18%,这明显优于以前的结果。满意的预测结果表明,基于离散小波变换的伪三核苷酸组成是一种有效的 DNA 序列表示方法,在 DNA 功能预测中起着非常重要的作用。

相似文献

1
Predicting promoters by pseudo-trinucleotide compositions based on discrete wavelets transform.基于离散小波变换的伪三核苷酸组成预测启动子。
J Theor Biol. 2013 Feb 21;319:1-7. doi: 10.1016/j.jtbi.2012.11.024. Epub 2012 Dec 2.
2
Predicting methylation status of human DNA sequences by pseudo-trinucleotide composition.基于伪三核苷酸组成预测人类 DNA 序列的甲基化状态。
Talanta. 2011 Aug 15;85(2):1143-7. doi: 10.1016/j.talanta.2011.05.043. Epub 2011 May 27.
3
A novel algorithm combining support vector machine with the discrete wavelet transform for the prediction of protein subcellular localization.一种将支持向量机与离散小波变换相结合的新型算法,用于预测蛋白质亚细胞定位。
Comput Biol Med. 2012 Feb;42(2):180-7. doi: 10.1016/j.compbiomed.2011.11.006. Epub 2011 Dec 6.
4
Prediction of methylation CpGs and their methylation degrees in human DNA sequences.预测人类 DNA 序列中的甲基化 CpG 及其甲基化程度。
Comput Biol Med. 2012 Apr;42(4):408-13. doi: 10.1016/j.compbiomed.2011.12.008. Epub 2011 Dec 29.
5
Pol II promoter prediction using characteristic 4-mer motifs: a machine learning approach.利用特征四聚体基序进行RNA聚合酶II启动子预测:一种机器学习方法
BMC Bioinformatics. 2008 Oct 4;9:414. doi: 10.1186/1471-2105-9-414.
6
Prediction of membrane proteins using split amino acid and ensemble classification.基于氨基酸分割和集成分类的膜蛋白预测。
Amino Acids. 2012 Jun;42(6):2447-60. doi: 10.1007/s00726-011-1053-5. Epub 2011 Aug 18.
7
A new hybrid fractal algorithm for predicting thermophilic nucleotide sequences.一种用于预测嗜热核苷酸序列的新型混合分形算法。
J Theor Biol. 2012 Jan 21;293:74-81. doi: 10.1016/j.jtbi.2011.09.028. Epub 2011 Oct 10.
8
Kernel machines for epilepsy diagnosis via EEG signal classification: a comparative study.基于脑电信号分类的癫痫诊断核机器:一项对比研究。
Artif Intell Med. 2011 Oct;53(2):83-95. doi: 10.1016/j.artmed.2011.07.003. Epub 2011 Aug 17.
9
Prediction of protein structural classes by Chou's pseudo amino acid composition: approached using continuous wavelet transform and principal component analysis.基于周式伪氨基酸组成预测蛋白质结构类别:采用连续小波变换和主成分分析方法
Amino Acids. 2009 Jul;37(2):415-25. doi: 10.1007/s00726-008-0170-2. Epub 2008 Aug 23.
10
Using support vector machines for prediction of protein structural classes based on discrete wavelet transform.基于离散小波变换,使用支持向量机预测蛋白质结构类别。
J Comput Chem. 2009 Jun;30(8):1344-50. doi: 10.1002/jcc.21115.

引用本文的文献

1
M-band wavelet-based multi-view clustering of cells.基于M波段小波的细胞多视图聚类
PLoS Comput Biol. 2025 May 23;21(5):e1013060. doi: 10.1371/journal.pcbi.1013060. eCollection 2025 May.
2
Circ-LocNet: A Computational Framework for Circular RNA Sub-Cellular Localization Prediction.Circ-LocNet:一种用于环状 RNA 亚细胞定位预测的计算框架。
Int J Mol Sci. 2022 Jul 26;23(15):8221. doi: 10.3390/ijms23158221.
3
70ProPred: a predictor for discovering sigma70 promoters based on combining multiple features.70ProPred:一种基于多特征组合发现σ70启动子的预测工具。
BMC Syst Biol. 2018 Apr 24;12(Suppl 4):44. doi: 10.1186/s12918-018-0570-1.
4
Prediction of aptamer-protein interacting pairs using an ensemble classifier in combination with various protein sequence attributes.使用集成分类器结合各种蛋白质序列属性预测适配体-蛋白质相互作用对。
BMC Bioinformatics. 2016 May 31;17(1):225. doi: 10.1186/s12859-016-1087-5.
5
Assessing the effects of data selection and representation on the development of reliable E. coli sigma 70 promoter region predictors.评估数据选择和表示对可靠的大肠杆菌σ70启动子区域预测器开发的影响。
PLoS One. 2015 Mar 24;10(3):e0119721. doi: 10.1371/journal.pone.0119721. eCollection 2015.
6
iPro54-PseKNC: a sequence-based predictor for identifying sigma-54 promoters in prokaryote with pseudo k-tuple nucleotide composition.iPro54-PseKNC:一种基于序列的预测工具,用于通过伪k元核苷酸组成识别原核生物中的σ-54启动子。
Nucleic Acids Res. 2014 Dec 1;42(21):12961-72. doi: 10.1093/nar/gku1019. Epub 2014 Oct 31.
7
Prediction of DNase I hypersensitive sites by using pseudo nucleotide compositions.利用伪核苷酸组成预测脱氧核糖核酸酶I超敏感位点
ScientificWorldJournal. 2014;2014:740506. doi: 10.1155/2014/740506. Epub 2014 Aug 19.