Suppr超能文献

PSoL:一种用于寻找非编码RNA基因的仅正样本学习算法。

PSoL: a positive sample only learning algorithm for finding non-coding RNA genes.

作者信息

Wang Chunlin, Ding Chris, Meraz Richard F, Holbrook Stephen R

机构信息

Physical Biosciences Division, Lawrence Berkeley National Laboratory Berkeley, CA 94720, USA.

出版信息

Bioinformatics. 2006 Nov 1;22(21):2590-6. doi: 10.1093/bioinformatics/btl441. Epub 2006 Aug 31.

Abstract

MOTIVATION

Small non-coding RNA (ncRNA) genes play important regulatory roles in a variety of cellular processes. However, detection of ncRNA genes is a great challenge to both experimental and computational approaches. In this study, we describe a new approach called positive sample only learning (PSoL) to predict ncRNA genes in the Escherichia coli genome. Although PSoL is a machine learning method for classification, it requires no negative training data, which, in general, is hard to define properly and affects the performance of machine learning dramatically. In addition, using the support vector machine (SVM) as the core learning algorithm, PSoL can integrate many different kinds of information to improve the accuracy of prediction. Besides the application of PSoL for predicting ncRNAs, PSoL is applicable to many other bioinformatics problems as well.

RESULTS

The PSoL method is assessed by 5-fold cross-validation experiments which show that PSoL can achieve about 80% accuracy in recovery of known ncRNAs. We compared PSoL predictions with five previously published results. The PSoL method has the highest percentage of predictions overlapping with those from other methods.

摘要

动机

小型非编码RNA(ncRNA)基因在多种细胞过程中发挥着重要的调控作用。然而,ncRNA基因的检测对实验方法和计算方法来说都是巨大的挑战。在本研究中,我们描述了一种名为仅正样本学习(PSoL)的新方法,用于预测大肠杆菌基因组中的ncRNA基因。尽管PSoL是一种用于分类的机器学习方法,但它不需要负训练数据,而负训练数据通常很难正确定义,并且会极大地影响机器学习的性能。此外,以支持向量机(SVM)作为核心学习算法,PSoL可以整合许多不同类型的信息以提高预测的准确性。除了将PSoL应用于预测ncRNA外,PSoL也适用于许多其他生物信息学问题。

结果

通过五折交叉验证实验对PSoL方法进行了评估,结果表明PSoL在恢复已知ncRNA方面可以达到约80%的准确率。我们将PSoL的预测结果与之前发表的五个结果进行了比较。PSoL方法与其他方法的预测结果重叠的百分比最高。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验