Suppr超能文献

POODLE-L:一种用于可靠预测长无序区域的两级支持向量机预测系统。

POODLE-L: a two-level SVM prediction system for reliably predicting long disordered regions.

作者信息

Hirose Shuichi, Shimizu Kana, Kanai Satoru, Kuroda Yutaka, Noguchi Tamotsu

机构信息

PharmaDesign, Inc., Tokyo 104-0032, Japan.

出版信息

Bioinformatics. 2007 Aug 15;23(16):2046-53. doi: 10.1093/bioinformatics/btm302. Epub 2007 Jun 1.

Abstract

MOTIVATION

Recent experimental and theoretical studies have revealed several proteins containing sequence segments that are unfolded under physiological conditions. These segments are called disordered regions. They are actively investigated because of their possible involvement in various biological processes, such as cell signaling, transcriptional and translational regulation. Additionally, disordered regions can represent a major obstacle to high-throughput proteome analysis and often need to be removed from experimental targets. The accurate prediction of long disordered regions is thus expected to provide annotations that are useful for a wide range of applications.

RESULTS

We developed Prediction Of Order and Disorder by machine LEarning (POODLE-L; L stands for long), the Support Vector Machines (SVMs) based method for predicting long disordered regions using 10 kinds of simple physico-chemical properties of amino acid. POODLE-L assembles the output of 10 two-level SVM predictors into a final prediction of disordered regions. The performance of POODLE-L for predicting long disordered regions, which exhibited a Matthew's correlation coefficient of 0.658, was the highest when compared with eight well-established publicly available disordered region predictors.

AVAILABILITY

POODLE-L is freely available at http://mbs.cbrc.jp/poodle/poodle-l.html.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

最近的实验和理论研究揭示了几种蛋白质含有在生理条件下未折叠的序列片段。这些片段被称为无序区域。由于它们可能参与各种生物过程,如细胞信号传导、转录和翻译调控,因此受到了积极的研究。此外,无序区域可能是高通量蛋白质组分析的主要障碍,通常需要从实验目标中去除。因此,准确预测长无序区域有望提供对广泛应用有用的注释。

结果

我们开发了基于机器学习的有序和无序预测方法(POODLE-L;L代表长),这是一种基于支持向量机(SVM)的方法,使用10种简单的氨基酸物理化学性质来预测长无序区域。POODLE-L将10个二级SVM预测器的输出组合成无序区域的最终预测。与八个成熟的公开可用的无序区域预测器相比,POODLE-L预测长无序区域的性能最高,马修斯相关系数为0.658。

可用性

POODLE-L可在http://mbs.cbrc.jp/poodle/poodle-l.html上免费获得。

补充信息

补充数据可在《生物信息学》在线获取。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验