POODLE-L：一种用于可靠预测长无序区域的两级支持向量机预测系统。

POODLE-L: a two-level SVM prediction system for reliably predicting long disordered regions.

作者信息

Hirose Shuichi, Shimizu Kana, Kanai Satoru, Kuroda Yutaka, Noguchi Tamotsu

机构信息

PharmaDesign, Inc., Tokyo 104-0032, Japan.

出版信息

Bioinformatics. 2007 Aug 15;23(16):2046-53. doi: 10.1093/bioinformatics/btm302. Epub 2007 Jun 1.

DOI:10.1093/bioinformatics/btm302

PMID:17545177

Abstract

MOTIVATION

Recent experimental and theoretical studies have revealed several proteins containing sequence segments that are unfolded under physiological conditions. These segments are called disordered regions. They are actively investigated because of their possible involvement in various biological processes, such as cell signaling, transcriptional and translational regulation. Additionally, disordered regions can represent a major obstacle to high-throughput proteome analysis and often need to be removed from experimental targets. The accurate prediction of long disordered regions is thus expected to provide annotations that are useful for a wide range of applications.

RESULTS

We developed Prediction Of Order and Disorder by machine LEarning (POODLE-L; L stands for long), the Support Vector Machines (SVMs) based method for predicting long disordered regions using 10 kinds of simple physico-chemical properties of amino acid. POODLE-L assembles the output of 10 two-level SVM predictors into a final prediction of disordered regions. The performance of POODLE-L for predicting long disordered regions, which exhibited a Matthew's correlation coefficient of 0.658, was the highest when compared with eight well-established publicly available disordered region predictors.

AVAILABILITY

POODLE-L is freely available at http://mbs.cbrc.jp/poodle/poodle-l.html.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

最近的实验和理论研究揭示了几种蛋白质含有在生理条件下未折叠的序列片段。这些片段被称为无序区域。由于它们可能参与各种生物过程，如细胞信号传导、转录和翻译调控，因此受到了积极的研究。此外，无序区域可能是高通量蛋白质组分析的主要障碍，通常需要从实验目标中去除。因此，准确预测长无序区域有望提供对广泛应用有用的注释。

结果

我们开发了基于机器学习的有序和无序预测方法（POODLE-L；L代表长），这是一种基于支持向量机（SVM）的方法，使用10种简单的氨基酸物理化学性质来预测长无序区域。POODLE-L将10个二级SVM预测器的输出组合成无序区域的最终预测。与八个成熟的公开可用的无序区域预测器相比，POODLE-L预测长无序区域的性能最高，马修斯相关系数为0.658。

可用性

POODLE-L可在http://mbs.cbrc.jp/poodle/poodle-l.html上免费获得。

补充信息

补充数据可在《生物信息学》在线获取。

相似文献

POODLE-L: a two-level SVM prediction system for reliably predicting long disordered regions.POODLE-L：一种用于可靠预测长无序区域的两级支持向量机预测系统。

Bioinformatics. 2007 Aug 15;23(16):2046-53. doi: 10.1093/bioinformatics/btm302. Epub 2007 Jun 1.

POODLE-I: disordered region prediction by integrating POODLE series and structural information predictors based on a workflow approach.POODLE-I：基于工作流程方法整合POODLE系列和结构信息预测器进行无序区域预测。

In Silico Biol. 2010;10(3):185-91. doi: 10.3233/ISB-2010-0426.

Support vector machines for prediction of dihedral angle regions.用于预测二面角区域的支持向量机

Bioinformatics. 2006 Dec 15;22(24):3009-15. doi: 10.1093/bioinformatics/btl489. Epub 2006 Sep 27.

Predicting protein stability changes from sequences using support vector machines.使用支持向量机从序列预测蛋白质稳定性变化。

Bioinformatics. 2005 Sep 1;21 Suppl 2:ii54-8. doi: 10.1093/bioinformatics/bti1109.

Predicting disulfide connectivity from protein sequence using multiple sequence feature vectors and secondary structure.使用多序列特征向量和二级结构从蛋白质序列预测二硫键连接性。

Bioinformatics. 2007 Dec 1;23(23):3147-54. doi: 10.1093/bioinformatics/btm505. Epub 2007 Oct 17.

Protein backbone angle prediction with machine learning approaches.基于机器学习方法的蛋白质主链角度预测

Bioinformatics. 2004 Jul 10;20(10):1612-21. doi: 10.1093/bioinformatics/bth136. Epub 2004 Feb 26.

Prediction of unfolded segments in a protein sequence based on amino acid composition.基于氨基酸组成预测蛋白质序列中的未折叠片段。

Bioinformatics. 2005 May 1;21(9):1891-900. doi: 10.1093/bioinformatics/bti266. Epub 2005 Jan 18.

Ensemble classifier for protein fold pattern recognition.用于蛋白质折叠模式识别的集成分类器。

Bioinformatics. 2006 Jul 15;22(14):1717-22. doi: 10.1093/bioinformatics/btl170. Epub 2006 May 3.

Prediction of protein structural class with Rough Sets.基于粗糙集的蛋白质结构类预测

BMC Bioinformatics. 2006 Jan 14;7:20. doi: 10.1186/1471-2105-7-20.

A new representation for protein secondary structure prediction based on frequent patterns.一种基于频繁模式的蛋白质二级结构预测新表示法。

Bioinformatics. 2006 Nov 1;22(21):2628-34. doi: 10.1093/bioinformatics/btl453. Epub 2006 Aug 29.

引用本文的文献

Subversion of mRNA degradation pathways by EWSR1::FLI1 represents a therapeutic vulnerability in Ewing sarcoma.EWSR1::FLI1对mRNA降解途径的破坏是尤因肉瘤的一个治疗弱点。

Nat Commun. 2025 Jul 16;16(1):6537. doi: 10.1038/s41467-025-61725-x.

IDP-EDL: enhancing intrinsically disordered protein prediction by combining protein language model and ensemble deep learning.IDP-EDL：通过结合蛋白质语言模型和集成深度学习增强内在无序蛋白质预测

Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf182.

Computational Prediction of Protein Intrinsically Disordered Region Related Interactions and Functions.计算预测蛋白质无规卷曲区域相关相互作用和功能。

Genes (Basel). 2023 Feb 8;14(2):432. doi: 10.3390/genes14020432.

Exon Elongation Added Intrinsically Disordered Regions to the Encoded Proteins and Facilitated the Emergence of the Last Eukaryotic Common Ancestor.外显子延伸为编码蛋白添加了固有无序区域，并促进了最后一个真核生物共同祖先的出现。

Mol Biol Evol. 2023 Jan 4;40(1). doi: 10.1093/molbev/msac272.

Casein Kinase 1 Regulates Cytorhabdovirus Replication and Transcription by Phosphorylating a Phosphoprotein Serine-Rich Motif.酪蛋白激酶 1 通过磷酸化富含丝氨酸的磷蛋白基序调节细胞弹状病毒的复制和转录。

Plant Cell. 2020 Sep;32(9):2878-2897. doi: 10.1105/tpc.20.00369. Epub 2020 Jul 8.

Identification of Intrinsically Disordered Proteins and Regions by Length-Dependent Predictors Based on Conditional Random Fields.基于条件随机场的长度依赖性预测器识别内在无序蛋白质及区域

Mol Ther Nucleic Acids. 2019 Sep 6;17:396-404. doi: 10.1016/j.omtn.2019.06.004. Epub 2019 Jun 15.

Quality and bias of protein disorder predictors.蛋白质无序预测器的质量和偏差。

Sci Rep. 2019 Mar 26;9(1):5137. doi: 10.1038/s41598-019-41644-w.

Both Intrinsically Disordered Regions and Structural Domains Evolve Rapidly in Immune-Related Mammalian Proteins.免疫相关哺乳动物蛋白中的无规则结构区域和结构域均快速进化。

Int J Mol Sci. 2018 Dec 4;19(12):3860. doi: 10.3390/ijms19123860.

Biological classification with RNA-seq data: Can alternatively spliced transcript expression enhance machine learning classifiers?基于 RNA-seq 数据的生物学分类：剪接转录本表达能否增强机器学习分类器？

RNA. 2018 Sep;24(9):1119-1132. doi: 10.1261/rna.062802.117. Epub 2018 Jun 25.

Accurately Predicting Disordered Regions of Proteins Using Rosetta ResidueDisorder Application.利用 Rosetta ResidueDisorder 应用程序准确预测蛋白质的无序区域。

J Phys Chem B. 2018 Apr 12;122(14):3920-3930. doi: 10.1021/acs.jpcb.8b01763. Epub 2018 Mar 29.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

POODLE-L：一种用于可靠预测长无序区域的两级支持向量机预测系统。

POODLE-L: a two-level SVM prediction system for reliably predicting long disordered regions.

作者信息

机构信息

出版信息

MOTIVATION

RESULTS

AVAILABILITY

SUPPLEMENTARY INFORMATION

动机

结果

可用性

补充信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献