Improved sequence-based prediction of disordered regions with multilayer fusion of multiple information sources.

Affiliation

Department of Electrical and Computer Engineering, University of Alberta, Edmonton, Canada.

Publication information

Bioinformatics. 2010 Sep 15;26(18):i489-96. doi: 10.1093/bioinformatics/btq373.

DOI: 10.1093/bioinformatics/btq373
PMID: 20823312
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2935446/
Abstract

MOTIVATION

Intrinsically disordered proteins play a crucial role in numerous regulatory processes. Their abundance and ubiquity combined with a relatively low quantity of their annotations motivate research toward the development of computational models that predict disordered regions from protein sequences. Although the prediction quality of these methods continues to rise, novel and improved predictors are urgently needed.

RESULTS

We propose a novel method, named MFDp (Multilayered Fusion-based Disorder predictor), that aims to improve over the current disorder predictors. MFDp is an ensemble of three Support Vector Machines specialized for the prediction of short, long and generic disordered regions. It combines three complementary disorder predictors with the sequence, sequence profiles, predicted secondary structure, solvent accessibility, backbone dihedral torsion angles, residue flexibility and B-factors. Our method utilizes a custom-designed set of features that are based on raw predictions and aggregated raw values, and it recognizes various types of disorder. MFDp is compared at the residue level on two datasets against eight recent disorder predictors and top-performing methods from the most recent CASP8 experiment. In spite of using training chains with ≤25% similarity to the test sequences, our method consistently and significantly outperforms the other methods based on the MCC index. MFDp outperforms modern disorder predictors for the binary disorder assignment and provides competitive real-valued predictions. MFDp's outputs are also shown to outperform the other methods in the identification of proteins with long disordered regions.
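
The residue-level comparison above relies on the MCC (Matthews correlation coefficient) of binary disorder assignments. The following is a minimal sketch of how that index can be computed from per-residue labels; it is not the authors' evaluation code. The function names `mcc` and `binarize`, the 0.5 cutoff for turning real-valued scores into binary calls, and the toy labels are all assumptions made for illustration.

```python
import math


def binarize(scores, threshold=0.5):
    """Turn real-valued disorder scores into binary calls.

    The 0.5 threshold is an illustrative assumption, not a value
    taken from the abstract.
    """
    return [1 if s >= threshold else 0 for s in scores]


def mcc(true_labels, predicted_labels):
    """Matthews correlation coefficient for binary labels.

    Labels are 1 for a disordered residue and 0 for an ordered one.
    Returns 0.0 when the denominator is zero (for example, when every
    residue belongs to a single class), which is a common convention.
    """
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label sequences must have the same length")

    tp = tn = fp = fn = 0
    for t, p in zip(true_labels, predicted_labels):
        if t == 1 and p == 1:
            tp += 1
        elif t == 0 and p == 0:
            tn += 1
        elif t == 0 and p == 1:
            fp += 1
        else:
            fn += 1

    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denominator == 0:
        return 0.0
    return (tp * tn - fp * fn) / denominator


if __name__ == "__main__":
    # Hypothetical 8-residue chain: annotated disorder and predicted scores.
    annotated = [1, 1, 1, 0, 0, 0, 0, 0]
    scores = [0.9, 0.8, 0.3, 0.6, 0.2, 0.1, 0.4, 0.05]
    predicted = binarize(scores)
    print(round(mcc(annotated, predicted), 4))
```

MCC ranges from -1 to 1 and uses all four cells of the confusion matrix, so it stays informative when disordered residues are much rarer than ordered ones, which is a likely reason to prefer it over plain accuracy in this setting.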

AVAILABILITY

http://biomine.ece.ualberta.ca/MFDp.html.
