• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于预测蛋白质结构模型错误的综合评分。

A composite score for predicting errors in protein structure models.

作者信息

Eramian David, Shen Min-yi, Devos Damien, Melo Francisco, Sali Andrej, Marti-Renom Marc A

机构信息

Graduate Group in Biophysics, Department of Biopharmaceutical Sciences, University of California at San Francisco 94158, USA.

出版信息

Protein Sci. 2006 Jul;15(7):1653-66. doi: 10.1110/ps.062095806. Epub 2006 Jun 2.

DOI:10.1110/ps.062095806
PMID:16751606
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2242555/
Abstract

Reliable prediction of model accuracy is an important unsolved problem in protein structure modeling. To address this problem, we studied 24 individual assessment scores, including physics-based energy functions, statistical potentials, and machine learning-based scoring functions. Individual scores were also used to construct approximately 85,000 composite scoring functions using support vector machine (SVM) regression. The scores were tested for their abilities to identify the most native-like models from a set of 6000 comparative models of 20 representative protein structures. Each of the 20 targets was modeled using a template of <30% sequence identity, corresponding to challenging comparative modeling cases. The best SVM score outperformed all individual scores by decreasing the average RMSD difference between the model identified as the best of the set and the model with the lowest RMSD (DeltaRMSD) from 0.63 A to 0.45 A, while having a higher Pearson correlation coefficient to RMSD (r=0.87) than any other tested score. The most accurate score is based on a combination of the DOPE non-hydrogen atom statistical potential; surface, contact, and combined statistical potentials from MODPIPE; and two PSIPRED/DSSP scores. It was implemented in the SVMod program, which can now be applied to select the final model in various modeling problems, including fold assignment, target-template alignment, and loop modeling.

摘要

在蛋白质结构建模中,可靠地预测模型准确性是一个重要的未解决问题。为了解决这个问题,我们研究了24种个体评估分数,包括基于物理的能量函数、统计势和基于机器学习的评分函数。还使用个体分数通过支持向量机(SVM)回归构建了约85,000种复合评分函数。测试了这些分数从20个代表性蛋白质结构的6000个比较模型中识别最接近天然结构模型的能力。20个目标中的每一个都使用序列同一性小于30%的模板进行建模,这对应于具有挑战性的比较建模情况。最佳的支持向量机分数通过将被确定为该组中最佳模型与具有最低均方根偏差(RMSD)的模型之间的平均RMSD差异(DeltaRMSD)从0.63 Å降低到0.45 Å,同时与RMSD的皮尔逊相关系数(r = 0.87)高于任何其他测试分数,从而优于所有个体分数。最准确的分数基于DOPE非氢原子统计势、MODPIPE的表面、接触和组合统计势以及两个PSIPRED/DSSP分数的组合。它在SVMod程序中实现,现在可应用于在各种建模问题中选择最终模型,包括折叠分配、目标-模板比对和环建模。

相似文献

1
A composite score for predicting errors in protein structure models.用于预测蛋白质结构模型错误的综合评分。
Protein Sci. 2006 Jul;15(7):1653-66. doi: 10.1110/ps.062095806. Epub 2006 Jun 2.
2
Estimating quality of template-based protein models by alignment stability.通过比对稳定性评估基于模板的蛋白质模型的质量。
Proteins. 2008 May 15;71(3):1255-74. doi: 10.1002/prot.21819.
3
How well can the accuracy of comparative protein structure models be predicted?比较蛋白质结构模型的准确性能被预测到什么程度?
Protein Sci. 2008 Nov;17(11):1881-93. doi: 10.1110/ps.036061.108. Epub 2008 Oct 1.
4
Protein secondary structure prediction with SPARROW.利用 SPARROW 进行蛋白质二级结构预测。
J Chem Inf Model. 2012 Feb 27;52(2):545-56. doi: 10.1021/ci200321u. Epub 2012 Jan 23.
5
Statistical potential for assessment and prediction of protein structures.用于蛋白质结构评估和预测的统计势
Protein Sci. 2006 Nov;15(11):2507-24. doi: 10.1110/ps.062416606.
6
A "FRankenstein's monster" approach to comparative modeling: merging the finest fragments of Fold-Recognition models and iterative model refinement aided by 3D structure evaluation.一种用于比较建模的“科学怪人”方法:融合折叠识别模型的最佳片段,并借助三维结构评估进行迭代模型优化。
Proteins. 2003;53 Suppl 6:369-79. doi: 10.1002/prot.10545.
7
fRMSDPred: predicting local RMSD between structural fragments using sequence information.fRMSDPred:利用序列信息预测结构片段之间的局部均方根偏差。
Comput Syst Bioinformatics Conf. 2007;6:311-22.
8
Quality assessment of modeled protein structure using physicochemical properties.利用物理化学性质对模拟蛋白质结构进行质量评估。
J Bioinform Comput Biol. 2015 Apr;13(2):1550005. doi: 10.1142/S0219720015500055. Epub 2014 Dec 19.
9
Prediction of protein loop conformations using multiscale modeling methods with physical energy scoring functions.使用具有物理能量评分函数的多尺度建模方法预测蛋白质环构象。
J Comput Chem. 2008 Apr 15;29(5):820-31. doi: 10.1002/jcc.20827.
10
Fold recognition by predicted alignment accuracy.通过预测比对准确性进行折叠识别。
IEEE/ACM Trans Comput Biol Bioinform. 2005 Apr-Jun;2(2):157-65. doi: 10.1109/TCBB.2005.24.

引用本文的文献

1
In Silico Screening of Drugs That Target Different Forms of E Protein for Potential Treatment of COVID-19.针对不同形式E蛋白的药物的计算机模拟筛选以用于COVID-19的潜在治疗
Pharmaceuticals (Basel). 2023 Feb 14;16(2):296. doi: 10.3390/ph16020296.
2
Spike-Independent Infection of Human Coronavirus 229E in Bat Cells.人冠状病毒 229E 在蝙蝠细胞中的非依赖性感染。
Microbiol Spectr. 2023 Jun 15;11(3):e0348322. doi: 10.1128/spectrum.03483-22. Epub 2023 May 18.
3
Uncovering cryptic pockets in the SARS-CoV-2 spike glycoprotein.揭示 SARS-CoV-2 刺突糖蛋白中的隐匿口袋。
Structure. 2022 Aug 4;30(8):1062-1074.e4. doi: 10.1016/j.str.2022.05.006. Epub 2022 Jun 3.
4
A Benchmark Dataset for Evaluating Practical Performance of Model Quality Assessment of Homology Models.一个用于评估同源模型质量评估实际性能的基准数据集。
Bioengineering (Basel). 2022 Mar 15;9(3):118. doi: 10.3390/bioengineering9030118.
5
Allosteric perspective on the mutability and druggability of the SARS-CoV-2 Spike protein.变构视角下的 SARS-CoV-2 刺突蛋白的可变性和可成药性。
Structure. 2022 Apr 7;30(4):590-607.e4. doi: 10.1016/j.str.2021.12.011. Epub 2022 Jan 20.
6
Current Approaches in Supersecondary Structures Investigation.当前超二级结构研究方法。
Int J Mol Sci. 2021 Nov 2;22(21):11879. doi: 10.3390/ijms222111879.
7
Molecular modeling, docking and dynamics analysis of lipid droplet associated enzyme Ypr147cp from Saccharomyces cerevisiae.酿酒酵母中脂滴相关酶Ypr147cp的分子建模、对接及动力学分析
Bioinformation. 2021 Jan 31;17(1):132-138. doi: 10.6026/97320630017132. eCollection 2021.
8
Site-Specific Steric Control of SARS-CoV-2 Spike Glycosylation.SARS-CoV-2 刺突糖基化的位点特异性空间位阻控制。
Biochemistry. 2021 Jul 13;60(27):2153-2169. doi: 10.1021/acs.biochem.1c00279. Epub 2021 Jul 2.
9
Structure-Guided Computational Approaches to Unravel Druggable Proteomic Landscape of .用于解析[具体对象]可成药蛋白质组图谱的结构导向计算方法
Front Mol Biosci. 2021 May 7;8:663301. doi: 10.3389/fmolb.2021.663301. eCollection 2021.
10
Can molecular dynamics simulations improve the structural accuracy and virtual screening performance of GPCR models?分子动力学模拟能否提高 GPCR 模型的结构准确性和虚拟筛选性能?
PLoS Comput Biol. 2021 May 13;17(5):e1008936. doi: 10.1371/journal.pcbi.1008936. eCollection 2021 May.

本文引用的文献

1
CONFIDENCE LIMITS ON PHYLOGENIES: AN APPROACH USING THE BOOTSTRAP.系统发育树的置信区间:一种使用自展法的方法。
Evolution. 1985 Jul;39(4):783-791. doi: 10.1111/j.1558-5646.1985.tb00420.x.
2
The victor/FRST function for model quality estimation.用于模型质量评估的胜利者/FRST函数。
J Comput Biol. 2005 Dec;12(10):1316-27. doi: 10.1089/cmb.2005.12.1316.
3
Toward high-resolution de novo structure prediction for small proteins.迈向小蛋白质的高分辨率从头结构预测
Science. 2005 Sep 16;309(5742):1868-71. doi: 10.1126/science.1113801.
4
Practical lessons from protein structure prediction.蛋白质结构预测的实践经验。
Nucleic Acids Res. 2005 Apr 1;33(6):1874-91. doi: 10.1093/nar/gki327. Print 2005.
5
Improving functional annotation of non-synonomous SNPs with information theory.利用信息论改进非同义单核苷酸多态性的功能注释。
Pac Symp Biocomput. 2005:397-408. doi: 10.1142/9789812702456_0038.
6
Structural characterization of components of protein assemblies by comparative modeling and electron cryo-microscopy.通过比较建模和冷冻电子显微镜对蛋白质组装体成分进行结构表征。
J Struct Biol. 2005 Feb;149(2):191-203. doi: 10.1016/j.jsb.2004.11.004.
7
The CATH Domain Structure Database and related resources Gene3D and DHS provide comprehensive domain family information for genome analysis.CATH结构域数据库以及相关资源Gene3D和DHS为基因组分析提供了全面的结构域家族信息。
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D247-51. doi: 10.1093/nar/gki024.
8
The Universal Protein Resource (UniProt).通用蛋白质资源(UniProt)。
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D154-9. doi: 10.1093/nar/gki070.
9
LiveBench-8: the large-scale, continuous assessment of automated protein structure prediction.LiveBench-8:自动化蛋白质结构预测的大规模连续评估
Protein Sci. 2005 Jan;14(1):240-5. doi: 10.1110/ps.04888805.
10
Accurate prediction of solvent accessibility using neural networks-based regression.使用基于神经网络的回归准确预测溶剂可及性。
Proteins. 2004 Sep 1;56(4):753-67. doi: 10.1002/prot.20176.