一种用于蛋白质比较建模的多模板组合算法。

A multi-template combination algorithm for protein comparative modeling.

作者信息

Cheng Jianlin

机构信息

Department of Computer Science, Informatics Institute, University of Missouri, Columbia, MO 65211-2060, USA.

出版信息

BMC Struct Biol. 2008 Mar 17;8:18. doi: 10.1186/1472-6807-8-18.

DOI:10.1186/1472-6807-8-18

PMID:18366648

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2311309/

Abstract

BACKGROUND

Multiple protein templates are commonly used in manual protein structure prediction. However, few automated algorithms of selecting and combining multiple templates are available.

RESULTS

Here we develop an effective multi-template combination algorithm for protein comparative modeling. The algorithm selects templates according to the similarity significance of the alignments between template and target proteins. It combines the whole template-target alignments whose similarity significance score is close to that of the top template-target alignment within a threshold, whereas it only takes alignment fragments from a less similar template-target alignment that align with a sizable uncovered region of the target. We compare the algorithm with the traditional method of using a single top template on the 45 comparative modeling targets (i.e. easy template-based modeling targets) used in the seventh edition of Critical Assessment of Techniques for Protein Structure Prediction (CASP7). The multi-template combination algorithm improves the GDT-TS scores of predicted models by 6.8% on average. The statistical analysis shows that the improvement is significant (p-value < 10-4). Compared with the ideal approach that always uses the best template, the multi-template approach yields only slightly better performance. During the CASP7 experiment, the preliminary implementation of the multi-template combination algorithm (FOLDpro) was ranked second among 67 servers in the category of high-accuracy structure prediction in terms of GDT-TS measure.

CONCLUSION

We have developed a novel multi-template algorithm to improve protein comparative modeling.

摘要

背景

在蛋白质结构预测中，多模板法是一种常用的人工预测方法。然而，目前能自动选择并组合多个模板的算法却很少。

结果

我们开发了一种有效的蛋白质比较建模多模板组合算法。该算法根据模板与目标蛋白比对的相似性显著性来选择模板。它会组合那些相似性显著性得分在阈值范围内且接近最优模板-目标比对得分的完整模板-目标比对，而对于相似度较低的模板-目标比对，仅采用与目标蛋白较大未覆盖区域比对的比对片段。我们将该算法与在蛋白质结构预测技术关键评估（CASP7）第七版中使用的45个比较建模目标（即基于模板的简单建模目标）上使用单一最优模板的传统方法进行了比较。多模板组合算法使预测模型的GDT-TS得分平均提高了6.8%。统计分析表明这种提高具有显著性（p值<10-4）。与始终使用最佳模板的理想方法相比，多模板方法的性能仅略优。在CASP7实验中，多模板组合算法的初步实现（FOLDpro）在基于GDT-TS度量的高精度结构预测类别中，在67个服务器中排名第二。

结论

我们开发了一种新型多模板算法来改进蛋白质比较建模。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2173/2311309/7cb44fe1eb22/1472-6807-8-18-1.jpg

相似文献

A multi-template combination algorithm for protein comparative modeling.一种用于蛋白质比较建模的多模板组合算法。

BMC Struct Biol. 2008 Mar 17;8:18. doi: 10.1186/1472-6807-8-18.

Template-based protein structure modeling using TASSER(VMT.).使用TASSER（VMT.）进行基于模板的蛋白质结构建模。

Proteins. 2012 Feb;80(2):352-61. doi: 10.1002/prot.23183. Epub 2011 Nov 22.

Refined template selection and combination algorithm significantly improves template-based modeling accuracy.优化的模板选择与组合算法显著提高了基于模板的建模精度。

J Bioinform Comput Biol. 2019 Apr;17(2):1950006. doi: 10.1142/S0219720019500069.

Assessment of template-based modeling of protein structure in CASP11.CASP11中基于模板的蛋白质结构建模评估。

Proteins. 2016 Sep;84 Suppl 1(Suppl 1):200-20. doi: 10.1002/prot.25049. Epub 2016 Jun 15.

Comparative protein structure modeling by combining multiple templates and optimizing sequence-to-structure alignments.通过结合多个模板并优化序列与结构比对进行比较蛋白质结构建模。

Bioinformatics. 2007 Oct 1;23(19):2558-65. doi: 10.1093/bioinformatics/btm377. Epub 2007 Sep 6.

A "FRankenstein's monster" approach to comparative modeling: merging the finest fragments of Fold-Recognition models and iterative model refinement aided by 3D structure evaluation.一种用于比较建模的“科学怪人”方法：融合折叠识别模型的最佳片段，并借助三维结构评估进行迭代模型优化。

Proteins. 2003;53 Suppl 6:369-79. doi: 10.1002/prot.10545.

A Stochastic Point Cloud Sampling Method for Multi-Template Protein Comparative Modeling.一种用于多模板蛋白质比较建模的随机点云采样方法。

Sci Rep. 2016 May 10;6:25687. doi: 10.1038/srep25687.

GalaxyTBM: template-based modeling by building a reliable core and refining unreliable local regions.GalaxyTBM：通过构建可靠的核心和精炼不可靠的局部区域来进行基于模板的建模。

BMC Bioinformatics. 2012 Aug 10;13:198. doi: 10.1186/1471-2105-13-198.

Assessment of CASP7 predictions in the high accuracy template-based modeling category.基于高精度模板建模类别的CASP7预测评估。

Proteins. 2007;69 Suppl 8:27-37. doi: 10.1002/prot.21662.

Improving protein tertiary structure prediction by deep learning and distance prediction in CASP14.通过深度学习和距离预测改进 CASP14 中的蛋白质三级结构预测。

Proteins. 2022 Jan;90(1):58-72. doi: 10.1002/prot.26186. Epub 2021 Jul 27.

引用本文的文献

Recent Progress of Protein Tertiary Structure Prediction.蛋白质三级结构预测的最新进展。

Molecules. 2024 Feb 13;29(4):832. doi: 10.3390/molecules29040832.

Contact-Assisted Threading in Low-Homology Protein Modeling.接触辅助线程在低同源性蛋白质建模中的应用。

Methods Mol Biol. 2023;2627:41-59. doi: 10.1007/978-1-0716-2974-1_3.

CRFalign: A Sequence-Structure Alignment of Proteins Based on a Combination of HMM-HMM Comparison and Conditional Random Fields.CRFalign：一种基于 HMM-HMM 比较和条件随机场组合的蛋白质序列-结构比对方法。

Molecules. 2022 Jun 9;27(12):3711. doi: 10.3390/molecules27123711.

MULTICOM2 open-source protein structure prediction system powered by deep learning and distance prediction.基于深度学习和距离预测的 MULTICOM2 开源蛋白质结构预测系统。

Sci Rep. 2021 Jun 23;11(1):13155. doi: 10.1038/s41598-021-92395-6.

Chemical system biology approach to identify multi-targeting FDA inhibitors for treating COVID-19 and associated health complications.化学系统生物学方法鉴定治疗 COVID-19 和相关健康并发症的多靶标 FDA 抑制剂。

J Biomol Struct Dyn. 2022;40(19):9543-9567. doi: 10.1080/07391102.2021.1931451. Epub 2021 Jun 1.

Recent Advances in Protein Homology Detection Propelled by Inter-Residue Interaction Map Threading.基于残基间相互作用图谱穿线法推动的蛋白质同源性检测的最新进展

Front Mol Biosci. 2021 May 11;8:643752. doi: 10.3389/fmolb.2021.643752. eCollection 2021.

Probabilistic divergence of a template-based modelling methodology from the ideal protocol.基于模板的建模方法与理想方案的概率偏差。

J Mol Model. 2021 Jan 7;27(2):25. doi: 10.1007/s00894-020-04640-w.

High throughput virtual screening reveals SARS-CoV-2 multi-target binding natural compounds to lead instant therapy for COVID-19 treatment.高通量虚拟筛选揭示 SARS-CoV-2 多靶结合天然化合物，为 COVID-19 治疗提供即时治疗的先导药物。

Int J Biol Macromol. 2020 Oct 1;160:1-17. doi: 10.1016/j.ijbiomac.2020.05.184. Epub 2020 May 26.

TCRpMHCmodels: Structural modelling of TCR-pMHC class I complexes.TCR-pMHC 模型：TCR-pMHC I 类复合物的结构建模。

Sci Rep. 2019 Oct 10;9(1):14530. doi: 10.1038/s41598-019-50932-4.

Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13.基于深度学习的蛋白质三级结构建模和 CASP13 中的接触距离预测。

Proteins. 2019 Dec;87(12):1165-1178. doi: 10.1002/prot.25697. Epub 2019 Apr 25.

本文引用的文献

Structure prediction for CASP7 targets using extensive all-atom refinement with Rosetta@home.使用Rosetta@home进行广泛的全原子精修对第7届蛋白质结构预测关键评估（CASP7）目标进行结构预测。

Proteins. 2007;69 Suppl 8:118-28. doi: 10.1002/prot.21636.

Template-based modeling and free modeling by I-TASSER in CASP7.在蛋白质结构预测技术评估第7轮（CASP7）中，I-TASSER基于模板的建模和自由建模。

Proteins. 2007;69 Suppl 8:108-17. doi: 10.1002/prot.21702.

Automated server predictions in CASP7.CASP7中的自动化服务器预测。

Proteins. 2007;69 Suppl 8:68-82. doi: 10.1002/prot.21761.

Assessment of CASP7 predictions in the high accuracy template-based modeling category.基于高精度模板建模类别的CASP7预测评估。

Proteins. 2007;69 Suppl 8:27-37. doi: 10.1002/prot.21662.

High accuracy template based modeling by global optimization.通过全局优化实现基于高精度模板的建模。

Proteins. 2007;69 Suppl 8:83-9. doi: 10.1002/prot.21628.

Analysis of TASSER-based CASP7 protein structure prediction results.基于TASSER的CASP7蛋白质结构预测结果分析。

Proteins. 2007;69 Suppl 8:90-7. doi: 10.1002/prot.21649.

Ab initio modeling of small proteins by iterative TASSER simulations.通过迭代TASSER模拟对小蛋白质进行从头建模。

BMC Biol. 2007 May 8;5:17. doi: 10.1186/1741-7007-5-17.

LOMETS: a local meta-threading-server for protein structure prediction.LOMETS：一种用于蛋白质结构预测的局部元线程服务器。

Nucleic Acids Res. 2007;35(10):3375-82. doi: 10.1093/nar/gkm251. Epub 2007 May 3.

TASSER-Lite: an automated tool for protein comparative modeling.TASSER-Lite：一种用于蛋白质比较建模的自动化工具。

Biophys J. 2006 Dec 1;91(11):4180-90. doi: 10.1529/biophysj.106.084293. Epub 2006 Sep 8.

STRUCTFAST: protein sequence remote homology detection and alignment using novel dynamic programming and profile-profile scoring.STRUCTFAST：利用新型动态规划和轮廓-轮廓评分进行蛋白质序列远程同源性检测与比对。

Proteins. 2006 Sep 1;64(4):960-7. doi: 10.1002/prot.21049.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

一种用于蛋白质比较建模的多模板组合算法。

A multi-template combination algorithm for protein comparative modeling.

作者信息

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSION

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献