阐明“混沌地带”：困难蛋白建模的进展。

Illuminating the "Twilight Zone": Advances in Difficult Protein Modeling.

机构信息

Department of Synthesis and Chemical Technology of Pharmaceutical Substances with Computer Modelling Laboratory, Medical University of Lublin, Lublin, Poland.

University of Eastern Finland, School of Pharmacy, Kuopio, Finland.

出版信息

Methods Mol Biol. 2023;2627:25-40. doi: 10.1007/978-1-0716-2974-1_2.

DOI:10.1007/978-1-0716-2974-1_2

PMID:36959440

Abstract

Homology modeling was long considered a method of choice in tertiary protein structure prediction. However, it used to provide models of acceptable quality only when templates with appreciable sequence identity with a target could be found. The threshold value was long assumed to be around 20-30%. Below this level, obtained sequence identity was getting dangerously close to values that can be obtained by chance, after aligning any random, unrelated sequences. In these cases, other approaches, including ab initio folding simulations or fragment assembly, were usually employed. The most recent editions of the CASP and CAMEO community-wide modeling methods assessment have brought some surprising outcomes, proving that much more clues can be inferred from protein sequence analyses than previously thought. In this chapter, we focus on recent advances in the field of difficult protein modeling, pushing the threshold deep into the "twilight zone", with particular attention devoted to improvements in applications of machine learning and model evaluation.

摘要

同源建模长期以来被认为是预测蛋白质三级结构的首选方法。然而，过去只有在能够找到与目标具有显著序列同一性的模板时，才能提供具有可接受质量的模型。该阈值长期以来被认为在 20-30%左右。在这个水平以下，获得的序列同一性已经非常接近通过对齐任何随机的、不相关的序列偶然获得的同一性。在这些情况下，通常采用其他方法，包括从头折叠模拟或碎片组装。最近的 CASP 和 CAMEO 社区建模方法评估版本带来了一些令人惊讶的结果，证明从蛋白质序列分析中可以推断出比以前更多的线索。在本章中，我们专注于困难蛋白质建模领域的最新进展，将阈值推向“黄昏地带”，特别关注机器学习和模型评估应用的改进。

相似文献

Illuminating the "Twilight Zone": Advances in Difficult Protein Modeling.阐明“混沌地带”：困难蛋白建模的进展。

Methods Mol Biol. 2023;2627:25-40. doi: 10.1007/978-1-0716-2974-1_2.

Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model.基于超深度学习模型的蛋白质接触图从头精确预测

PLoS Comput Biol. 2017 Jan 5;13(1):e1005324. doi: 10.1371/journal.pcbi.1005324. eCollection 2017 Jan.

Continuous Automated Model EvaluatiOn (CAMEO) complementing the critical assessment of structure prediction in CASP12.连续自动模型评估（CAMEO）对蛋白质结构预测关键评估（CASP12）的补充

Proteins. 2018 Mar;86 Suppl 1(Suppl 1):387-398. doi: 10.1002/prot.25431. Epub 2017 Dec 17.

General overview on structure prediction of twilight-zone proteins.关于暗区蛋白结构预测的概述

Theor Biol Med Model. 2015 Sep 4;12:15. doi: 10.1186/s12976-015-0014-1.

Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13.基于深度学习的蛋白质三级结构建模和 CASP13 中的接触距离预测。

Proteins. 2019 Dec;87(12):1165-1178. doi: 10.1002/prot.25697. Epub 2019 Apr 25.

Assessment of contact predictions in CASP12: Co-evolution and deep learning coming of age.蛋白质结构预测技术关键评估第12轮（CASP12）中的接触预测评估：协同进化与深度学习走向成熟。

Proteins. 2018 Mar;86 Suppl 1(Suppl Suppl 1):51-66. doi: 10.1002/prot.25407. Epub 2017 Nov 7.

Introducing "best single template" models as reference baseline for the Continuous Automated Model Evaluation (CAMEO).引入“最佳单模板”模型作为连续自动化模型评估（CAMEO）的参考基准。

Proteins. 2019 Dec;87(12):1378-1387. doi: 10.1002/prot.25815. Epub 2019 Oct 16.

Protein contact prediction by integrating deep multiple sequence alignments, coevolution and machine learning.通过整合深度多序列比对、协同进化和机器学习进行蛋白质接触预测。

Proteins. 2018 Mar;86 Suppl 1(Suppl 1):84-96. doi: 10.1002/prot.25405. Epub 2017 Oct 31.

Template-based and free modeling of I-TASSER and QUARK pipelines using predicted contact maps in CASP12.在蛋白质结构预测技术评估第12轮（CASP12）中，基于模板以及I-TASSER和QUARK流程的自由建模，并使用预测的接触图。

Proteins. 2018 Mar;86 Suppl 1(Suppl 1):136-151. doi: 10.1002/prot.25414. Epub 2017 Nov 14.

Improving fragment-based ab initio protein structure assembly using low-accuracy contact-map predictions.利用低精度接触图预测改进基于片段的从头蛋白质结构组装。

Nat Commun. 2021 Aug 18;12(1):5011. doi: 10.1038/s41467-021-25316-w.

引用本文的文献

Isolation, characterization, and cloning of thermostable pullulanase from ADM-11.从ADM-11中分离、鉴定和克隆耐热支链淀粉酶

Saudi J Biol Sci. 2024 Feb;31(2):103901. doi: 10.1016/j.sjbs.2023.103901. Epub 2023 Dec 10.

本文引用的文献

AlphaFold 2: Why It Works and Its Implications for Understanding the Relationships of Protein Sequence, Structure, and Function.AlphaFold 2：为何它能奏效，及其对理解蛋白质序列、结构和功能关系的启示。

J Chem Inf Model. 2021 Oct 25;61(10):4827-4831. doi: 10.1021/acs.jcim.1c01114. Epub 2021 Sep 29.

Highly accurate protein structure prediction with AlphaFold.利用 AlphaFold 进行高精度蛋白质结构预测。

Nature. 2021 Aug;596(7873):583-589. doi: 10.1038/s41586-021-03819-2. Epub 2021 Jul 15.

Deep learning methods in protein structure prediction.蛋白质结构预测中的深度学习方法。

Comput Struct Biotechnol J. 2020 Jan 22;18:1301-1310. doi: 10.1016/j.csbj.2019.12.011. eCollection 2020.

Evaluating the significance of contact maps in low-homology protein modeling using contact-assisted threading.使用接触辅助对接评估低同源性蛋白质建模中接触图的意义。

Sci Rep. 2020 Feb 19;10(1):2908. doi: 10.1038/s41598-020-59834-2.

Protein structure predictions by enhanced conformational sampling methods.通过增强构象采样方法进行蛋白质结构预测。

Biophys Physicobiol. 2019 Nov 29;16:344-366. doi: 10.2142/biophysico.16.0_344. eCollection 2019.

Improved protein structure prediction using potentials from deep learning.利用深度学习势进行蛋白质结构预测的改进。

Nature. 2020 Jan;577(7792):706-710. doi: 10.1038/s41586-019-1923-7. Epub 2020 Jan 15.

Improved protein structure prediction using predicted interresidue orientations.利用预测的残基间取向改进蛋白质结构预测。

Proc Natl Acad Sci U S A. 2020 Jan 21;117(3):1496-1503. doi: 10.1073/pnas.1914677117. Epub 2020 Jan 2.

Revisiting the "satisfaction of spatial restraints" approach of MODELLER for protein homology modeling.重新审视 MODELLER 中用于蛋白质同源建模的“满足空间约束”方法。

PLoS Comput Biol. 2019 Dec 17;15(12):e1007219. doi: 10.1371/journal.pcbi.1007219. eCollection 2019 Dec.

QMEANDisCo-distance constraints applied on model quality estimation.QMEANDisCo 距离约束应用于模型质量评估。

Bioinformatics. 2020 Mar 1;36(6):1765-1771. doi: 10.1093/bioinformatics/btz828.

High-accuracy protein structures by combining machine-learning with physics-based refinement.通过将机器学习与基于物理的精修相结合，实现高精度的蛋白质结构预测。

Proteins. 2020 May;88(5):637-642. doi: 10.1002/prot.25847. Epub 2019 Nov 15.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

阐明“混沌地带”：困难蛋白建模的进展。

Illuminating the "Twilight Zone": Advances in Difficult Protein Modeling.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献