蛋白质结构预测：挑战、进展与研究范式的转变

Protein Structure Prediction: Challenges, Advances, and the Shift of Research Paradigms.

作者信息

Huang Bin, Kong Lupeng, Wang Chao, Ju Fusong, Zhang Qi, Zhu Jianwei, Gong Tiansu, Zhang Haicang, Yu Chungong, Zheng Wei-Mou, Bu Dongbo

机构信息

Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; University of Chinese Academy of Sciences, Beijing 100049, China.

Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; Changping Laboratory, Beijing 102206, China.

出版信息

Genomics Proteomics Bioinformatics. 2023 Oct;21(5):913-925. doi: 10.1016/j.gpb.2022.11.014. Epub 2023 Mar 30.

DOI:10.1016/j.gpb.2022.11.014

PMID:37001856

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10928435/

Abstract

Protein structure prediction is an interdisciplinary research topic that has attracted researchers from multiple fields, including biochemistry, medicine, physics, mathematics, and computer science. These researchers adopt various research paradigms to attack the same structure prediction problem: biochemists and physicists attempt to reveal the principles governing protein folding; mathematicians, especially statisticians, usually start from assuming a probability distribution of protein structures given a target sequence and then find the most likely structure, while computer scientists formulate protein structure prediction as an optimization problem - finding the structural conformation with the lowest energy or minimizing the difference between predicted structure and native structure. These research paradigms fall into the two statistical modeling cultures proposed by Leo Breiman, namely, data modeling and algorithmic modeling. Recently, we have also witnessed the great success of deep learning in protein structure prediction. In this review, we present a survey of the efforts for protein structure prediction. We compare the research paradigms adopted by researchers from different fields, with an emphasis on the shift of research paradigms in the era of deep learning. In short, the algorithmic modeling techniques, especially deep neural networks, have considerably improved the accuracy of protein structure prediction; however, theories interpreting the neural networks and knowledge on protein folding are still highly desired.

摘要

蛋白质结构预测是一个跨学科的研究课题，吸引了来自多个领域的研究人员，包括生物化学、医学、物理学、数学和计算机科学。这些研究人员采用各种研究范式来攻克同一个结构预测问题：生物化学家和物理学家试图揭示蛋白质折叠的原理；数学家，尤其是统计学家，通常从给定目标序列假设蛋白质结构的概率分布开始，然后找到最可能的结构，而计算机科学家将蛋白质结构预测表述为一个优化问题——找到能量最低的结构构象或最小化预测结构与天然结构之间的差异。这些研究范式属于利奥·布雷曼提出的两种统计建模文化，即数据建模和算法建模。最近，我们也见证了深度学习在蛋白质结构预测方面取得的巨大成功。在这篇综述中，我们对蛋白质结构预测的相关工作进行了概述。我们比较了不同领域研究人员采用的研究范式，重点关注深度学习时代研究范式的转变。简而言之，算法建模技术，尤其是深度神经网络，已经显著提高了蛋白质结构预测的准确性；然而，解释神经网络的理论以及蛋白质折叠方面的知识仍然非常需要。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/837c/10928435/92afb3bdbcc2/gr1.jpg

相似文献

Protein Structure Prediction: Challenges, Advances, and the Shift of Research Paradigms.

Genomics Proteomics Bioinformatics. 2023 Oct;21(5):913-925. doi: 10.1016/j.gpb.2022.11.014. Epub 2023 Mar 30.

Macromolecular crowding: chemistry and physics meet biology (Ascona, Switzerland, 10-14 June 2012).

Phys Biol. 2013 Aug;10(4):040301. doi: 10.1088/1478-3975/10/4/040301. Epub 2013 Aug 2.

Protein tertiary structure modeling driven by deep learning and contact distance prediction in CASP13.

Proteins. 2019 Dec;87(12):1165-1178. doi: 10.1002/prot.25697. Epub 2019 Apr 25.

A novel model-based on FCM-LM algorithm for prediction of protein folding rate.

J Bioinform Comput Biol. 2017 Aug;15(4):1750012. doi: 10.1142/S0219720017500123. Epub 2017 Apr 25.

Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model.

PLoS Comput Biol. 2017 Jan 5;13(1):e1005324. doi: 10.1371/journal.pcbi.1005324. eCollection 2017 Jan.

Fast and accurate Ab Initio Protein structure prediction using deep learning potentials.

PLoS Comput Biol. 2022 Sep 16;18(9):e1010539. doi: 10.1371/journal.pcbi.1010539. eCollection 2022 Sep.

Machine learning in protein structure prediction.

Curr Opin Chem Biol. 2021 Dec;65:1-8. doi: 10.1016/j.cbpa.2021.04.005. Epub 2021 May 18.

Protein structure prediction using multiple deep neural networks in the 13th Critical Assessment of Protein Structure Prediction (CASP13).

Proteins. 2019 Dec;87(12):1141-1148. doi: 10.1002/prot.25834.

Analysis of deep learning methods for blind protein contact prediction in CASP12.

Proteins. 2018 Mar;86 Suppl 1(Suppl 1):67-77. doi: 10.1002/prot.25377. Epub 2017 Sep 6.

Enhancing Evolutionary Couplings with Deep Convolutional Neural Networks.

Cell Syst. 2018 Jan 24;6(1):65-74.e3. doi: 10.1016/j.cels.2017.11.014. Epub 2017 Dec 20.

引用本文的文献

Deep-learning structure elucidation from single-mutant deep mutational scanning.

Nat Commun. 2025 Jul 25;16(1):6874. doi: 10.1038/s41467-025-62261-4.

Using multiple computer-predicted structures as molecular replacement models: application to the antiviral mini-protein LCB2.

IUCrJ. 2025 Jul 1;12(Pt 4):488-501. doi: 10.1107/S2052252525005123.

Characterization of soil-derived Bacillus subtilis metabolites against breast cancer: In vitro and in silico studies.

Saudi Pharm J. 2025 Apr 17;33(1-2):3. doi: 10.1007/s44446-025-00006-6.

An overview on olfaction in the biological, analytical, computational, and machine learning fields.

Arch Pharm (Weinheim). 2025 Jan;358(1):e2400414. doi: 10.1002/ardp.202400414. Epub 2024 Oct 22.

approaches supporting drug repurposing for Leishmaniasis: a scoping review.

EXCLI J. 2024 Sep 3;23:1117-1169. doi: 10.17179/excli2024-7552. eCollection 2024.

A comprehensive review of artificial intelligence for pharmacology research.

Front Genet. 2024 Sep 3;15:1450529. doi: 10.3389/fgene.2024.1450529. eCollection 2024.

In silico approaches to study the human asparagine synthetase: An insight of the interaction between the enzyme active sites and its substrates.

PLoS One. 2024 Aug 2;19(8):e0307448. doi: 10.1371/journal.pone.0307448. eCollection 2024.

Sampling Conformational Ensembles of Highly Dynamic Proteins via Generative Deep Learning.

Res Sq. 2024 Jun 28:rs.3.rs-4301803. doi: 10.21203/rs.3.rs-4301803/v1.

Unraveling the metabolic potential of biocontrol fungi through omics data: a key to enhancing large-scaleapplication strategies.

Acta Biochim Biophys Sin (Shanghai). 2024 Jun 25;56(6):825-832. doi: 10.3724/abbs.2024056.

Exploring DNA Damage and Repair Mechanisms: A Review with Computational Insights.

BioTech (Basel). 2024 Jan 16;13(1):3. doi: 10.3390/biotech13010003.

本文引用的文献

Rotamer-free protein sequence design based on deep learning and self-consistency.

Nat Comput Sci. 2022 Jul;2(7):451-462. doi: 10.1038/s43588-022-00273-6. Epub 2022 Jul 21.

Evolutionary-scale prediction of atomic-level protein structure with a language model.

Science. 2023 Mar 17;379(6637):1123-1130. doi: 10.1126/science.ade2574. Epub 2023 Mar 16.

Accurate and efficient protein sequence design through learning concise local environment of residues.

Bioinformatics. 2023 Mar 1;39(3). doi: 10.1093/bioinformatics/btad122.

Improved AlphaFold modeling with implicit experimental information.

Nat Methods. 2022 Nov;19(11):1376-1382. doi: 10.1038/s41592-022-01645-6. Epub 2022 Oct 20.

Single-sequence protein structure prediction using a language model and deep learning.

Nat Biotechnol. 2022 Nov;40(11):1617-1623. doi: 10.1038/s41587-022-01432-w. Epub 2022 Oct 3.

Robust deep learning-based protein sequence design using ProteinMPNN.

Science. 2022 Oct 7;378(6615):49-56. doi: 10.1126/science.add2187. Epub 2022 Sep 15.

Structure of cytoplasmic ring of nuclear pore complex by integrative cryo-EM and AlphaFold.

Science. 2022 Jun 10;376(6598):eabm9326. doi: 10.1126/science.abm9326.

Neural relational inference to learn long-range allosteric interactions in proteins from molecular dynamics simulations.

Nat Commun. 2022 Mar 29;13(1):1661. doi: 10.1038/s41467-022-29331-3.

Improved prediction of protein-protein interactions using AlphaFold2.

Nat Commun. 2022 Mar 10;13(1):1265. doi: 10.1038/s41467-022-28865-w.

Ultrafast end-to-end protein structure prediction enables high-throughput exploration of uncharacterized proteins.

Proc Natl Acad Sci U S A. 2022 Jan 25;119(4). doi: 10.1073/pnas.2113348119.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

蛋白质结构预测：挑战、进展与研究范式的转变

Protein Structure Prediction: Challenges, Advances, and the Shift of Research Paradigms.

作者信息

Huang Bin, Kong Lupeng, Wang Chao, Ju Fusong, Zhang Qi, Zhu Jianwei, Gong Tiansu, Zhang Haicang, Yu Chungong, Zheng Wei-Mou, Bu Dongbo

机构信息

Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; Changping Laboratory, Beijing 102206, China.

出版信息

Genomics Proteomics Bioinformatics. 2023 Oct;21(5):913-925. doi: 10.1016/j.gpb.2022.11.014. Epub 2023 Mar 30.

DOI:10.1016/j.gpb.2022.11.014

PMID:37001856

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10928435/

Abstract

摘要

蛋白质结构预测：挑战、进展与研究范式的转变

Protein Structure Prediction: Challenges, Advances, and the Shift of Research Paradigms.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

蛋白质结构预测：挑战、进展与研究范式的转变

Protein Structure Prediction: Challenges, Advances, and the Shift of Research Paradigms.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献