Suppr超能文献

AlphaFold2 前后:蛋白质结构预测概述

Before and after AlphaFold2: An overview of protein structure prediction.

作者信息

Bertoline Letícia M F, Lima Angélica N, Krieger Jose E, Teixeira Samantha K

机构信息

Laboratory of Genetics and Molecular Cardiology, Heart Institute, University of São Paulo Medical School, São Paulo, Brazil.

出版信息

Front Bioinform. 2023 Feb 28;3:1120370. doi: 10.3389/fbinf.2023.1120370. eCollection 2023.

Abstract

Three-dimensional protein structure is directly correlated with its function and its determination is critical to understanding biological processes and addressing human health and life science problems in general. Although new protein structures are experimentally obtained over time, there is still a large difference between the number of protein sequences placed in Uniprot and those with resolved tertiary structure. In this context, studies have emerged to predict protein structures by methods based on a template or free modeling. In the last years, different methods have been combined to overcome their individual limitations, until the emergence of AlphaFold2, which demonstrated that predicting protein structure with high accuracy at unprecedented scale is possible. Despite its current impact in the field, AlphaFold2 has limitations. Recently, new methods based on protein language models have promised to revolutionize the protein structural biology allowing the discovery of protein structure and function only from evolutionary patterns present on protein sequence. Even though these methods do not reach AlphaFold2 accuracy, they already covered some of its limitations, being able to predict with high accuracy more than 200 million proteins from metagenomic databases. In this mini-review, we provide an overview of the breakthroughs in protein structure prediction before and after AlphaFold2 emergence.

摘要

蛋白质的三维结构与其功能直接相关,确定其结构对于理解生物学过程以及解决人类健康和生命科学领域的诸多问题至关重要。尽管随着时间推移会通过实验获得新的蛋白质结构,但在UniProt中登记的蛋白质序列数量与已解析三级结构的蛋白质数量之间仍存在巨大差异。在此背景下,出现了一些基于模板或自由建模方法来预测蛋白质结构的研究。在过去几年中,人们将不同方法结合起来以克服各自的局限性,直到AlphaFold2出现,它证明了以前所未有的规模高精度预测蛋白质结构是可行的。尽管AlphaFold2目前在该领域产生了重大影响,但它也存在局限性。最近,基于蛋白质语言模型的新方法有望彻底改变蛋白质结构生物学,使得仅从蛋白质序列中存在的进化模式就能发现蛋白质的结构和功能。尽管这些方法尚未达到AlphaFold2的精度,但它们已经弥补了其一些局限性,能够从宏基因组数据库中高精度预测超过2亿种蛋白质。在这篇小型综述中,我们概述了AlphaFold2出现前后蛋白质结构预测领域的突破。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e263/10011655/5be9bb306287/fbinf-03-1120370-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验