Unlocking the power of AI models: exploring protein folding prediction through comparative analysis.

Affiliations

ETS Ingenieros Informáticos, Universidad Politécnica de Madrid, Madrid, Spain.

Centro de Tecnología Biomédica, Universidad Politécnica de Madrid, Pozuelo de Alarcón, Madrid, Spain.

Publication information

J Integr Bioinform. 2024 May 27;21(2). doi: 10.1515/jib-2023-0041. eCollection 2024 Jun 1.

DOI: 10.1515/jib-2023-0041

PMID: 38797876

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11377126/

Abstract

Deep learning models have advanced protein structure determination by enabling the prediction of protein folding from sequence alone. Accurate predictions are essential when a protein's structure has not yet been described, and they are particularly hard to obtain for rare, diverse structures and when sample preparation is complex. Different metrics assess prediction reliability and indicate the strength of a result, and combining different models gives a more comprehensive understanding of protein structure. A previous study investigated two proteins, ARM58 and ARM56, where ARM stands for antimony resistance marker. These proteins contain four domains of unknown function and are present in Leishmania spp. The main objective of this study is to assess the accuracy of the models' predictions, providing insight into their complexities and the metrics that support them. The analysis also extends to comparing predictions obtained for other species and organisms. Notably, one of these proteins has an ortholog in Trypanosoma cruzi and Trypanosoma brucei, lending further significance to the analysis. This work underscores the importance of evaluating the diverse outputs of deep learning models, which facilitates comparisons across different organisms and proteins and is particularly pertinent when no previous structural information is available.
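The abstract does not name the reliability metrics it refers to. As one illustration only, AlphaFold-style predictors report a per-residue confidence score (pLDDT, on a 0 to 100 scale) in the B-factor column of the output PDB file. The sketch below reads that column for C-alpha atoms and summarizes it. The function names, the `make_atom_line` helper, and the cutoff of 70 are assumptions chosen for this example, not details taken from the paper.

```python
from statistics import mean


def make_atom_line(serial, name, res_name, res_seq, b_factor):
    """Build a fixed-width PDB ATOM line (hypothetical helper for demo data)."""
    return (
        f"ATOM  {serial:>5} {name:<4} {res_name:>3} A{res_seq:>4}    "
        f"{0.0:>8.3f}{0.0:>8.3f}{0.0:>8.3f}{1.0:>6.2f}{b_factor:>6.2f}"
    )


def ca_confidence(pdb_text):
    """Return the B-factor value of every C-alpha atom, in file order.

    For AlphaFold-style models this column holds pLDDT; for experimental
    structures it holds a temperature factor, so interpret accordingly.
    """
    scores = []
    for line in pdb_text.splitlines():
        # Atom name occupies columns 13-16, B-factor columns 61-66 (1-based).
        if line.startswith("ATOM") and line[12:16].strip() == "CA":
            scores.append(float(line[60:66]))
    return scores


def summarize(scores, threshold=70.0):
    """Mean score and fraction of residues at or above an assumed cutoff."""
    return {
        "mean": mean(scores),
        "fraction_confident": sum(s >= threshold for s in scores) / len(scores),
    }
```

Running `summarize(ca_confidence(text))` on the predictions from two different models for the same sequence gives a quick, rough way to compare how confident each model is, before moving to a residue-by-residue comparison.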


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a80a/11377126/ddcf36b60006/j_jib-2023-0041_fig_001.jpg
