• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

解锁人工智能模型的力量:通过比较分析探索蛋白质折叠预测。

Unlocking the power of AI models: exploring protein folding prediction through comparative analysis.

机构信息

ETS Ingenieros Informáticos, 16771 Universidad Politécnica de Madrid , Madrid, Spain.

Centro de Tecnología Biomédica, 16771 Universidad Politécnica de Madrid , Pozuelo de Alarcón, Madrid, Spain.

出版信息

J Integr Bioinform. 2024 May 27;21(2). doi: 10.1515/jib-2023-0041. eCollection 2024 Jun 1.

DOI:10.1515/jib-2023-0041
PMID:38797876
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11377126/
Abstract

Protein structure determination has made progress with the aid of deep learning models, enabling the prediction of protein folding from protein sequences. However, obtaining accurate predictions becomes essential in certain cases where the protein structure remains undescribed. This is particularly challenging when dealing with rare, diverse structures and complex sample preparation. Different metrics assess prediction reliability and offer insights into result strength, providing a comprehensive understanding of protein structure by combining different models. In a previous study, two proteins named ARM58 and ARM56 were investigated. These proteins contain four domains of unknown function and are present in spp. ARM refers to an antimony resistance marker. The study's main objective is to assess the accuracy of the model's predictions, thereby providing insights into the complexities and supporting metrics underlying these findings. The analysis also extends to the comparison of predictions obtained from other species and organisms. Notably, one of these proteins shares an ortholog with and , leading further significance to our analysis. This attempt underscored the importance of evaluating the diverse outputs from deep learning models, facilitating comparisons across different organisms and proteins. This becomes particularly pertinent in cases where no previous structural information is available.

摘要

在深度学习模型的辅助下,蛋白质结构的测定取得了进展,使得从蛋白质序列预测蛋白质折叠成为可能。然而,在某些情况下,当蛋白质结构仍然未知时,获得准确的预测变得至关重要。在处理罕见、多样的结构和复杂的样品制备时,这尤其具有挑战性。不同的指标评估预测的可靠性,并深入了解结果的强度,通过结合不同的模型,提供对蛋白质结构的全面理解。在之前的一项研究中,研究了两种名为 ARM58 和 ARM56 的蛋白质。这些蛋白质包含四个未知功能的结构域,存在于 spp.中。ARM 是指抗锑标记物。该研究的主要目的是评估模型预测的准确性,从而深入了解这些发现背后的复杂性和支持指标。分析还扩展到比较从其他物种和生物体获得的预测。值得注意的是,这些蛋白质中的一种与 和 具有直系同源物,这进一步增加了我们分析的意义。这一尝试强调了评估深度学习模型多样化输出的重要性,促进了不同生物体和蛋白质之间的比较。在没有先前结构信息的情况下,这一点尤为重要。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a80a/11377126/b2d81b48ed00/j_jib-2023-0041_fig_008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a80a/11377126/ddcf36b60006/j_jib-2023-0041_fig_001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a80a/11377126/f20005a3a7dd/j_jib-2023-0041_fig_002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a80a/11377126/f8049027667f/j_jib-2023-0041_fig_003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a80a/11377126/6d1ee1831ecb/j_jib-2023-0041_fig_004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a80a/11377126/f76771caf794/j_jib-2023-0041_fig_005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a80a/11377126/839a6d04711c/j_jib-2023-0041_fig_006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a80a/11377126/c83e04b105c0/j_jib-2023-0041_fig_007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a80a/11377126/b2d81b48ed00/j_jib-2023-0041_fig_008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a80a/11377126/ddcf36b60006/j_jib-2023-0041_fig_001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a80a/11377126/f20005a3a7dd/j_jib-2023-0041_fig_002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a80a/11377126/f8049027667f/j_jib-2023-0041_fig_003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a80a/11377126/6d1ee1831ecb/j_jib-2023-0041_fig_004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a80a/11377126/f76771caf794/j_jib-2023-0041_fig_005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a80a/11377126/839a6d04711c/j_jib-2023-0041_fig_006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a80a/11377126/c83e04b105c0/j_jib-2023-0041_fig_007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a80a/11377126/b2d81b48ed00/j_jib-2023-0041_fig_008.jpg

相似文献

1
Unlocking the power of AI models: exploring protein folding prediction through comparative analysis.解锁人工智能模型的力量:通过比较分析探索蛋白质折叠预测。
J Integr Bioinform. 2024 May 27;21(2). doi: 10.1515/jib-2023-0041. eCollection 2024 Jun 1.
2
A resource for improved predictions of Trypanosoma and Leishmania protein three-dimensional structure.用于改进锥虫和利什曼原虫蛋白三维结构预测的资源。
PLoS One. 2021 Nov 11;16(11):e0259871. doi: 10.1371/journal.pone.0259871. eCollection 2021.
3
Role of the Trypanosoma brucei HEN1 family methyltransferase in small interfering RNA modification.布氏锥虫HEN1家族甲基转移酶在小干扰RNA修饰中的作用。
Eukaryot Cell. 2014 Jan;13(1):77-86. doi: 10.1128/EC.00233-13. Epub 2013 Nov 1.
4
In silico prediction of the glycosomal enzymes of Leishmania major and trypanosomes.利什曼原虫和锥虫糖体酶的计算机模拟预测。
Mol Biochem Parasitol. 2006 Jun;147(2):193-206. doi: 10.1016/j.molbiopara.2006.02.010. Epub 2006 Mar 9.
5
Molecular comparison of the mitochondrial and cytoplasmic hsp70 of Trypanosoma cruzi, Trypanosoma brucei and Leishmania major.克氏锥虫、布氏锥虫和硕大利什曼原虫线粒体和细胞质热休克蛋白70的分子比较。
J Eukaryot Microbiol. 1995 Sep-Oct;42(5):473-6. doi: 10.1111/j.1550-7408.1995.tb05893.x.
6
Gene identification and comparative molecular modeling of a Trypanosoma rangeli major surface protease.冈比亚锥虫主要表面蛋白酶的基因鉴定与比较分子建模。
J Mol Model. 2013 Aug;19(8):3053-64. doi: 10.1007/s00894-013-1834-8. Epub 2013 Apr 13.
7
Comparative analysis of the kinomes of three pathogenic trypanosomatids: Leishmania major, Trypanosoma brucei and Trypanosoma cruzi.三种致病性锥虫(硕大利什曼原虫、布氏锥虫和克氏锥虫)激酶组的比较分析。
BMC Genomics. 2005 Sep 15;6:127. doi: 10.1186/1471-2164-6-127.
8
The Architecture of Trypanosoma brucei editosomes.布氏锥虫编辑体的结构
Proc Natl Acad Sci U S A. 2016 Oct 18;113(42):E6476-E6485. doi: 10.1073/pnas.1610177113. Epub 2016 Oct 5.
9
Unusual features and localization of the membrane kinome of Trypanosoma brucei.布氏锥虫膜激酶组的非典型特征和定位。
PLoS One. 2021 Oct 15;16(10):e0258814. doi: 10.1371/journal.pone.0258814. eCollection 2021.
10
Oligopeptidase B, a missing enzyme in mammals and a potential drug target for trypanosomatid diseases.寡肽酶 B,哺乳动物中缺失的一种酶,也是针对利什曼原虫病和锥虫病的潜在药物靶点。
Biochimie. 2019 Dec;167:207-216. doi: 10.1016/j.biochi.2019.10.006. Epub 2019 Oct 16.

本文引用的文献

1
Single-sequence protein structure prediction using supervised transformer protein language models.使用监督式转换器蛋白质语言模型进行单序列蛋白质结构预测。
Nat Comput Sci. 2022 Dec;2(12):804-814. doi: 10.1038/s43588-022-00373-3. Epub 2022 Dec 19.
2
Progress at protein structure prediction, as seen in CASP15.在 CASP15 中看到的蛋白质结构预测的进展。
Curr Opin Struct Biol. 2023 Jun;80:102594. doi: 10.1016/j.sbi.2023.102594. Epub 2023 Apr 14.
3
Evolutionary-scale prediction of atomic-level protein structure with a language model.
用语言模型进行原子级蛋白质结构的进化尺度预测。
Science. 2023 Mar 17;379(6637):1123-1130. doi: 10.1126/science.ade2574. Epub 2023 Mar 16.
4
Before and after AlphaFold2: An overview of protein structure prediction.AlphaFold2 前后:蛋白质结构预测概述
Front Bioinform. 2023 Feb 28;3:1120370. doi: 10.3389/fbinf.2023.1120370. eCollection 2023.
5
UniProt: the Universal Protein Knowledgebase in 2023.UniProt:2023 年的通用蛋白质知识库。
Nucleic Acids Res. 2023 Jan 6;51(D1):D523-D531. doi: 10.1093/nar/gkac1052.
6
Benchmarking AlphaFold for protein complex modeling reveals accuracy determinants.基于 AlphaFold 对蛋白质复合物建模的基准测试揭示了准确性的决定因素。
Protein Sci. 2022 Aug;31(8):e4379. doi: 10.1002/pro.4379.
7
AlphaFold2 models indicate that protein sequence determines both structure and dynamics.AlphaFold2 模型表明,蛋白质序列决定了结构和动力学。
Sci Rep. 2022 Jun 23;12(1):10696. doi: 10.1038/s41598-022-14382-9.
8
ColabFold: making protein folding accessible to all.ColabFold:让蛋白质折叠变得人人可用。
Nat Methods. 2022 Jun;19(6):679-682. doi: 10.1038/s41592-022-01488-1. Epub 2022 May 30.
9
AlphaFold Protein Structure Database: massively expanding the structural coverage of protein-sequence space with high-accuracy models.AlphaFold 蛋白质结构数据库:用高精度模型极大地扩展蛋白质序列空间的结构覆盖范围。
Nucleic Acids Res. 2022 Jan 7;50(D1):D439-D444. doi: 10.1093/nar/gkab1061.
10
The trRosetta server for fast and accurate protein structure prediction.TrRosetta 服务器:用于快速准确的蛋白质结构预测。
Nat Protoc. 2021 Dec;16(12):5634-5651. doi: 10.1038/s41596-021-00628-9. Epub 2021 Nov 10.