• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

大型蛋白质数据库揭示了结构互补性和功能局部性。

Large protein databases reveal structural complementarity and functional locality.

作者信息

Szczerbiak Paweł, Szydlowski Lukasz M, Wydmański Witold, Renfrew P Douglas, Leman Julia Koehler, Kosciolek Tomasz

机构信息

Sano Centre for Computational Medicine, Kraków, Poland.

Małopolska Centre of Biotechnology, Jagiellonian University, Kraków, Poland.

出版信息

Nat Commun. 2025 Aug 25;16(1):7925. doi: 10.1038/s41467-025-63250-3.

DOI:10.1038/s41467-025-63250-3
PMID:40854905
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12378989/
Abstract

Recent breakthroughs in protein structure prediction have led to a surge in high-quality 3D models, highlighting the need for efficient computational solutions. In our work, we examine the structural clusters from the AlphaFold Protein Structure Database (AFDB), a high-quality subset of ESMAtlas, and the Microbiome Immunity Project (MIP). We create a single cohesive low-dimensional representation of the resulting protein space. We show that, while each database occupies distinct regions, they collectively exhibit significant overlap in their functional profiles. High-level biological functions tend to cluster in particular regions, revealing a shared functional landscape despite the diverse sources of data. By creating a representation of protein structure space, localizing functional annotations within this space, and providing an open-access web-server for exploration, this work offers insights for future research concerning protein sequence-structure-function relationships, enabling biological questions to be asked about taxonomic assignments, environmental factors, or functional specificity. This approach is generalizable, thus enabling further discovery beyond findings presented here.

摘要

蛋白质结构预测方面的最新突破导致高质量3D模型激增,凸显了对高效计算解决方案的需求。在我们的工作中,我们研究了来自AlphaFold蛋白质结构数据库(AFDB)、ESMAtlas的一个高质量子集以及微生物组免疫项目(MIP)的结构簇。我们创建了所得蛋白质空间的单一连贯低维表示。我们表明,虽然每个数据库占据不同区域,但它们在功能概况上总体表现出显著重叠。高级生物学功能倾向于聚集在特定区域,揭示了尽管数据来源多样但仍存在共享的功能景观。通过创建蛋白质结构空间的表示、在该空间内定位功能注释并提供一个开放获取的网络服务器以供探索,这项工作为未来关于蛋白质序列-结构-功能关系的研究提供了见解,使人们能够就分类学归属、环境因素或功能特异性提出生物学问题。这种方法具有通用性,从而能够在此处呈现的发现之外进行进一步探索。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a95b/12378989/777e93b7c4ab/41467_2025_63250_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a95b/12378989/7ab91ef0fd09/41467_2025_63250_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a95b/12378989/9b95216a3a1c/41467_2025_63250_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a95b/12378989/0ca0dc6b7adc/41467_2025_63250_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a95b/12378989/ebfd819228e7/41467_2025_63250_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a95b/12378989/ddaff48f4c1b/41467_2025_63250_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a95b/12378989/2d655d20c0fa/41467_2025_63250_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a95b/12378989/777e93b7c4ab/41467_2025_63250_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a95b/12378989/7ab91ef0fd09/41467_2025_63250_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a95b/12378989/9b95216a3a1c/41467_2025_63250_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a95b/12378989/0ca0dc6b7adc/41467_2025_63250_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a95b/12378989/ebfd819228e7/41467_2025_63250_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a95b/12378989/ddaff48f4c1b/41467_2025_63250_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a95b/12378989/2d655d20c0fa/41467_2025_63250_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a95b/12378989/777e93b7c4ab/41467_2025_63250_Fig7_HTML.jpg

相似文献

1
Large protein databases reveal structural complementarity and functional locality.大型蛋白质数据库揭示了结构互补性和功能局部性。
Nat Commun. 2025 Aug 25;16(1):7925. doi: 10.1038/s41467-025-63250-3.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
A cross-species analysis of neuroanatomical covariance sex differences in humans and mice.人类和小鼠神经解剖协方差性别差异的跨物种分析。
Biol Sex Differ. 2025 Jul 1;16(1):47. doi: 10.1186/s13293-025-00728-1.
4
Sexual Harassment and Prevention Training性骚扰与预防培训
5
Unveiling the evolution of policies for enhancing protein structure predictions: A comprehensive analysis.揭示增强蛋白质结构预测政策的演变:全面分析。
Comput Biol Med. 2024 Sep;179:108815. doi: 10.1016/j.compbiomed.2024.108815. Epub 2024 Jul 11.
6
Parents' and informal caregivers' views and experiences of communication about routine childhood vaccination: a synthesis of qualitative evidence.父母及非正式照料者关于儿童常规疫苗接种沟通的观点与经历:定性证据综述
Cochrane Database Syst Rev. 2017 Feb 7;2(2):CD011787. doi: 10.1002/14651858.CD011787.pub2.
7
How lived experiences of illness trajectories, burdens of treatment, and social inequalities shape service user and caregiver participation in health and social care: a theory-informed qualitative evidence synthesis.疾病轨迹的生活经历、治疗负担和社会不平等如何影响服务使用者和照顾者参与健康和社会护理:一项基于理论的定性证据综合分析
Health Soc Care Deliv Res. 2025 Jun;13(24):1-120. doi: 10.3310/HGTQ8159.
8
RCSB protein Data Bank: exploring protein 3D similarities via comprehensive structural alignments.RCSB 蛋白质数据库:通过全面的结构比对探索蛋白质 3D 相似性。
Bioinformatics. 2024 Jun 3;40(6). doi: 10.1093/bioinformatics/btae370.
9
Healthcare workers' informal uses of mobile phones and other mobile devices to support their work: a qualitative evidence synthesis.医护人员非正规使用手机和其他移动设备来支持工作:定性证据综合评价。
Cochrane Database Syst Rev. 2024 Aug 27;8(8):CD015705. doi: 10.1002/14651858.CD015705.pub2.
10
Factors that influence parents' and informal caregivers' views and practices regarding routine childhood vaccination: a qualitative evidence synthesis.影响父母和非正式照顾者对常规儿童疫苗接种看法和做法的因素:定性证据综合分析。
Cochrane Database Syst Rev. 2021 Oct 27;10(10):CD013265. doi: 10.1002/14651858.CD013265.pub2.

本文引用的文献

1
Foldclass and Merizo-search: scalable structural similarity search for single- and multi-domain proteins using geometric learning.Foldclass和Merizo-search:使用几何学习对单域和多域蛋白质进行可扩展的结构相似性搜索。
Bioinformatics. 2025 May 6;41(5). doi: 10.1093/bioinformatics/btaf277.
2
Unveiling the cold reality of metamorphic proteins.揭示变质蛋白质的残酷现实。
Proc Natl Acad Sci U S A. 2025 Mar 25;122(12):e2422725122. doi: 10.1073/pnas.2422725122. Epub 2025 Mar 13.
3
S-PLM: Structure-Aware Protein Language Model via Contrastive Learning Between Sequence and Structure.
S-PLM:通过序列与结构之间的对比学习实现的结构感知蛋白质语言模型
Adv Sci (Weinh). 2025 Feb;12(5):e2404212. doi: 10.1002/advs.202404212. Epub 2024 Dec 12.
4
BFVD-a large repository of predicted viral protein structures.BFVD——一个预测病毒蛋白结构的大型数据库。
Nucleic Acids Res. 2025 Jan 6;53(D1):D340-D347. doi: 10.1093/nar/gkae1119.
5
Exploring structural diversity across the protein universe with The Encyclopedia of Domains.利用《结构域百科全书》探索蛋白质宇宙中的结构多样性。
Science. 2024 Nov;386(6721):eadq4946. doi: 10.1126/science.adq4946. Epub 2024 Nov 1.
6
Adaptation to space conditions of novel bacterial species isolated from the International Space Station revealed by functional gene annotations and comparative genome analysis.通过功能基因注释和比较基因组分析揭示了从国际空间站分离的新型细菌物种对空间条件的适应。
Microbiome. 2024 Oct 4;12(1):190. doi: 10.1186/s40168-024-01916-8.
7
Targeted isolation of Methanobrevibacter strains from fecal samples expands the cultivated human archaeome.从粪便样本中靶向分离甲烷短杆菌菌株可扩展人类培养古菌组。
Nat Commun. 2024 Aug 31;15(1):7593. doi: 10.1038/s41467-024-52037-7.
8
Efficient protein structure archiving using ProteStAr.利用 ProteStAr 实现高效的蛋白质结构存档。
Bioinformatics. 2024 Jul 1;40(7). doi: 10.1093/bioinformatics/btae428.
9
Predicting the Structure of Enzymes with Metal Cofactors: The Example of [FeFe] Hydrogenases.预测含金属辅因子酶的结构:以 [FeFe]氢化酶为例。
Int J Mol Sci. 2024 Mar 25;25(7):3663. doi: 10.3390/ijms25073663.
10
CATH 2024: CATH-AlphaFlow Doubles the Number of Structures in CATH and Reveals Nearly 200 New Folds.CATH 2024:CATH-AlphaFlow 将 CATH 中的结构数量增加了一倍,并揭示了近 200 个新结构折叠类型。
J Mol Biol. 2024 Sep 1;436(17):168551. doi: 10.1016/j.jmb.2024.168551. Epub 2024 Mar 27.