• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于CASP16-CAPRI的大规模折叠数据:一项系统性大规模采样实验

MassiveFold Data for CASP16-CAPRI: A Systematic Massive Sampling Experiment.

作者信息

Raouraoua Nessim, Lensink Marc F, Brysbaert Guillaume

机构信息

Univ. Lille, CNRS UMR 8576-UGSF-Unité de Glycobiologie Structurale et Fonctionnelle, Lille, France.

出版信息

Proteins. 2025 Aug 28. doi: 10.1002/prot.70040.

DOI:10.1002/prot.70040
PMID:40874652
Abstract

Massive sampling with AlphaFold2 has become a widely used approach in protein structure prediction. Here we present the MassiveFold CASP16-CAPRI dataset, a systematic, large-scale sampling of both monomeric and multimeric protein targets. By exploiting maximal parallelization, we produced up to 8040 models per target and shared them with the community for collaborative selection and scoring. This collective effort minimizes redundant computation and environmental impact, while granting resource-limited groups - especially those focused on scoring - access to high quality structures. In our analysis, we define an interface-difficulty classification based on DockQ metrics, showing that massive sampling yields the greatest gains on most of the challenging interfaces. Crucially, this classification can be predicted from the median ipTM scores of a routine AF2 run, enabling users to selectively deploy massive sampling only when it is most needed. Combined with a reduction of the massive sampling from 8040 to 2475 predictions, such targeted strategies dramatically cut computation time and resource use with minimal loss of accuracy. Finally, we underscore the persistent challenge of choosing optimal models from massive sampling datasets, emphasizing the need for more robust scoring methods. The MassiveFold datasets, together with AlphaFold ranking scores and CASP and CAPRI assessment metrics, are publicly available at https://github.com/GBLille/CASP16-CAPRI_MassiveFold_Data to accelerate further progress in protein structure prediction and assembly modeling.

摘要

使用AlphaFold2进行大规模采样已成为蛋白质结构预测中一种广泛使用的方法。在此,我们展示了MassiveFold CASP16-CAPRI数据集,这是一个对单体和多聚体蛋白质靶点进行的系统、大规模采样。通过利用最大并行化,我们为每个靶点生成了多达8040个模型,并与社区共享以进行协作选择和评分。这种集体努力最大限度地减少了冗余计算和环境影响,同时使资源有限的团队——尤其是那些专注于评分的团队——能够获得高质量的结构。在我们的分析中,我们基于DockQ指标定义了一种界面难度分类,表明大规模采样在大多数具有挑战性的界面上产生了最大的收益。至关重要的是,这种分类可以从常规AF2运行的中位数ipTM分数预测出来,使用户能够仅在最需要时选择性地部署大规模采样。结合将大规模采样从8040次预测减少到2475次,这种有针对性的策略在准确性损失最小的情况下显著减少了计算时间和资源使用。最后,我们强调了从大规模采样数据集中选择最佳模型的持续挑战,强调需要更强大的评分方法。MassiveFold数据集,连同AlphaFold排名分数以及CASP和CAPRI评估指标,可在https://github.com/GBLille/CASP16-CAPRI_MassiveFold_Data上公开获取,以加速蛋白质结构预测和组装建模的进一步进展。

相似文献

1
MassiveFold Data for CASP16-CAPRI: A Systematic Massive Sampling Experiment.用于CASP16-CAPRI的大规模折叠数据:一项系统性大规模采样实验
Proteins. 2025 Aug 28. doi: 10.1002/prot.70040.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Assessment of Protein Complex Predictions in CASP16: Are we making progress?对蛋白质复合物预测在蛋白质结构预测关键评估第16轮中的评估:我们有进展吗?
bioRxiv. 2025 May 30:2025.05.29.656875. doi: 10.1101/2025.05.29.656875.
4
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
5
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
6
CASP16 Protein Monomer Structure Prediction Assessment.半胱天冬酶16蛋白单体结构预测评估
Proteins. 2025 Aug 17. doi: 10.1002/prot.70031.
7
CASP16 protein monomer structure prediction assessment.半胱天冬酶16(CASP16)蛋白单体结构预测评估
bioRxiv. 2025 Jun 2:2025.05.29.656942. doi: 10.1101/2025.05.29.656942.
8
[Guidelines for the prevention and management of bronchial asthma (2024 edition)].[支气管哮喘防治指南(2024年版)]
Zhonghua Jie He He Hu Xi Za Zhi. 2025 Mar 12;48(3):208-248. doi: 10.3760/cma.j.cn112147-20241013-00601.
9
Home treatment for mental health problems: a systematic review.心理健康问题的居家治疗:一项系统综述
Health Technol Assess. 2001;5(15):1-139. doi: 10.3310/hta5150.
10
Behavioral interventions to reduce risk for sexual transmission of HIV among men who have sex with men.降低男男性行为者中艾滋病毒性传播风险的行为干预措施。
Cochrane Database Syst Rev. 2008 Jul 16(3):CD001230. doi: 10.1002/14651858.CD001230.pub2.