• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

CoBdock-2:通过结合集成和多模型特征选择方法的混合特征选择提高盲对接性能。

CoBdock-2: enhancing blind docking performance through hybrid feature selection combining ensemble and multimodel feature selection approaches.

作者信息

Ugurlu Sadettin Y

机构信息

Novexus Ltd, Antalya, 07058, Turkey.

Department of Materials Science and Engineering, Faculty of Engineering, Akdeniz University, Dumlupinar Bulvari, Antalya, 07058, Turkey.

出版信息

J Comput Aided Mol Des. 2025 Jul 13;39(1):48. doi: 10.1007/s10822-025-00629-w.

DOI:10.1007/s10822-025-00629-w
PMID:40652425
Abstract

Identifying orthosteric binding sites and predicting small molecule affinities remains a key challenge in virtual screening. While blind docking explores the entire protein surface, its precision is hindered by the vast search space. Cavity detection-guided docking improves accuracy by narrowing focus to predicted pockets, but its effectiveness depends heavily on the quality of cavity detection tools. To overcome these limitations, we developed Consensus Blind Dock (CoBDock), a machine learning-based blind docking method that integrates molecular docking and cavity detection results to enhance binding site and pose prediction. Building on this, CoBDock-2 replaces traditional docking tools by extracting 1D numerical representations from protein, ligand, and interaction structural features, and applying advanced ensemble feature selection techniques. By evaluating 21 feature selection methods across 9,598 features, CoBDock-2 identifies key molecular characteristics of orthosteric binding sites. CoBDock-2 demonstrates consistent improvements over the original CoBDock across benchmark datasets (PDBBind v2020-general, MTi, ADS, DUD-E, CASF-2016), achieving 77% binding site identification accuracy (within 8 Å), 55% ligand pose prediction accuracy (RMSD 2 Å), a 19% reduction in the mean distance to ground truth ligands within the binding site, and an 18.5% decrease in the mean pose RMSD. Statistical analysis across the combined benchmark set confirms the significance of these improvements ( ). Notably, the Weighted Hybrid Feature Selection variant in CoBDock-2 further increases binding site accuracy to 79.8%, demonstrating the benefit of combining multimodel and ensemble feature selection strategies. Variability in predictions also decreased significantly, highlighting enhanced reliability and generalizability. Also, a low-bias hypothetical comparison with a state-of-the-art DiffDock + NMDN method was conducted to position CoBDock-2 relative to modern deep learning-based docking strategies.

摘要

识别正构结合位点并预测小分子亲和力仍然是虚拟筛选中的一项关键挑战。虽然盲目对接会探索整个蛋白质表面,但其精度受到巨大搜索空间的阻碍。腔检测引导对接通过将重点缩小到预测的口袋来提高准确性,但其有效性在很大程度上取决于腔检测工具的质量。为了克服这些限制,我们开发了共识盲目对接(CoBDock),这是一种基于机器学习的盲目对接方法,它整合了分子对接和腔检测结果,以增强结合位点和构象预测。在此基础上,CoBDock-2通过从蛋白质、配体和相互作用结构特征中提取一维数值表示,并应用先进的集成特征选择技术,取代了传统的对接工具。通过评估9598个特征上的21种特征选择方法,CoBDock-2识别了正构结合位点的关键分子特征。CoBDock-2在基准数据集(PDBBind v2020-general、MTi、ADS、DUD-E、CASF-2016)上相对于原始CoBDock表现出持续的改进,实现了77%的结合位点识别准确率(在8 Å范围内)、55%的配体构象预测准确率(RMSD ≤ 2 Å),结合位点内到真实配体的平均距离减少了19%,平均构象RMSD降低了18.5%。对组合基准集的统计分析证实了这些改进的显著性( )。值得注意的是,CoBDock-2中的加权混合特征选择变体将结合位点准确率进一步提高到79.8%,证明了结合多模型和集成特征选择策略的好处。预测的变异性也显著降低,突出了更高的可靠性和通用性。此外,还与一种先进的DiffDock + NMDN方法进行了低偏差假设比较,以将CoBDock-2相对于现代基于深度学习的对接策略进行定位。

相似文献

1
CoBdock-2: enhancing blind docking performance through hybrid feature selection combining ensemble and multimodel feature selection approaches.CoBdock-2:通过结合集成和多模型特征选择方法的混合特征选择提高盲对接性能。
J Comput Aided Mol Des. 2025 Jul 13;39(1):48. doi: 10.1007/s10822-025-00629-w.
2
Stabilizing machine learning for reproducible and explainable results: A novel validation approach to subject-specific insights.稳定机器学习以获得可重复和可解释的结果:一种针对特定个体见解的新型验证方法。
Comput Methods Programs Biomed. 2025 Jun 21;269:108899. doi: 10.1016/j.cmpb.2025.108899.
3
Cobdock: an accurate and practical machine learning-based consensus blind docking method.Cobdock:一种基于机器学习的准确且实用的一致性盲对接方法。
J Cheminform. 2024 Jan 11;16(1):5. doi: 10.1186/s13321-023-00793-x.
4
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
5
CorrEA: A Web Server for Optimizing Correlations between Calculated Energies and Activities in Ligand-Receptor Systems Considering Multiple Binding Site Conformations.CorrEA:一个用于考虑多种结合位点构象优化配体-受体系统中计算能量与活性之间相关性的网络服务器。
J Chem Inf Model. 2025 Jul 14;65(13):7113-7128. doi: 10.1021/acs.jcim.5c00571. Epub 2025 Jun 23.
6
SG-ML-PLAP: A structure-guided machine learning-based scoring function for protein-ligand binding affinity prediction.SG-ML-PLAP:一种基于结构引导的机器学习蛋白质-配体结合亲和力预测评分函数。
Protein Sci. 2025 Jan;34(1):e5257. doi: 10.1002/pro.5257.
7
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
8
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
9
A 4D tensor-enhanced multi-dimensional convolutional neural network for accurate prediction of protein-ligand binding affinity.一种用于准确预测蛋白质-配体结合亲和力的4D张量增强多维卷积神经网络。
Mol Divers. 2024 Dec 23. doi: 10.1007/s11030-024-11044-y.
10
The effect of sample site and collection procedure on identification of SARS-CoV-2 infection.样本采集部位和采集程序对严重急性呼吸综合征冠状病毒2(SARS-CoV-2)感染鉴定的影响。
Cochrane Database Syst Rev. 2024 Dec 16;12(12):CD014780. doi: 10.1002/14651858.CD014780.

本文引用的文献

1
Temperature-Dependent Small-Molecule Solubility Prediction Using MoE-Enhanced Directed Message Passing Neural Networks.使用MoE增强的定向消息传递神经网络进行温度依赖性小分子溶解度预测。
J Chem Inf Model. 2025 Jul 28;65(14):7463-7477. doi: 10.1021/acs.jcim.5c00781. Epub 2025 Jul 10.
2
MSFCL: Drug Combination Risk Level Prediction Based on Multi-Source Feature Fusion and Contrastive Learning.MSFCL:基于多源特征融合与对比学习的药物组合风险水平预测
J Chem Inf Model. 2025 Jul 14;65(13):7285-7301. doi: 10.1021/acs.jcim.5c00913. Epub 2025 Jun 20.
3
MEGA PROTAC, MEGA DOCK-based PROTAC mediated ternary complex formation pipeline with sequential filtering and rank aggregation.
基于MEGA DOCK的MEGA PROTAC介导的三元复合物形成流程,具有顺序过滤和排名聚合。
Sci Rep. 2025 Feb 14;15(1):5545. doi: 10.1038/s41598-024-83558-2.
4
GraphPBSP: Protein binding site prediction based on Graph Attention Network and pre-trained model ProstT5.GraphPBSP:基于图注意力网络和预训练模型ProstT5的蛋白质结合位点预测
Int J Biol Macromol. 2024 Dec;282(Pt 1):136933. doi: 10.1016/j.ijbiomac.2024.136933. Epub 2024 Oct 28.
5
DynamicBind: predicting ligand-specific protein-ligand complex structure with a deep equivariant generative model.动态绑定:使用深度等变生成模型预测配体特异性蛋白质-配体复合物结构。
Nat Commun. 2024 Feb 5;15(1):1071. doi: 10.1038/s41467-024-45461-2.
6
DSDP: A Blind Docking Strategy Accelerated by GPUs.DSDP:一种基于 GPU 的盲对接策略。
J Chem Inf Model. 2023 Jul 24;63(14):4355-4363. doi: 10.1021/acs.jcim.3c00519. Epub 2023 Jun 29.
7
Protein-Ligand Docking in the Machine-Learning Era.蛋白质-配体对接在机器学习时代。
Molecules. 2022 Jul 18;27(14):4568. doi: 10.3390/molecules27144568.
8
ScanNet: an interpretable geometric deep learning model for structure-based protein binding site prediction.ScanNet:一种用于基于结构的蛋白质结合位点预测的可解释几何深度学习模型。
Nat Methods. 2022 Jun;19(6):730-739. doi: 10.1038/s41592-022-01490-7. Epub 2022 May 30.
9
EViS: An Enhanced Virtual Screening Approach Based on Pocket-Ligand Similarity.EViS:一种基于口袋-配体相似性的增强型虚拟筛选方法。
J Chem Inf Model. 2022 Feb 14;62(3):498-510. doi: 10.1021/acs.jcim.1c00944. Epub 2022 Jan 27.
10
Lin_F9: A Linear Empirical Scoring Function for Protein-Ligand Docking.林福 9:一种用于蛋白质-配体对接的线性经验评分函数。
J Chem Inf Model. 2021 Sep 27;61(9):4630-4644. doi: 10.1021/acs.jcim.1c00737. Epub 2021 Sep 1.