• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估用于分子性质预测的机器学习模型:分布外数据上的性能与稳健性

Evaluating Machine Learning Models for Molecular Property Prediction: Performance and Robustness on Out-of-Distribution Data.

作者信息

Fooladi Hosein, Vu Thi Ngoc Lan, Mathea Miriam, Kirchmair Johannes

机构信息

Department of Pharmaceutical Sciences, Division of Pharmaceutical Chemistry, Faculty of Life Sciences, University of Vienna, Josef-Holaubek-Platz 2, 1090 Vienna, Austria.

Christian Doppler Laboratory for Molecular Informatics in the Biosciences, Department of Pharmaceutical Sciences, University of Vienna, 1090 Vienna, Austria.

出版信息

J Chem Inf Model. 2025 Sep 15. doi: 10.1021/acs.jcim.5c00475.

DOI:10.1021/acs.jcim.5c00475
PMID:40947919
Abstract

Today, machine learning models are employed extensively to predict the physicochemical and biological properties of molecules. Their performance is typically evaluated on in-distribution (ID) data, i.e., data originating from the same distribution as the training data. However, the real-world applications of such models often involve molecules that are more distant from the training data, necessitating the assessment of their performance on out-of-distribution (OOD) data. In this work, we investigate and evaluate the performance of 14 machine learning models, including classical approaches like random forests, as well as graph neural network (GNN) methods, such as message-passing graph neural networks, across eight data sets using ten splitting strategies for OOD data generation. First, we investigate what constitutes OOD data in the molecular domain for bioactivity and ADMET prediction tasks. In contrast to the common point of view, we show that both classical machine learning and GNN models work well (not substantially different from random splitting) on data split based on Bemis-Murcko scaffolds. Splitting based on chemical similarity clustering (UMAP-based clustering using ECFP4 fingerprints) poses the most challenging task for both types of models. Second, we investigate the extent to which ID and OOD performance have a positive linear relationship. If a positive correlation holds, models with the best performance on the ID data can be selected with the promise of having the best performance on OOD data. We show that the strength of this linear relationship is strongly related to how the OOD data is generated, i.e., which splitting strategies are used for generating OOD data. While the correlation between ID and OOD performance for scaffold splitting is strong (Pearson's ∼ 0.9), this correlation decreases significantly for all the cluster-based splitting (Pearson's ∼ 0.4). Therefore, the relationship can be more nuanced, and a strong positive correlation is not guaranteed for all OOD scenarios. These findings suggest that OOD performance evaluation and model selection should be carefully aligned with the intended application domain.

摘要

如今,机器学习模型被广泛用于预测分子的物理化学和生物学性质。其性能通常在分布内(ID)数据上进行评估,即源自与训练数据相同分布的数据。然而,此类模型在现实世界中的应用往往涉及与训练数据差异较大的分子,因此有必要评估它们在分布外(OOD)数据上的性能。在这项工作中,我们研究并评估了14种机器学习模型的性能,包括随机森林等经典方法,以及图神经网络(GNN)方法,如消息传递图神经网络,使用十种用于生成OOD数据的分割策略,跨越八个数据集进行评估。首先,我们研究在生物活性和ADMET预测任务的分子领域中,什么构成了OOD数据。与普遍观点不同,我们表明,基于Bemis-Murcko骨架进行数据分割时,经典机器学习模型和GNN模型都表现良好(与随机分割没有实质性差异)。基于化学相似性聚类(使用ECFP4指纹的基于UMAP的聚类)进行分割对这两种类型的模型来说都是最具挑战性的任务。其次,我们研究ID性能和OOD性能在多大程度上具有正线性关系。如果存在正相关关系,那么可以选择在ID数据上表现最佳的模型,有望在OOD数据上也具有最佳性能。我们表明,这种线性关系的强度与OOD数据的生成方式密切相关,即用于生成OOD数据的分割策略。虽然基于骨架分割的ID性能和OOD性能之间的相关性很强(皮尔逊相关系数约为0.9),但对于所有基于聚类的分割,这种相关性会显著降低(皮尔逊相关系数约为0.4)。因此,这种关系可能更加微妙,并非所有OOD场景都能保证有很强的正相关关系。这些发现表明,OOD性能评估和模型选择应与预期的应用领域仔细匹配。

相似文献

1
Evaluating Machine Learning Models for Molecular Property Prediction: Performance and Robustness on Out-of-Distribution Data.评估用于分子性质预测的机器学习模型:分布外数据上的性能与稳健性
J Chem Inf Model. 2025 Sep 15. doi: 10.1021/acs.jcim.5c00475.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Aspects of Genetic Diversity, Host Specificity and Public Health Significance of Single-Celled Intestinal Parasites Commonly Observed in Humans and Mostly Referred to as 'Non-Pathogenic'.人类常见且大多被称为“非致病性”的单细胞肠道寄生虫的遗传多样性、宿主特异性及公共卫生意义
APMIS. 2025 Sep;133(9):e70036. doi: 10.1111/apm.70036.
4
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
5
Post-pandemic planning for maternity care for local, regional, and national maternity systems across the four nations: a mixed-methods study.针对四个地区的地方、区域和国家孕产妇保健系统的疫情后规划:一项混合方法研究。
Health Soc Care Deliv Res. 2025 Sep;13(35):1-25. doi: 10.3310/HHTE6611.
6
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
7
Sexual Harassment and Prevention Training性骚扰与预防培训
8
Leveraging a foundation model zoo for cell similarity search in oncological microscopy across devices.利用基础模型库进行跨设备肿瘤显微镜检查中的细胞相似性搜索。
Front Oncol. 2025 Jun 18;15:1480384. doi: 10.3389/fonc.2025.1480384. eCollection 2025.
9
Short-Term Memory Impairment短期记忆障碍
10
Elbow Fractures Overview肘部骨折概述