• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用常见机器学习模型对灵活3D物体正确识别的稳健性进行基准测试。

Benchmarking the robustness of the correct identification of flexible 3D objects using common machine learning models.

作者信息

Zhang Yang, Vitalis Andreas

机构信息

Department of Biochemistry, University of Zurich, 8057 Zurich, Switzerland.

出版信息

Patterns (N Y). 2025 Jan 10;6(1):101147. doi: 10.1016/j.patter.2024.101147.

DOI:10.1016/j.patter.2024.101147
PMID:39896260
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11783895/
Abstract

True three-dimensional (3D) data are prevalent in domains such as molecular science or computer vision. In these data, machine learning models are often asked to identify objects subject to intrinsic flexibility. Our study introduces two datasets from molecular science to assess the classification robustness of common model/feature combinations. Molecules are flexible, and shapes alone offer intra-class heterogeneities that yield a high risk for confusions. By blocking training and test sets to reduce overlap, we establish a baseline requiring the trained models to abstract from shape. As training data coverage grows, all tested architectures perform better on unseen data with reduced overfitting. Empirically, 2D embeddings of voxelized data produced the best-performing models. Evidently, both featurization and task-appropriate model design are of continued importance, the latter point reinforced by comparisons to recent, more specialized models. Finally, we show that the shape abstraction learned from database samples extends to samples that are evolving explicitly in time.

摘要

真实的三维(3D)数据在分子科学或计算机视觉等领域很常见。在这些数据中,机器学习模型经常被要求识别具有内在灵活性的物体。我们的研究引入了两个来自分子科学的数据集,以评估常见模型/特征组合的分类稳健性。分子是灵活的,仅形状就会带来类内异质性,从而产生混淆的高风险。通过阻止训练集和测试集以减少重叠,我们建立了一个基线,要求训练模型从形状中抽象出来。随着训练数据覆盖范围的扩大,所有测试的架构在未见数据上表现更好,且过拟合减少。根据经验,体素化数据的二维嵌入产生了性能最佳的模型。显然,特征化和适合任务的模型设计都持续重要,通过与最近更专门的模型进行比较,后一点得到了加强。最后,我们表明从数据库样本中学到的形状抽象可以扩展到随时间明确演变的样本。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/afb0/11783895/4f85e3768208/gr9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/afb0/11783895/81b482ba4870/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/afb0/11783895/1122dd8213a4/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/afb0/11783895/d121957d9fdc/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/afb0/11783895/0803169c4d57/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/afb0/11783895/b91e4cd446bb/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/afb0/11783895/966aec2f3935/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/afb0/11783895/1ef120b17932/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/afb0/11783895/79d969a18d20/gr8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/afb0/11783895/4f85e3768208/gr9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/afb0/11783895/81b482ba4870/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/afb0/11783895/1122dd8213a4/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/afb0/11783895/d121957d9fdc/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/afb0/11783895/0803169c4d57/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/afb0/11783895/b91e4cd446bb/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/afb0/11783895/966aec2f3935/gr6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/afb0/11783895/1ef120b17932/gr7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/afb0/11783895/79d969a18d20/gr8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/afb0/11783895/4f85e3768208/gr9.jpg

相似文献

1
Benchmarking the robustness of the correct identification of flexible 3D objects using common machine learning models.使用常见机器学习模型对灵活3D物体正确识别的稳健性进行基准测试。
Patterns (N Y). 2025 Jan 10;6(1):101147. doi: 10.1016/j.patter.2024.101147.
2
How to approach machine learning-based prediction of drug/compound-target interactions.如何进行基于机器学习的药物/化合物-靶点相互作用预测。
J Cheminform. 2023 Feb 6;15(1):16. doi: 10.1186/s13321-023-00689-w.
3
Ensemble machine learning model trained on a new synthesized dataset generalizes well for stress prediction using wearable devices.在新合成数据集上训练的集成机器学习模型,对于使用可穿戴设备进行压力预测具有良好的泛化能力。
J Biomed Inform. 2023 Dec;148:104556. doi: 10.1016/j.jbi.2023.104556. Epub 2023 Dec 2.
4
Single-View 3D Mesh Reconstruction for Seen and Unseen Categories.针对可见和不可见类别的单视图3D网格重建
IEEE Trans Image Process. 2023;32:3746-3758. doi: 10.1109/TIP.2023.3279661. Epub 2023 Jul 7.
5
Open-Pose 3D zero-shot learning: Benchmark and challenges.开放姿态3D零样本学习:基准与挑战
Neural Netw. 2025 Jan;181:106775. doi: 10.1016/j.neunet.2024.106775. Epub 2024 Oct 9.
6
Object recognition in medical images via anatomy-guided deep learning.通过解剖学引导的深度学习实现医学图像中的目标识别。
Med Image Anal. 2022 Oct;81:102527. doi: 10.1016/j.media.2022.102527. Epub 2022 Jun 25.
7
Blood Stain Classification with Hyperspectral Imaging and Deep Neural Networks.高光谱成像与深度神经网络在血痕分类中的应用。
Sensors (Basel). 2020 Nov 21;20(22):6666. doi: 10.3390/s20226666.
8
ROOD-MRI: Benchmarking the robustness of deep learning segmentation models to out-of-distribution and corrupted data in MRI.R O O D-MRI:基准测试深度学习分割模型对 MRI 中分布外和损坏数据的鲁棒性。
Neuroimage. 2023 Sep;278:120289. doi: 10.1016/j.neuroimage.2023.120289. Epub 2023 Jul 24.
9
Hypergraph-Based Multi-Modal Representation for Open-Set 3D Object Retrieval.基于超图的开放集3D物体检索多模态表示
IEEE Trans Pattern Anal Mach Intell. 2024 Apr;46(4):2206-2223. doi: 10.1109/TPAMI.2023.3332768. Epub 2024 Mar 6.
10
Comparing Pre-trained and Feature-Based Models for Prediction of Alzheimer's Disease Based on Speech.基于语音比较预训练模型和基于特征的模型对阿尔茨海默病的预测
Front Aging Neurosci. 2021 Apr 27;13:635945. doi: 10.3389/fnagi.2021.635945. eCollection 2021.

引用本文的文献

1
Nearl: extracting dynamic features from molecular dynamics trajectories for machine learning tasks.Nearl:从分子动力学轨迹中提取用于机器学习任务的动态特征。
Bioinformatics. 2025 Jul 1;41(7). doi: 10.1093/bioinformatics/btaf321.
2
The physics-AI dialogue in drug design.药物设计中的物理与人工智能对话。
RSC Med Chem. 2025 Jan 23;16(4):1499-1515. doi: 10.1039/d4md00869c. eCollection 2025 Apr 16.

本文引用的文献

1
MISATO: machine learning dataset of protein-ligand complexes for structure-based drug discovery.MISATO:基于结构的药物发现的蛋白质-配体复合物的机器学习数据集。
Nat Comput Sci. 2024 May;4(5):367-378. doi: 10.1038/s43588-024-00627-2. Epub 2024 May 10.
2
Feature Noise Boosts DNN Generalization Under Label Noise.特征噪声在标签噪声下提升深度神经网络的泛化能力。
IEEE Trans Neural Netw Learn Syst. 2025 Apr;36(4):7711-7724. doi: 10.1109/TNNLS.2024.3394511. Epub 2025 Apr 4.
3
Accurate structure prediction of biomolecular interactions with AlphaFold 3.
利用 AlphaFold 3 进行生物分子相互作用的精确结构预测。
Nature. 2024 Jun;630(8016):493-500. doi: 10.1038/s41586-024-07487-w. Epub 2024 May 8.
4
MELLODDY: Cross-pharma Federated Learning at Unprecedented Scale Unlocks Benefits in QSAR without Compromising Proprietary Information.美乐蒂:在前所未有的规模上进行跨制药公司联邦学习,在不损害专有信息的情况下,实现 QSAR 的优势。
J Chem Inf Model. 2024 Apr 8;64(7):2331-2344. doi: 10.1021/acs.jcim.3c00799. Epub 2023 Aug 29.
5
3DDPDs: describing protein dynamics for proteochemometric bioactivity prediction. A case for (mutant) G protein-coupled receptors.3DDPDs:用于蛋白质化学计量生物活性预测的蛋白质动力学描述。以(突变型)G蛋白偶联受体为例。
J Cheminform. 2023 Aug 28;15(1):74. doi: 10.1186/s13321-023-00745-5.
6
Rapid Prediction of a Liquid Structure from a Single Molecular Configuration Using Deep Learning.利用深度学习从单个分子构型快速预测液体结构。
J Chem Inf Model. 2023 Jun 26;63(12):3742-3750. doi: 10.1021/acs.jcim.3c00472. Epub 2023 Jun 12.
7
A Small Step Toward Generalizability: Training a Machine Learning Scoring Function for Structure-Based Virtual Screening.迈向可泛化性的一小步:基于结构的虚拟筛选的机器学习打分函数的训练。
J Chem Inf Model. 2023 May 22;63(10):2960-2974. doi: 10.1021/acs.jcim.3c00322. Epub 2023 May 11.
8
PeSTo: parameter-free geometric deep learning for accurate prediction of protein binding interfaces.PeSTo:用于准确预测蛋白质结合界面的无参几何深度学习。
Nat Commun. 2023 Apr 18;14(1):2175. doi: 10.1038/s41467-023-37701-8.
9
Enzyme Substrate Prediction from Three-Dimensional Feature Representations Using Space-Filling Curves.基于空间填充曲线的三维特征表示进行酶底物预测。
J Chem Inf Model. 2023 Mar 13;63(5):1637-1648. doi: 10.1021/acs.jcim.3c00005. Epub 2023 Feb 20.
10
Scoring Functions for Protein-Ligand Binding Affinity Prediction using Structure-Based Deep Learning: A Review.基于结构的深度学习预测蛋白质-配体结合亲和力的评分函数综述
Front Bioinform. 2022 Jun 17;2. doi: 10.3389/fbinf.2022.885983.