Box embeddings for extending ontologies: a data-driven and interpretable approach.

Authors

Memariani Adel, Glauer Martin, Flügel Simon, Neuhaus Fabian, Hastings Janna, Mossakowski Till

Affiliations

Data Science Group (DICE), Heinz Nixdorf Institute, Paderborn University, Warburger Str. 100, 33098, Paderborn, North Rhine-Westphalia, Germany.

Institute for Intelligent Cooperating Systems, Otto von Guericke University, Universitätsplatz 2, 39106, Magdeburg, Saxony-Anhalt, Germany.

Publication information

J Cheminform. 2025 Sep 1;17(1):138. doi: 10.1186/s13321-025-01086-1.

DOI: 10.1186/s13321-025-01086-1
PMID: 40890838
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12403937/
Abstract

Deriving symbolic knowledge from trained deep learning models is challenging due to the lack of transparency in such models. A promising approach to address this issue is to couple a semantic structure with the model outputs and thereby make the model interpretable. In prediction tasks such as multi-label classification, labels tend to form hierarchical relationships. Therefore, we propose enforcing a taxonomical structure on the model's outputs throughout the training phase. In vector space, a taxonomy can be represented using axis-aligned hyper-rectangles, or boxes, which may overlap or nest within one another. The boundaries of a box determine the extent of a particular category. Thus, we used box-shaped embeddings of ontology classes to learn and transparently represent logical relationships that are only implicit in multi-label datasets. We assessed our model by measuring its ability to approximate the full set of inferred subclass relations in the ChEBI ontology, which is an important knowledge base in the field of life science. We demonstrate that our model captures implicit hierarchical relationships among labels, ensuring consistency with the underlying ontological conceptualization, while also achieving state-of-the-art performance in multi-label classification. Notably, this is accomplished without requiring an explicit taxonomy during the training process. SCIENTIFIC CONTRIBUTION: Our proposed approach advances chemical classification by enabling interpretable outputs through a structured and geometrically expressive representation of molecules and their classes.
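
The geometric idea in the abstract (classes as axis-aligned boxes, with nesting standing in for subclass relations) can be illustrated with a minimal sketch. This is not the authors' model or training procedure: the `Box` class, the helper functions, the 2-D coordinates, and the class names below are all hypothetical, chosen only to show how box containment could be read as "is a subclass of" and point-in-box membership as a multi-label prediction.

```python
from dataclasses import dataclass
from typing import Dict, List, Sequence, Tuple


@dataclass(frozen=True)
class Box:
    """Axis-aligned hyper-rectangle defined by its lower and upper corners."""
    lower: Tuple[float, ...]
    upper: Tuple[float, ...]

    def contains_point(self, point: Sequence[float]) -> bool:
        # A point is inside the box if it lies within the bounds on every axis.
        return all(lo <= x <= hi
                   for lo, x, hi in zip(self.lower, point, self.upper))

    def contains_box(self, other: "Box") -> bool:
        # `other` nests inside this box if its bounds fit on every axis.
        return all(lo <= o_lo and o_hi <= hi
                   for lo, o_lo, o_hi, hi
                   in zip(self.lower, other.lower, other.upper, self.upper))


def predict_labels(boxes: Dict[str, Box], point: Sequence[float]) -> List[str]:
    """Multi-label prediction: every class whose box contains the point."""
    return sorted(name for name, box in boxes.items()
                  if box.contains_point(point))


def inferred_subclasses(boxes: Dict[str, Box]) -> List[Tuple[str, str]]:
    """Read (child, parent) pairs off the geometry via box nesting."""
    return sorted((child, parent)
                  for child, c_box in boxes.items()
                  for parent, p_box in boxes.items()
                  if child != parent and p_box.contains_box(c_box))


# Hypothetical 2-D boxes; a trained model would learn such bounds
# in a higher-dimensional embedding space.
boxes = {
    "molecular entity": Box((0.0, 0.0), (10.0, 10.0)),
    "carboxylic acid": Box((1.0, 1.0), (6.0, 6.0)),
    "amino acid": Box((2.0, 2.0), (4.0, 5.0)),
    "alcohol": Box((5.0, 7.0), (9.0, 9.0)),
}

molecule = (3.0, 3.0)  # hypothetical embedding of one molecule
labels = predict_labels(boxes, molecule)
hierarchy = inferred_subclasses(boxes)
print(labels)
print(hierarchy)
```

Because every predicted label comes from the same geometry that encodes the hierarchy, a point inside a nested box is automatically inside all enclosing boxes, so in this toy setup the predictions cannot contradict the inferred subclass relations.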

Figures

Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80e9/12403937/fbf18d46eeb4/13321_2025_1086_Fig1_HTML.jpg
Figure 2: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80e9/12403937/f707483745a2/13321_2025_1086_Fig2_HTML.jpg
Figure 3: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80e9/12403937/89d1ad1fafb9/13321_2025_1086_Fig3_HTML.jpg
Figure 4: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80e9/12403937/37bf58f0d2e4/13321_2025_1086_Fig4_HTML.jpg
Figure 5: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80e9/12403937/ad9b26944f35/13321_2025_1086_Fig5_HTML.jpg
Figure 6: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80e9/12403937/cfd46446fa70/13321_2025_1086_Fig6_HTML.jpg
Figure 7: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80e9/12403937/5c3d0080d2bd/13321_2025_1086_Fig7_HTML.jpg
Figure 8: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80e9/12403937/254203ddb821/13321_2025_1086_Fig8_HTML.jpg
Figure 9: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/80e9/12403937/adce9bf102b6/13321_2025_1086_Fig9_HTML.jpg
