• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

语义相似性和本体论的机器学习。

Semantic similarity and machine learning with ontologies.

机构信息

King Abdullah University of Science and Technology.

Computational Bioscience Research Center and lead of the Structural and Functional Bioinformatics Group at King Abdullah University of Science and Technology.

出版信息

Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa199.

DOI:10.1093/bib/bbaa199
PMID:33049044
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8293838/
Abstract

Ontologies have long been employed in the life sciences to formally represent and reason over domain knowledge and they are employed in almost every major biological database. Recently, ontologies are increasingly being used to provide background knowledge in similarity-based analysis and machine learning models. The methods employed to combine ontologies and machine learning are still novel and actively being developed. We provide an overview over the methods that use ontologies to compute similarity and incorporate them in machine learning methods; in particular, we outline how semantic similarity measures and ontology embeddings can exploit the background knowledge in ontologies and how ontologies can provide constraints that improve machine learning models. The methods and experiments we describe are available as a set of executable notebooks, and we also provide a set of slides and additional resources at https://github.com/bio-ontology-research-group/machine-learning-with-ontologies.

摘要

本体在生命科学中被长期用于正式表示和推理领域知识,并且几乎被用于每个主要的生物数据库中。最近,本体越来越多地被用于在基于相似性的分析和机器学习模型中提供背景知识。用于结合本体和机器学习的方法仍然是新颖的,并且正在积极开发中。我们提供了一种概述,介绍了使用本体计算相似性并将其纳入机器学习方法的方法;特别是,我们概述了语义相似性度量和本体嵌入如何利用本体中的背景知识,以及本体如何提供可以改进机器学习模型的约束。我们描述的方法和实验可作为一组可执行的笔记本使用,我们还在 https://github.com/bio-ontology-research-group/machine-learning-with-ontologies 上提供了一组幻灯片和其他资源。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32d3/8293838/cfd0db0c4754/bbaa199f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32d3/8293838/521ebc3c2298/bbaa199f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32d3/8293838/cfd0db0c4754/bbaa199f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32d3/8293838/521ebc3c2298/bbaa199f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/32d3/8293838/cfd0db0c4754/bbaa199f2.jpg

相似文献

1
Semantic similarity and machine learning with ontologies.语义相似性和本体论的机器学习。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa199.
2
mOWL: Python library for machine learning with biomedical ontologies.mOWL:用于生物医学本体机器学习的 Python 库。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac811.
3
Formal axioms in biomedical ontologies improve analysis and interpretation of associated data.生物医学本体论中的形式公理可改善相关数据的分析和解释。
Bioinformatics. 2020 Apr 1;36(7):2229-2236. doi: 10.1093/bioinformatics/btz920.
4
simona: a comprehensive R package for semantic similarity analysis on bio-ontologies.Simona:一个用于生物本体语义相似性分析的综合 R 包。
BMC Genomics. 2024 Sep 16;25(1):869. doi: 10.1186/s12864-024-10759-4.
5
Onto2Vec: joint vector-based representation of biological entities and their ontology-based annotations.Onto2Vec:基于向量的生物实体联合表示及其基于本体论的标注。
Bioinformatics. 2018 Jul 1;34(13):i52-i60. doi: 10.1093/bioinformatics/bty259.
6
Multi-Ontology Refined Embeddings (MORE): A hybrid multi-ontology and corpus-based semantic representation model for biomedical concepts.多本体精炼嵌入模型(MORE):一种基于混合多本体和语料库的生物医学概念语义表示模型。
J Biomed Inform. 2020 Nov;111:103581. doi: 10.1016/j.jbi.2020.103581. Epub 2020 Oct 1.
7
Bootstrapping Adversarial Learning of Biomedical Ontology Alignments.生物医学本体对齐的自训练对抗学习
AMIA Annu Symp Proc. 2020 Mar 4;2019:627-636. eCollection 2019.
8
Evolving knowledge graph similarity for supervised learning in complex biomedical domains.用于复杂生物医学领域中监督学习的进化知识图相似度。
BMC Bioinformatics. 2020 Jan 3;21(1):6. doi: 10.1186/s12859-019-3296-1.
9
Automated ontology generation framework powered by linked biomedical ontologies for disease-drug domain.基于链接生物医学本体的疾病-药物领域自动化本体生成框架。
Comput Methods Programs Biomed. 2018 Oct;165:117-128. doi: 10.1016/j.cmpb.2018.08.010. Epub 2018 Aug 16.
10
OPA2Vec: combining formal and informal content of biomedical ontologies to improve similarity-based prediction.OPA2Vec:结合生物医学本体的正式和非正式内容以改进基于相似度的预测。
Bioinformatics. 2019 Jun 1;35(12):2133-2140. doi: 10.1093/bioinformatics/bty933.

引用本文的文献

1
Pathway Analysis Interpretation in the Multi-Omic Era.多组学时代的通路分析解读
BioTech (Basel). 2025 Jul 29;14(3):58. doi: 10.3390/biotech14030058.
2
The application of Large Language Models to the phenotype-based prioritization of causative genes in rare disease patients.大语言模型在基于表型对罕见病患者致病基因进行优先级排序中的应用。
Sci Rep. 2025 Apr 29;15(1):15093. doi: 10.1038/s41598-025-99539-y.
3
Gene expression knowledge graph for patient representation and diabetes prediction.用于患者表征和糖尿病预测的基因表达知识图谱。

本文引用的文献

1
Knowledge-Based Biomedical Data Science.基于知识的生物医学数据科学
Annu Rev Biomed Data Sci. 2020 Jul;3:23-41. doi: 10.1146/annurev-biodatasci-010820-091627. Epub 2020 Apr 7.
2
DeepPheno: Predicting single gene loss-of-function phenotypes using an ontology-aware hierarchical classifier.DeepPheno:使用本体感知层次分类器预测单基因功能丧失表型。
PLoS Comput Biol. 2020 Nov 18;16(11):e1008453. doi: 10.1371/journal.pcbi.1008453. eCollection 2020 Nov.
3
Predicting candidate genes from phenotypes, functions and anatomical site of expression.
J Biomed Semantics. 2025 Mar 8;16(1):2. doi: 10.1186/s13326-025-00325-6.
4
LCRAnnotationsDB: a database of low complexity regions functional and structural annotations.LCR注释数据库:一个关于低复杂度区域功能和结构注释的数据库。
BMC Genomics. 2024 Dec 27;25(1):1251. doi: 10.1186/s12864-024-10960-5.
5
Syntactic analysis of SMOSS model combined with improved LSTM model: Taking English writing teaching as an example.基于 SMOSS 模型的句法分析与改进型 LSTM 模型的结合:以英语写作教学为例。
PLoS One. 2024 Nov 15;19(11):e0312049. doi: 10.1371/journal.pone.0312049. eCollection 2024.
6
Early detection of mild cognitive impairment through neuropsychological tests in population screenings: a decision support system integrating ontologies and machine learning.通过神经心理学测试在人群筛查中早期发现轻度认知障碍:一个整合本体论和机器学习的决策支持系统。
Front Neuroinform. 2024 Oct 16;18:1378281. doi: 10.3389/fninf.2024.1378281. eCollection 2024.
7
Interpreting and visualizing pathway analyses using embedding representations with PAVER.使用PAVER的嵌入表示法解释和可视化通路分析。
Bioinformation. 2024 Jul 31;20(7):700-704. doi: 10.6026/973206300200700. eCollection 2024.
8
Unveiling inter-embryo variability in spindle length over time: Towards quantitative phenotype analysis.揭示胚胎间纺锤体长度随时间的变化:迈向定量表型分析。
PLoS Comput Biol. 2024 Sep 5;20(9):e1012330. doi: 10.1371/journal.pcbi.1012330. eCollection 2024 Sep.
9
Data-Driven Hypothesis Generation in Clinical Research: What We Learned from a Human Subject Study?临床研究中数据驱动的假设生成:我们从一项人体研究中学到了什么?
Med Res Arch. 2024 Feb;12(2). doi: 10.18103/mra.v12i2.5132. Epub 2024 Feb 28.
10
An ontology-based tool for modeling and documenting events in neurosurgery.一种基于本体的神经外科手术事件建模与记录工具。
BMC Med Inform Decis Mak. 2024 Jul 31;24(1):216. doi: 10.1186/s12911-024-02615-y.
从表型、功能和表达的解剖部位预测候选基因。
Bioinformatics. 2021 May 5;37(6):853-860. doi: 10.1093/bioinformatics/btaa879.
4
Evolving knowledge graph similarity for supervised learning in complex biomedical domains.用于复杂生物医学领域中监督学习的进化知识图相似度。
BMC Bioinformatics. 2020 Jan 3;21(1):6. doi: 10.1186/s12859-019-3296-1.
5
DeepMiR2GO: Inferring Functions of Human MicroRNAs Using a Deep Multi-Label Classification Model.DeepMiR2GO:使用深度多标签分类模型推断人类 microRNA 的功能。
Int J Mol Sci. 2019 Nov 30;20(23):6046. doi: 10.3390/ijms20236046.
6
Ontology-based prediction of cancer driver genes.基于本体论的癌症驱动基因预测。
Sci Rep. 2019 Nov 22;9(1):17405. doi: 10.1038/s41598-019-53454-1.
7
Comparison of Target Features for Predicting Drug-Target Interactions by Deep Neural Network Based on Large-Scale Drug-Induced Transcriptome Data.基于大规模药物诱导转录组数据的深度神经网络预测药物-靶点相互作用的目标特征比较
Pharmaceutics. 2019 Aug 2;11(8):377. doi: 10.3390/pharmaceutics11080377.
8
DEEPred: Automated Protein Function Prediction with Multi-task Feed-forward Deep Neural Networks.DEEPred:基于多任务前馈深度神经网络的蛋白质自动功能预测。
Sci Rep. 2019 May 14;9(1):7344. doi: 10.1038/s41598-019-43708-3.
9
INGA 2.0: improving protein function prediction for the dark proteome.INGA 2.0:改进黑暗蛋白质组中蛋白质功能的预测。
Nucleic Acids Res. 2019 Jul 2;47(W1):W373-W378. doi: 10.1093/nar/gkz375.
10
Deep learning in bioinformatics: Introduction, application, and perspective in the big data era.深度学习在生物信息学中的应用:大数据时代的介绍、应用和展望。
Methods. 2019 Aug 15;166:4-21. doi: 10.1016/j.ymeth.2019.04.008. Epub 2019 Apr 22.