• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Simona:一个用于生物本体语义相似性分析的综合 R 包。

simona: a comprehensive R package for semantic similarity analysis on bio-ontologies.

机构信息

Molecular Precision Oncology Program, National Center for Tumor Diseases (NCT), Im Neuenheimer Feld 280, Heidelberg, 69120, Germany.

出版信息

BMC Genomics. 2024 Sep 16;25(1):869. doi: 10.1186/s12864-024-10759-4.

DOI:10.1186/s12864-024-10759-4
PMID:39285315
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11406866/
Abstract

BACKGROUND

Bio-ontologies are keys in structuring complex biological information for effective data integration and knowledge representation. Semantic similarity analysis on bio-ontologies quantitatively assesses the degree of similarity between biological concepts based on the semantics encoded in ontologies. It plays an important role in structured and meaningful interpretations and integration of complex data from multiple biological domains.

RESULTS

We present simona, a novel R package for semantic similarity analysis on general bio-ontologies. Simona implements infrastructures for ontology analysis by offering efficient data structures, fast ontology traversal methods, and elegant visualizations. Moreover, it provides a robust toolbox supporting over 70 methods for semantic similarity analysis. With simona, we conducted a benchmark against current semantic similarity methods. The results demonstrate methods are clustered based on their mathematical methodologies, thus guiding researchers in the selection of appropriate methods. Additionally, we explored annotation-based versus topology-based methods, revealing that semantic similarities solely based on ontology topology can efficiently reveal semantic similarity structures, facilitating analysis on less-studied organisms and other ontologies.

CONCLUSIONS

Simona offers a versatile interface and efficient implementation for processing, visualization, and semantic similarity analysis on bio-ontologies. We believe that simona will serve as a robust tool for uncovering relationships and enhancing the interoperability of biological knowledge systems.

摘要

背景

生物本体是为有效数据集成和知识表示而构建复杂生物信息的关键。生物本体上的语义相似性分析定量评估了基于本体中编码的语义的生物概念之间的相似程度。它在从多个生物领域的复杂数据的结构化和有意义的解释和整合方面发挥着重要作用。

结果

我们提出了 simona,这是一个用于一般生物本体的语义相似性分析的新的 R 包。Simona 通过提供高效的数据结构、快速的本体遍历方法和优雅的可视化,为本体分析提供了基础设施。此外,它还提供了一个强大的工具箱,支持超过 70 种语义相似性分析方法。使用 simona,我们对当前的语义相似性方法进行了基准测试。结果表明,方法是根据其数学方法进行聚类的,从而指导研究人员选择合适的方法。此外,我们还探索了基于注释和基于拓扑的方法,结果表明,仅基于本体拓扑的语义相似性可以有效地揭示语义相似性结构,有助于分析研究较少的生物体和其他本体。

结论

Simona 为生物本体的处理、可视化和语义相似性分析提供了通用的接口和高效的实现。我们相信,simona 将成为揭示关系和增强生物知识系统互操作性的强大工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1142/11406866/c1333fc20486/12864_2024_10759_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1142/11406866/3d34638f3569/12864_2024_10759_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1142/11406866/1f1eeb55ca21/12864_2024_10759_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1142/11406866/45669d119164/12864_2024_10759_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1142/11406866/2ad2b36f4430/12864_2024_10759_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1142/11406866/c1333fc20486/12864_2024_10759_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1142/11406866/3d34638f3569/12864_2024_10759_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1142/11406866/1f1eeb55ca21/12864_2024_10759_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1142/11406866/45669d119164/12864_2024_10759_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1142/11406866/2ad2b36f4430/12864_2024_10759_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1142/11406866/c1333fc20486/12864_2024_10759_Fig5_HTML.jpg

相似文献

1
simona: a comprehensive R package for semantic similarity analysis on bio-ontologies.Simona:一个用于生物本体语义相似性分析的综合 R 包。
BMC Genomics. 2024 Sep 16;25(1):869. doi: 10.1186/s12864-024-10759-4.
2
Aggregating the syntactic and semantic similarity of healthcare data towards their transformation to HL7 FHIR through ontology matching.通过本体匹配,聚合医疗保健数据的语法和语义相似性,以将其转换为 HL7 FHIR。
Int J Med Inform. 2019 Dec;132:104002. doi: 10.1016/j.ijmedinf.2019.104002. Epub 2019 Oct 5.
3
Semantic similarity and machine learning with ontologies.语义相似性和本体论的机器学习。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa199.
4
Semantic similarity in biomedical ontologies.生物医学本体中的语义相似性。
PLoS Comput Biol. 2009 Jul;5(7):e1000443. doi: 10.1371/journal.pcbi.1000443. Epub 2009 Jul 31.
5
Multi-Ontology Refined Embeddings (MORE): A hybrid multi-ontology and corpus-based semantic representation model for biomedical concepts.多本体精炼嵌入模型(MORE):一种基于混合多本体和语料库的生物医学概念语义表示模型。
J Biomed Inform. 2020 Nov;111:103581. doi: 10.1016/j.jbi.2020.103581. Epub 2020 Oct 1.
6
Evaluating semantic similarity between Chinese biomedical terms through multiple ontologies with score normalization: An initial study.通过多本体和分数归一化评估中文生物医学术语之间的语义相似性:一项初步研究。
J Biomed Inform. 2016 Dec;64:273-287. doi: 10.1016/j.jbi.2016.10.017. Epub 2016 Nov 1.
7
A Semantic Similarity Evaluation for Healthcare Ontologies Matching to HL7 FHIR Resources.用于医疗保健本体与HL7 FHIR资源匹配的语义相似性评估
Stud Health Technol Inform. 2020 Jun 16;270:13-17. doi: 10.3233/SHTI200113.
8
OPA2Vec: combining formal and informal content of biomedical ontologies to improve similarity-based prediction.OPA2Vec:结合生物医学本体的正式和非正式内容以改进基于相似度的预测。
Bioinformatics. 2019 Jun 1;35(12):2133-2140. doi: 10.1093/bioinformatics/bty933.
9
UFO: A tool for unifying biomedical ontology-based semantic similarity calculation, enrichment analysis and visualization.UFO:一种用于统一基于生物医学本体的语义相似性计算、富集分析和可视化的工具。
PLoS One. 2020 Jul 9;15(7):e0235670. doi: 10.1371/journal.pone.0235670. eCollection 2020.
10
HESML: a real-time semantic measures library for the biomedical domain with a reproducible survey.HESML:生物医学领域的实时语义度量库,附有可重现的调查。
BMC Bioinformatics. 2022 Jan 6;23(1):23. doi: 10.1186/s12859-021-04539-0.

引用本文的文献

1
Astrocyte diversity and subtypes: aligning transcriptomics with multimodal perspectives.星形胶质细胞的多样性与亚型:将转录组学与多模态视角相结合
EMBO Rep. 2025 Aug 1. doi: 10.1038/s44319-025-00529-y.
2
Sequencing validates deep learning models for EHR-based detection of Noonan syndrome in pediatric patients.测序验证了用于基于电子健康记录检测儿科患者努南综合征的深度学习模型。
NPJ Genom Med. 2025 Jul 21;10(1):56. doi: 10.1038/s41525-025-00512-5.
3
Data- and knowledge-derived functional landscape of human solute carriers.人类溶质载体的数据与知识衍生功能图谱

本文引用的文献

1
simplifyEnrichment: A Bioconductor Package for Clustering and Visualizing Functional Enrichment Results.simplifyEnrichment:一个用于聚类和可视化功能富集结果的 Bioconductor 包。
Genomics Proteomics Bioinformatics. 2023 Feb;21(1):190-202. doi: 10.1016/j.gpb.2022.04.008. Epub 2022 Jun 6.
2
The Human Disease Ontology 2022 update.人类疾病本体 2022 更新版。
Nucleic Acids Res. 2022 Jan 7;50(D1):D1255-D1261. doi: 10.1093/nar/gkab1063.
3
Cell type ontologies of the Human Cell Atlas.人类细胞图谱的细胞类型本体。
Mol Syst Biol. 2025 May 12. doi: 10.1038/s44320-025-00108-2.
4
Risk Prediction of RNA Off-Targets of CRISPR Base Editors in Tissue-Specific Transcriptomes Using Language Models.使用语言模型预测组织特异性转录组中CRISPR碱基编辑器的RNA脱靶风险
Int J Mol Sci. 2025 Feb 18;26(4):1723. doi: 10.3390/ijms26041723.
Nat Cell Biol. 2021 Nov;23(11):1129-1135. doi: 10.1038/s41556-021-00787-7. Epub 2021 Nov 8.
4
NCBI Taxonomy: a comprehensive update on curation, resources and tools.NCBI 分类学:在管理、资源和工具方面的全面更新。
Database (Oxford). 2020 Jan 1;2020. doi: 10.1093/database/baaa062.
5
A Literature Review of Gene Function Prediction by Modeling Gene Ontology.基于基因本体建模的基因功能预测文献综述
Front Genet. 2020 Apr 24;11:400. doi: 10.3389/fgene.2020.00400. eCollection 2020.
6
ROBOT: A Tool for Automating Ontology Workflows.机器人:自动化本体工作流程的工具。
BMC Bioinformatics. 2019 Jul 29;20(1):407. doi: 10.1186/s12859-019-3002-3.
7
GOGO: An improved algorithm to measure the semantic similarity between gene ontology terms.GO-GO:一种改进的基因本体术语间语义相似度测量算法。
Sci Rep. 2018 Oct 10;8(1):15107. doi: 10.1038/s41598-018-33219-y.
8
Interspecies gene function prediction using semantic similarity.基于语义相似性的跨物种基因功能预测
BMC Syst Biol. 2016 Dec 23;10(Suppl 4):121. doi: 10.1186/s12918-016-0361-5.
9
ontologyX: a suite of R packages for working with ontological data.本体X:一套用于处理本体数据的R包。
Bioinformatics. 2017 Apr 1;33(7):1104-1106. doi: 10.1093/bioinformatics/btw763.
10
Semantic Similarity in the Gene Ontology.基因本体论中的语义相似性。
Methods Mol Biol. 2017;1446:161-173. doi: 10.1007/978-1-4939-3743-1_12.