通过多维语义空间探索和链接生物医学资源。

Exploring and linking biomedical resources through multidimensional semantic spaces.

出版信息

BMC Bioinformatics. 2012 Jan 25;13 Suppl 1(Suppl 1):S6. doi: 10.1186/1471-2105-13-S1-S6.

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3471347/

Abstract

BACKGROUND

The semantic integration of biomedical resources is still a challenging issue which is required for effective information processing and data analysis. The availability of comprehensive knowledge resources such as biomedical ontologies and integrated thesauri greatly facilitates this integration effort by means of semantic annotation, which allows disparate data formats and contents to be expressed under a common semantic space. In this paper, we propose a multidimensional representation for such a semantic space, where dimensions regard the different perspectives in biomedical research (e.g., population, disease, anatomy and protein/genes).

RESULTS

This paper presents a novel method for building multidimensional semantic spaces from semantically annotated biomedical data collections. This method consists of two main processes: knowledge and data normalization. The former one arranges the concepts provided by a reference knowledge resource (e.g., biomedical ontologies and thesauri) into a set of hierarchical dimensions for analysis purposes. The latter one reduces the annotation set associated to each collection item into a set of points of the multidimensional space. Additionally, we have developed a visual tool, called 3D-Browser, which implements OLAP-like operators over the generated multidimensional space. The method and the tool have been tested and evaluated in the context of the Health-e-Child (HeC) project. Automatic semantic annotation was applied to tag three collections of abstracts taken from PubMed, one for each target disease of the project, the Uniprot database, and the HeC patient record database. We adopted the UMLS Meta-thesaurus 2010AA as the reference knowledge resource.

CONCLUSIONS

Current knowledge resources and semantic-aware technology make possible the integration of biomedical resources. Such an integration is performed through semantic annotation of the intended biomedical data resources. This paper shows how these annotations can be exploited for integration, exploration, and analysis tasks. Results over a real scenario demonstrate the viability and usefulness of the approach, as well as the quality of the generated multidimensional semantic spaces.

摘要

背景

生物医学资源的语义集成仍然是一个具有挑战性的问题，这对于有效的信息处理和数据分析是必需的。全面的知识资源（如生物医学本体和集成词库）的可用性极大地促进了这种集成工作，其方式是通过语义注释，从而使不同的数据格式和内容可以在共同的语义空间中表达。在本文中，我们提出了一种多维表示，其中维度涉及生物医学研究的不同视角（例如，人群、疾病、解剖和蛋白质/基因）。

结果

本文提出了一种从语义注释的生物医学数据集中构建多维语义空间的新方法。该方法包括两个主要过程：知识和数据规范化。前者将参考知识资源（例如生物医学本体和词库）提供的概念安排到一组用于分析的层次维度中。后者将与每个集合项相关联的注释集减少为多维空间中的一组点。此外，我们还开发了一个称为 3D-Browser 的可视化工具，该工具在生成的多维空间上实现了 OLAP 类似的操作符。该方法和工具已在 Health-e-Child（HeC）项目中进行了测试和评估。自动语义注释应用于从 PubMed 标记三个摘要集，每个项目都针对项目的目标疾病之一，UniProt 数据库和 HeC 患者记录数据库。我们采用 UMLS Meta-thesaurus 2010AA 作为参考知识资源。

结论

当前的知识资源和语义感知技术使生物医学资源的集成成为可能。这种集成是通过对预期的生物医学数据资源进行语义注释来实现的。本文展示了如何利用这些注释来执行集成、探索和分析任务。真实场景中的结果证明了该方法的可行性和有用性，以及生成的多维语义空间的质量。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/944b/3471347/d64cd6c51e0f/1471-2105-13-S1-S6-1.jpg

相似文献

Exploring and linking biomedical resources through multidimensional semantic spaces.

BMC Bioinformatics. 2012 Jan 25;13 Suppl 1(Suppl 1):S6. doi: 10.1186/1471-2105-13-S1-S6.

KaBOB: ontology-based semantic integration of biomedical databases.

BMC Bioinformatics. 2015 Apr 23;16(1):126. doi: 10.1186/s12859-015-0559-3.

SIFR annotator: ontology-based semantic annotation of French biomedical text and clinical notes.

BMC Bioinformatics. 2018 Nov 6;19(1):405. doi: 10.1186/s12859-018-2429-2.

Semantic annotation in biomedicine: the current landscape.

J Biomed Semantics. 2017 Sep 22;8(1):44. doi: 10.1186/s13326-017-0153-x.

Harmonizing semantic annotations for computational models in biology.

Brief Bioinform. 2019 Mar 22;20(2):540-550. doi: 10.1093/bib/bby087.

Generation of open biomedical datasets through ontology-driven transformation and integration processes.

J Biomed Semantics. 2016 Jun 3;7:32. doi: 10.1186/s13326-016-0075-z.

PLoS Comput Biol. 2009 Jul;5(7):e1000443. doi: 10.1371/journal.pcbi.1000443. Epub 2009 Jul 31.

Using the Semantic Web for Rapid Integration of WikiPathways with Other Biological Online Data Resources.

PLoS Comput Biol. 2016 Jun 23;12(6):e1004989. doi: 10.1371/journal.pcbi.1004989. eCollection 2016 Jun.

simona: a comprehensive R package for semantic similarity analysis on bio-ontologies.

BMC Genomics. 2024 Sep 16;25(1):869. doi: 10.1186/s12864-024-10759-4.

Reuse of terminological resources for efficient ontological engineering in Life Sciences.

BMC Bioinformatics. 2009 Oct 1;10 Suppl 10(Suppl 10):S4. doi: 10.1186/1471-2105-10-S10-S4.

引用本文的文献

Social Media Multidimensional Analysis for Intelligent Health Surveillance.

Int J Environ Res Public Health. 2020 Mar 28;17(7):2289. doi: 10.3390/ijerph17072289.

Investigating the role of interleukin-1 beta and glutamate in inflammatory bowel disease and epilepsy using discovery browsing.

J Biomed Semantics. 2018 Dec 27;9(1):25. doi: 10.1186/s13326-018-0192-y.

Improving the interoperability of biomedical ontologies with compound alignments.

J Biomed Semantics. 2018 Jan 9;9(1):1. doi: 10.1186/s13326-017-0171-8.

Natural language processing systems for capturing and standardizing unstructured clinical information: A systematic review.

J Biomed Inform. 2017 Sep;73:14-29. doi: 10.1016/j.jbi.2017.07.012. Epub 2017 Jul 17.

Ontology-Based Querying with Bio2RDF's Linked Open Data.

J Biomed Semantics. 2013 Apr 15;4 Suppl 1(Suppl 1):S1. doi: 10.1186/2041-1480-4-S1-S1.

本文引用的文献

Assessment of NER solutions against the first and second CALBC Silver Standard Corpus.

J Biomed Semantics. 2011 Oct 6;2 Suppl 5(Suppl 5):S11. doi: 10.1186/2041-1480-2-S5-S11.

Literature mining, ontologies and information visualization for drug repurposing.

Brief Bioinform. 2011 Jul;12(4):357-68. doi: 10.1093/bib/bbr005. Epub 2011 Jun 28.

EcoCyc: a comprehensive database of Escherichia coli biology.

Nucleic Acids Res. 2011 Jan;39(Database issue):D583-90. doi: 10.1093/nar/gkq1143. Epub 2010 Nov 21.

The BioPAX community standard for pathway data sharing.

Nat Biotechnol. 2010 Sep;28(9):935-42. doi: 10.1038/nbt.1666. Epub 2010 Sep 9.

CALBC silver standard corpus.

J Bioinform Comput Biol. 2010 Feb;8(1):163-79. doi: 10.1142/s0219720010004562.

Exploitation of ontological resources for scientific literature analysis: searching genes and related diseases.

Annu Int Conf IEEE Eng Med Biol Soc. 2009;2009:7073-8. doi: 10.1109/IEMBS.2009.5333359.

XML-based approaches for the integration of heterogeneous bio-molecular data.

BMC Bioinformatics. 2009 Oct 15;10 Suppl 12(Suppl 12):S7. doi: 10.1186/1471-2105-10-S12-S7.

FACTA: a text search engine for finding associated biomedical concepts.

Bioinformatics. 2008 Nov 1;24(21):2559-60. doi: 10.1093/bioinformatics/btn469. Epub 2008 Sep 4.

A data model for integrating heterogeneous medical data in the Health-e-Child project.

Stud Health Technol Inform. 2008;138:13-23.

Bio2RDF: towards a mashup to build bioinformatics knowledge systems.

J Biomed Inform. 2008 Oct;41(5):706-16. doi: 10.1016/j.jbi.2008.03.004. Epub 2008 Mar 21.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

通过多维语义空间探索和链接生物医学资源。

Exploring and linking biomedical resources through multidimensional semantic spaces.

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献