• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种异构数据源语义集成的方法。

An approach for semantic integration of heterogeneous data sources.

作者信息

Fusco Giuseppe, Aversano Lerina

机构信息

Department of Engineering, University of Sannio, Benevento, BN, Italia.

出版信息

PeerJ Comput Sci. 2020 Mar 2;6:e254. doi: 10.7717/peerj-cs.254. eCollection 2020.

DOI:10.7717/peerj-cs.254
PMID:33816906
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7924686/
Abstract

Integrating data from multiple heterogeneous data sources entails dealing with data distributed among heterogeneous information sources, which can be structured, semi-structured or unstructured, and providing the user with a unified view of these data. Thus, in general, gathering information is challenging, and one of the main reasons is that data sources are designed to support specific applications. Very often their structure is unknown to the large part of users. Moreover, the stored data is often redundant, mixed with information only needed to support enterprise processes, and incomplete with respect to the business domain. Collecting, integrating, reconciling and efficiently extracting information from heterogeneous and autonomous data sources is regarded as a major challenge. In this paper, we present an approach for the semantic integration of heterogeneous data sources, DIF (Data Integration Framework), and a software prototype to support all aspects of a complex data integration process. The proposed approach is an ontology-based generalization of both Global-as-View and Local-as-View approaches. In particular, to overcome problems due to semantic heterogeneity and to support interoperability with external systems, ontologies are used as a conceptual schema to represent both data sources to be integrated and the global view.

摘要

整合来自多个异构数据源的数据需要处理分布在异构信息源中的数据,这些数据源可以是结构化、半结构化或非结构化的,并为用户提供这些数据的统一视图。因此,一般来说,收集信息具有挑战性,主要原因之一是数据源旨在支持特定应用程序。在很大程度上,用户通常不知道它们的结构。此外,存储的数据往往是冗余的,与仅支持企业流程所需的信息混合在一起,并且在业务领域方面是不完整的。从异构和自治数据源收集、整合、协调和高效提取信息被视为一项重大挑战。在本文中,我们提出了一种用于异构数据源语义集成的方法——数据集成框架(DIF),以及一个支持复杂数据集成过程各个方面的软件原型。所提出的方法是基于本体对全局视图和局部视图方法的一种泛化。特别是,为了克服语义异构带来的问题并支持与外部系统的互操作性,本体被用作概念模式来表示要集成的数据源和全局视图。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec5d/7924686/ab0426ddfbe4/peerj-cs-06-254-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec5d/7924686/49a5c7b1b347/peerj-cs-06-254-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec5d/7924686/1c5766fa1328/peerj-cs-06-254-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec5d/7924686/646954ae5281/peerj-cs-06-254-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec5d/7924686/2d6abab4a201/peerj-cs-06-254-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec5d/7924686/550991fa2664/peerj-cs-06-254-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec5d/7924686/94a8fbc8b331/peerj-cs-06-254-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec5d/7924686/ee3bb0012787/peerj-cs-06-254-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec5d/7924686/3ff1f6d23827/peerj-cs-06-254-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec5d/7924686/2ec5e1e2e231/peerj-cs-06-254-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec5d/7924686/e2b4bdd4f296/peerj-cs-06-254-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec5d/7924686/ab0426ddfbe4/peerj-cs-06-254-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec5d/7924686/49a5c7b1b347/peerj-cs-06-254-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec5d/7924686/1c5766fa1328/peerj-cs-06-254-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec5d/7924686/646954ae5281/peerj-cs-06-254-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec5d/7924686/2d6abab4a201/peerj-cs-06-254-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec5d/7924686/550991fa2664/peerj-cs-06-254-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec5d/7924686/94a8fbc8b331/peerj-cs-06-254-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec5d/7924686/ee3bb0012787/peerj-cs-06-254-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec5d/7924686/3ff1f6d23827/peerj-cs-06-254-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec5d/7924686/2ec5e1e2e231/peerj-cs-06-254-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec5d/7924686/e2b4bdd4f296/peerj-cs-06-254-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ec5d/7924686/ab0426ddfbe4/peerj-cs-06-254-g011.jpg

相似文献

1
An approach for semantic integration of heterogeneous data sources.一种异构数据源语义集成的方法。
PeerJ Comput Sci. 2020 Mar 2;6:e254. doi: 10.7717/peerj-cs.254. eCollection 2020.
2
Clinical data integration model. Core interoperability ontology for research using primary care data.临床数据整合模型。使用初级保健数据进行研究的核心互操作性本体。
Methods Inf Med. 2015;54(1):16-23. doi: 10.3414/ME13-02-0024. Epub 2014 Jun 18.
3
Toward a view-oriented approach for aligning RDF-based biomedical repositories.迈向一种基于视图的方法来对齐基于RDF的生物医学知识库。
Methods Inf Med. 2015;54(1):50-5. doi: 10.3414/ME13-02-0020. Epub 2014 Apr 29.
4
Ontology-based interoperability service for HL7 interfaces implementation.用于HL7接口实现的基于本体的互操作性服务。
Stud Health Technol Inform. 2010;155:108-14.
5
Conceptual Model Formalization in a Semantic Interoperability Service Framework: Transforming Relational Database Schemas to OWL.语义互操作性服务框架中的概念模型形式化:将关系数据库模式转换为OWL
Stud Health Technol Inform. 2014;200:35-41.
6
JXP4BIGI: a generalized, Java XML-based approach for biological information gathering and integration.JXP4BIGI:一种基于Java XML的通用生物信息收集与整合方法。
Bioinformatics. 2003 Dec 12;19(18):2351-8. doi: 10.1093/bioinformatics/btg327.
7
Using ontologies to improve semantic interoperability in health data.利用本体来提高健康数据中的语义互操作性。
J Innov Health Inform. 2015 Jul 10;22(2):309-15. doi: 10.14236/jhi.v22i2.159.
8
A unified structural/terminological interoperability framework based on LexEVS: application to TRANSFoRm.基于 LexEVS 的统一结构/术语互操作性框架:在 TRANSFoRm 中的应用。
J Am Med Inform Assoc. 2013 Sep-Oct;20(5):986-94. doi: 10.1136/amiajnl-2012-001312. Epub 2013 Apr 9.
9
Semantic integration of gene expression analysis tools and data sources using software connectors.使用软件连接器实现基因表达分析工具和数据源的语义集成。
BMC Genomics. 2013 Oct 25;14 Suppl 6(Suppl 6):S2. doi: 10.1186/1471-2164-14-S6-S2.
10
An ontology-guided semantic data integration framework to support integrative data analysis of cancer survival.本体指导的语义数据集成框架,支持癌症生存的综合数据分析。
BMC Med Inform Decis Mak. 2018 Jul 23;18(Suppl 2):41. doi: 10.1186/s12911-018-0636-4.

引用本文的文献

1
Heterogeneous data integration: Challenges and opportunities.异构数据集成:挑战与机遇。
Data Brief. 2024 Aug 29;56:110853. doi: 10.1016/j.dib.2024.110853. eCollection 2024 Oct.
2
The proposal of a modeling methodology for an industrial internet information model.一种工业互联网信息模型建模方法的提议。
PeerJ Comput Sci. 2022 Nov 15;8:e1150. doi: 10.7717/peerj-cs.1150. eCollection 2022.
3
The external and data loose coupling for the integration of software units: a systematic mapping study.用于软件单元集成的外部与数据松耦合:一项系统映射研究。
PeerJ Comput Sci. 2021 Dec 7;7:e796. doi: 10.7717/peerj-cs.796. eCollection 2021.