• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于 caGrid 的语义网数据仓库。

Semantic web data warehousing for caGrid.

机构信息

Department of Pathology, Yale University School of Medicine, New Haven, CT, USA

出版信息

BMC Bioinformatics. 2009 Oct 1;10 Suppl 10(Suppl 10):S2. doi: 10.1186/1471-2105-10-S10-S2.

DOI:10.1186/1471-2105-10-S10-S2
PMID:19796399
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2755823/
Abstract

The National Cancer Institute (NCI) is developing caGrid as a means for sharing cancer-related data and services. As more data sets become available on caGrid, we need effective ways of accessing and integrating this information. Although the data models exposed on caGrid are semantically well annotated, it is currently up to the caGrid client to infer relationships between the different models and their classes. In this paper, we present a Semantic Web-based data warehouse (Corvus) for creating relationships among caGrid models. This is accomplished through the transformation of semantically-annotated caBIG Unified Modeling Language (UML) information models into Web Ontology Language (OWL) ontologies that preserve those semantics. We demonstrate the validity of the approach by Semantic Extraction, Transformation and Loading (SETL) of data from two caGrid data sources, caTissue and caArray, as well as alignment and query of those sources in Corvus. We argue that semantic integration is necessary for integration of data from distributed web services and that Corvus is a useful way of accomplishing this. Our approach is generalizable and of broad utility to researchers facing similar integration challenges.

摘要

美国国家癌症研究所(NCI)正在开发 caGrid,作为共享癌症相关数据和服务的一种手段。随着更多的数据集在 caGrid 上可用,我们需要有效的方法来访问和整合这些信息。虽然 caGrid 上公开的数据模型在语义上有很好的注释,但目前需要 caGrid 客户端来推断不同模型及其类之间的关系。在本文中,我们提出了一个基于语义网的数据仓库(Corvus),用于在 caGrid 模型之间创建关系。这是通过将语义注释的 caBIG 统一建模语言(UML)信息模型转换为保留这些语义的 Web 本体语言(OWL)本体来实现的。我们通过从两个 caGrid 数据源(caTissue 和 caArray)进行数据的语义提取、转换和加载(SETL),以及在 Corvus 中对这些数据源进行对齐和查询,证明了该方法的有效性。我们认为语义集成对于来自分布式 Web 服务的数据集成是必要的,而 Corvus 是实现这一目标的一种有用方法。我们的方法具有通用性,对于面临类似集成挑战的研究人员具有广泛的实用性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/99ad/2755823/00d4f25b9460/12859_2009_3370_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/99ad/2755823/b70672f3217a/12859_2009_3370_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/99ad/2755823/cfb323d300e7/12859_2009_3370_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/99ad/2755823/87ad2c1b1f3b/12859_2009_3370_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/99ad/2755823/ce0144167fc9/12859_2009_3370_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/99ad/2755823/00d4f25b9460/12859_2009_3370_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/99ad/2755823/b70672f3217a/12859_2009_3370_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/99ad/2755823/cfb323d300e7/12859_2009_3370_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/99ad/2755823/87ad2c1b1f3b/12859_2009_3370_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/99ad/2755823/ce0144167fc9/12859_2009_3370_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/99ad/2755823/00d4f25b9460/12859_2009_3370_Fig5_HTML.jpg

相似文献

1
Semantic web data warehousing for caGrid.用于 caGrid 的语义网数据仓库。
BMC Bioinformatics. 2009 Oct 1;10 Suppl 10(Suppl 10):S2. doi: 10.1186/1471-2105-10-S10-S2.
2
caGrid: design and implementation of the core architecture of the cancer biomedical informatics grid.caGrid:癌症生物医学信息学网格核心架构的设计与实现
Bioinformatics. 2006 Aug 1;22(15):1910-6. doi: 10.1093/bioinformatics/btl272. Epub 2006 Jun 9.
3
Federated ontology-based queries over cancer data.基于联邦本体的癌症数据查询。
BMC Bioinformatics. 2012 Jan 25;13 Suppl 1(Suppl 1):S9. doi: 10.1186/1471-2105-13-S1-S9.
4
CaGrid Workflow Toolkit: a Taverna based workflow tool for cancer grid.CaGrid 工作流工具包:一个基于 Taverna 的癌症网格工作流工具。
BMC Bioinformatics. 2010 Nov 2;11:542. doi: 10.1186/1471-2105-11-542.
5
SSWAP: A Simple Semantic Web Architecture and Protocol for semantic web services.SSWAP:一种用于语义 Web 服务的简单语义 Web 架构和协议。
BMC Bioinformatics. 2009 Sep 23;10:309. doi: 10.1186/1471-2105-10-309.
6
caGrid 1.0: an enterprise Grid infrastructure for biomedical research.caGrid 1.0:用于生物医学研究的企业级网格基础设施。
J Am Med Inform Assoc. 2008 Mar-Apr;15(2):138-49. doi: 10.1197/jamia.M2522. Epub 2007 Dec 20.
7
The caCORE Software Development Kit: streamlining construction of interoperable biomedical information services.caCORE软件开发工具包:简化可互操作生物医学信息服务的构建
BMC Med Inform Decis Mak. 2006 Jan 6;6:2. doi: 10.1186/1472-6947-6-2.
8
semCDI: a query formulation for semantic data integration in caBIG.语义临床数据集成(semCDI):caBIG中语义数据集成的查询公式化
J Am Med Inform Assoc. 2008 Jul-Aug;15(4):559-68. doi: 10.1197/jamia.M2732. Epub 2008 Apr 24.
9
Querying phenotype-genotype relationships on patient datasets using semantic web technology: the example of Cerebrotendinous xanthomatosis.使用语义网技术在患者数据集上查询表型-基因型关系:以脑腱黄瘤病为例。
BMC Med Inform Decis Mak. 2012 Jul 31;12:78. doi: 10.1186/1472-6947-12-78.
10
Bio-jETI: a framework for semantics-based service composition.生物-JETI:一种基于语义的服务组合框架。
BMC Bioinformatics. 2009 Oct 1;10 Suppl 10(Suppl 10):S8. doi: 10.1186/1471-2105-10-S10-S8.

引用本文的文献

1
A digital repository with an extensible data model for biobanking and genomic analysis management.一个具有可扩展数据模型的数字存储库,用于生物样本库和基因组分析管理。
BMC Genomics. 2014;15 Suppl 3(Suppl 3):S3. doi: 10.1186/1471-2164-15-S3-S3. Epub 2014 May 6.
2
Cheminformatics and the Semantic Web: adding value with linked data and enhanced provenance.化学信息学与语义网:通过关联数据和增强的出处增加价值。
Wiley Interdiscip Rev Comput Mol Sci. 2013 Sep;3(5):465-481. doi: 10.1002/wcms.1127. Epub 2013 Jan 8.
3
Developing a semantically rich ontology for the biobank-administration domain.

本文引用的文献

1
MAGETabulator, a suite of tools to support the microarray data format MAGE-TAB.MAGETabulator,一套支持微阵列数据格式MAGE-TAB的工具。
Bioinformatics. 2009 Jan 15;25(2):279-80. doi: 10.1093/bioinformatics/btn617. Epub 2008 Nov 27.
2
semCDI: a query formulation for semantic data integration in caBIG.语义临床数据集成(semCDI):caBIG中语义数据集成的查询公式化
J Am Med Inform Assoc. 2008 Jul-Aug;15(4):559-68. doi: 10.1197/jamia.M2732. Epub 2008 Apr 24.
3
Sharing data and analytical resources securely in a biomedical research Grid environment.
为生物样本库管理领域开发一个语义丰富的本体。
J Biomed Semantics. 2013 Oct 8;4(1):23. doi: 10.1186/2041-1480-4-23.
4
SHRINE: enabling nationally scalable multi-site disease studies.SHRINE:实现全国范围内可扩展的多站点疾病研究。
PLoS One. 2013;8(3):e55811. doi: 10.1371/journal.pone.0055811. Epub 2013 Mar 7.
5
Interfaces to PeptideAtlas: a case study of standard data access systems.肽图集接口:标准数据访问系统案例研究。
Brief Bioinform. 2012 Sep;13(5):615-26. doi: 10.1093/bib/bbr067. Epub 2011 Nov 22.
6
Nanoinformatics: a new area of research in nanomedicine.纳米信息学:纳米医学的一个新研究领域。
Int J Nanomedicine. 2012;7:3867-90. doi: 10.2147/IJN.S24582. Epub 2012 Jul 24.
7
A semantic web framework to integrate cancer omics data with biological knowledge.一种用于将癌症组学数据与生物学知识集成的语义网框架。
BMC Bioinformatics. 2012 Jan 25;13 Suppl 1(Suppl 1):S10. doi: 10.1186/1471-2105-13-S1-S10.
8
Federated ontology-based queries over cancer data.基于联邦本体的癌症数据查询。
BMC Bioinformatics. 2012 Jan 25;13 Suppl 1(Suppl 1):S9. doi: 10.1186/1471-2105-13-S1-S9.
9
Applications of the pipeline environment for visual informatics and genomics computations.管道环境在视觉信息学和基因组计算中的应用。
BMC Bioinformatics. 2011 Jul 26;12:304. doi: 10.1186/1471-2105-12-304.
10
IGRhCellID: integrated genomic resources of human cell lines for identification.IGRhCellID:用于识别的人类细胞系综合基因组资源。
Nucleic Acids Res. 2011 Jan;39(Database issue):D520-4. doi: 10.1093/nar/gkq1075. Epub 2010 Nov 4.
在生物医学研究网格环境中安全地共享数据和分析资源。
J Am Med Inform Assoc. 2008 May-Jun;15(3):363-73. doi: 10.1197/jamia.M2662. Epub 2008 Feb 28.
4
caCORE version 3: Implementation of a model driven, service-oriented architecture for semantic interoperability.caCORE版本3:用于语义互操作性的模型驱动、面向服务架构的实现。
J Biomed Inform. 2008 Feb;41(1):106-23. doi: 10.1016/j.jbi.2007.03.009. Epub 2007 Apr 2.
5
caGrid: design and implementation of the core architecture of the cancer biomedical informatics grid.caGrid:癌症生物医学信息学网格核心架构的设计与实现
Bioinformatics. 2006 Aug 1;22(15):1910-6. doi: 10.1093/bioinformatics/btl272. Epub 2006 Jun 9.
6
NCI Thesaurus: a semantic model integrating cancer-related clinical and molecular information.美国国立癌症研究所叙词表:整合癌症相关临床和分子信息的语义模型。
J Biomed Inform. 2007 Feb;40(1):30-43. doi: 10.1016/j.jbi.2006.02.013. Epub 2006 Mar 15.
7
GenePattern 2.0.基因模式2.0
Nat Genet. 2006 May;38(5):500-1. doi: 10.1038/ng0506-500.
8
Cyberinfrastructure: empowering a "third way" in biomedical research.网络基础设施:助力生物医学研究的“第三条道路”
Science. 2005 May 6;308(5723):821-4. doi: 10.1126/science.1112120.
9
Modeling a description logic vocabulary for cancer research.为癌症研究构建描述逻辑词汇表。
J Biomed Inform. 2005 Apr;38(2):114-29. doi: 10.1016/j.jbi.2004.09.001.
10
caCORE: a common infrastructure for cancer informatics.caCORE:癌症信息学的通用基础设施。
Bioinformatics. 2003 Dec 12;19(18):2404-12. doi: 10.1093/bioinformatics/btg335.