Suppr超能文献

DASMiner:从DAS源发现并整合数据。

DASMiner: discovering and integrating data from DAS sources.

作者信息

Veiga Diogo F T, Deus Helena F, Akdemir Caner, Vasconcelos Ana Tereza R, Almeida Jonas S

机构信息

Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, 1515 Holcombe Blvd Houston, TX 77030, USA.

出版信息

BMC Syst Biol. 2009 Nov 17;3:109. doi: 10.1186/1752-0509-3-109.

Abstract

BACKGROUND

DAS is a widely adopted protocol for providing syntactic interoperability among biological databases. The popularity of DAS is due to a simplified and elegant mechanism for data exchange that consists of sources exposing their RESTful interfaces for data access. As a growing number of DAS services are available for molecular biology resources, there is an incentive to explore this protocol in order to advance data discovery and integration among these resources.

RESULTS

We developed DASMiner, a Matlab toolkit for querying DAS data sources that enables creation of integrated biological models using the information available in DAS-compliant repositories. DASMiner is composed by a browser application and an API that work together to facilitate gathering of data from different DAS sources, which can be used for creating enriched datasets from multiple sources. The browser is used to formulate queries and navigate data contained in DAS sources. Users can execute queries against these sources in an intuitive fashion, without the need of knowing the specific DAS syntax for the particular source. Using the source's metadata provided by the DAS Registry, the browser's layout adapts to expose only the set of commands and coordinate systems supported by the specific source. For this reason, the browser can interrogate any DAS source, independently of the type of data being served. The API component of DASMiner may be used for programmatic access of DAS sources by programs in Matlab. Once the desired data is found during navigation, the query is exported in the format of an API call to be used within any Matlab application. We illustrate the use of DASMiner by creating integrative models of histone modification maps and protein-protein interaction networks. These enriched datasets were built by retrieving and integrating distributed genomic and proteomic DAS sources using the API.

CONCLUSION

The support of the DAS protocol allows that hundreds of molecular biology databases to be treated as a federated, online collection of resources. DASMiner enables full exploration of these resources, and can be used to deploy applications and create integrated views of biological systems using the information deposited in DAS repositories.

摘要

背景

DAS是一种广泛采用的协议,用于在生物数据库之间提供句法互操作性。DAS之所以受欢迎,是因为它有一个简化而优雅的数据交换机制,该机制由数据源公开其用于数据访问的RESTful接口组成。随着越来越多的DAS服务可用于分子生物学资源,人们有动力探索该协议,以促进这些资源之间的数据发现和整合。

结果

我们开发了DASMiner,这是一个用于查询DAS数据源的Matlab工具包,它能够利用符合DAS的存储库中可用的信息创建综合生物学模型。DASMiner由一个浏览器应用程序和一个应用程序编程接口(API)组成,它们协同工作以促进从不同DAS源收集数据,这些数据可用于从多个源创建丰富的数据集。浏览器用于制定查询并浏览DAS源中包含的数据。用户可以以直观的方式针对这些源执行查询,而无需了解特定源的特定DAS语法。利用DAS注册中心提供的源元数据,浏览器的布局会进行调整,仅显示特定源支持的命令集和坐标系。因此,浏览器可以询问任何DAS源,而与所提供的数据类型无关。DASMiner的API组件可用于Matlab中的程序对DAS源进行编程访问。一旦在浏览过程中找到所需数据,查询将以API调用的格式导出,以便在任何Matlab应用程序中使用。我们通过创建组蛋白修饰图谱和蛋白质-蛋白质相互作用网络的整合模型来说明DASMiner的使用。这些丰富的数据集是通过使用API检索和整合分布式基因组和蛋白质组DAS源而构建的。

结论

对DAS协议的支持使得数百个分子生物学数据库能够被视为一个联合的在线资源集合。DASMiner能够充分探索这些资源,并可用于部署应用程序,以及利用存储在DAS存储库中的信息创建生物系统的综合视图。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f354/2789070/134de4173b15/1752-0509-3-109-1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验