
Distributed retrieval engine for the development of cloud-deployed biological databases.

Authors

Bouzaglo David, Chasida Israel, Ezra Tsur Elishai

Affiliation

Neuro-biomorphic Engineering Lab, Faculty of Engineering, Jerusalem College of Technology, Jerusalem, Israel.

Publication

BioData Min. 2018 Nov 12;11:26. doi: 10.1186/s13040-018-0185-5. eCollection 2018.

DOI: 10.1186/s13040-018-0185-5
PMID: 30459848
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6233384/
Abstract

Integrating cloud resources with federated data retrieval has the potential to improve the maintenance, accessibility and performance of specialized databases in the biomedical field. However, such an integrative approach requires technical expertise in cloud computing, the use of a data retrieval engine and the development of a unified data model that can encapsulate the heterogeneity of biological data. Here, a framework for the development of cloud-based specialized biological databases is proposed. It is powered by a distributed biodata retrieval system that can interface with different data formats and provides an integrated way to explore data. The proposed framework was implemented using Java as the development environment and MongoDB as the database manager. Syntactic analysis was based on the open libraries BSON, jsoup, Apache Commons and w3c.dom. The framework is available at http://nbel-lab.com and is distributed under a Creative Commons agreement.
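
To make the idea of a unified data model more concrete, the following is a minimal sketch and not the framework's actual code. It assumes that a record from any source can be flattened into field/value pairs, and it uses only the JDK (including org.w3c.dom, which the abstract names). The class name UnifiedRecordSketch, the methods fromXml and fromTsv, and the sample gene records are hypothetical and exist only for illustration. Persisting the resulting maps as MongoDB documents is left out.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;

/** Hypothetical illustration of a unified record model; not the published framework's API. */
final class UnifiedRecordSketch {

    /** Flattens the direct child elements of an XML root into field/value pairs. */
    static Map<String, String> fromXml(String xml) throws Exception {
        Document doc = DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));
        Map<String, String> record = new LinkedHashMap<>();
        NodeList children = doc.getDocumentElement().getChildNodes();
        for (int i = 0; i < children.getLength(); i++) {
            Node child = children.item(i);
            // Skip whitespace text nodes; keep only element children.
            if (child instanceof Element) {
                record.put(child.getNodeName(), child.getTextContent().trim());
            }
        }
        return record;
    }

    /** Maps one tab-separated data row onto the field names given in a header row. */
    static Map<String, String> fromTsv(String header, String row) {
        String[] names = header.split("\t", -1);
        String[] values = row.split("\t", -1);
        Map<String, String> record = new LinkedHashMap<>();
        for (int i = 0; i < names.length; i++) {
            // Missing trailing columns become empty strings.
            record.put(names[i].trim(), i < values.length ? values[i].trim() : "");
        }
        return record;
    }

    public static void main(String[] args) throws Exception {
        // Made-up sample inputs in two different source formats.
        Map<String, String> fromXmlSource =
                fromXml("<gene><symbol>TP53</symbol><organism>Homo sapiens</organism></gene>");
        Map<String, String> fromTsvSource =
                fromTsv("symbol\torganism", "BRCA1\tHomo sapiens");
        System.out.println(fromXmlSource);
        System.out.println(fromTsvSource);
    }
}
```

Both parsers return the same map shape, so downstream code (for example, a query layer or a document store) would not need to know which source format a record came from. That is the design choice the sketch is meant to show.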


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7cac/6233384/f681c69ef10b/13040_2018_185_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7cac/6233384/053056a3bba8/13040_2018_185_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7cac/6233384/2a540f7bc7b4/13040_2018_185_Fig2_HTML.jpg


Cited by

1
Towards a European health research and innovation cloud (HRIC).
Genome Med. 2020 Feb 19;12(1):18. doi: 10.1186/s13073-020-0713-z.
