• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
ArcaDB: A Disaggregated Query Engine for Heterogenous Computational Environments.ArcaDB:一种用于异构计算环境的分布式查询引擎。
IEEE Int Conf Cloud Comput. 2024 Jul;2024:42-53. doi: 10.1109/cloud62652.2024.00015. Epub 2024 Aug 28.
2
A oneM2M-Based Query Engine for Internet of Things (IoT) Data Streams.基于 oneM2M 的物联网 (IoT) 数据流查询引擎。
Sensors (Basel). 2018 Sep 27;18(10):3253. doi: 10.3390/s18103253.
3
SeqWare Query Engine: storing and searching sequence data in the cloud.SeqWare 查询引擎:在云端存储和搜索序列数据。
BMC Bioinformatics. 2010 Dec 21;11 Suppl 12(Suppl 12):S2. doi: 10.1186/1471-2105-11-S12-S2.
4
Performance Modeling in CUDA Streams - A Means for High-Throughput Data Processing.CUDA流中的性能建模——一种实现高吞吐量数据处理的方法。
Proc IEEE Int Conf Big Data. 2014 Oct;2014:301-310. doi: 10.1109/BigData.2014.7004245.
5
LocationSpark: In-memory Distributed Spatial Query Processing and Optimization.位置Spark:内存中分布式空间查询处理与优化
Front Big Data. 2020 Oct 16;3:30. doi: 10.3389/fdata.2020.00030. eCollection 2020.
6
A new framework based on features modeling and ensemble learning to predict query performance.基于特征建模和集成学习的查询性能预测新框架。
PLoS One. 2021 Oct 18;16(10):e0258439. doi: 10.1371/journal.pone.0258439. eCollection 2021.
7
An adaptive spark-based framework for querying large-scale NoSQL and relational databases.一种适用于查询大规模 NoSQL 和关系型数据库的基于火花的自适应框架。
PLoS One. 2021 Aug 19;16(8):e0255562. doi: 10.1371/journal.pone.0255562. eCollection 2021.
8
A distributed query execution engine of big attributed graphs.一种带属性大图的分布式查询执行引擎。
Springerplus. 2016 May 23;5(1):665. doi: 10.1186/s40064-016-2251-0. eCollection 2016.
9
Accumulating computational resource usage of genomic data analysis workflow to optimize cloud computing instance selection.积累基因组数据分析工作流程的计算资源使用情况,以优化云计算实例选择。
Gigascience. 2019 Apr 1;8(4). doi: 10.1093/gigascience/giz052.
10
TeraChem Cloud: A High-Performance Computing Service for Scalable Distributed GPU-Accelerated Electronic Structure Calculations.泰瑞化学云:用于可扩展分布式 GPU 加速电子结构计算的高性能计算服务。
J Chem Inf Model. 2020 Apr 27;60(4):2126-2137. doi: 10.1021/acs.jcim.9b01152. Epub 2020 Apr 20.

本文引用的文献

1
Efficient Join Algorithms For Large Database Tables in a Multi-GPU Environment.多GPU环境下大型数据库表的高效连接算法
Proceedings VLDB Endowment. 2020 Dec;14(4):708-720. doi: 10.14778/3436905.3436927. Epub 2020 Dec 1.
2
PubChem 2023 update.PubChem 2023 更新。
Nucleic Acids Res. 2023 Jan 6;51(D1):D1373-D1380. doi: 10.1093/nar/gkac956.

ArcaDB:一种用于异构计算环境的分布式查询引擎。

ArcaDB: A Disaggregated Query Engine for Heterogenous Computational Environments.

作者信息

Ruiz-Rohena Kristalys, Rodriguez-Martínez Manuel

机构信息

Department of Electrical and Computer Engineering, University of Puerto Rico, Mayagüez.

Department of Computer Science and Engineering, University of Puerto Rico, Mayagüez.

出版信息

IEEE Int Conf Cloud Comput. 2024 Jul;2024:42-53. doi: 10.1109/cloud62652.2024.00015. Epub 2024 Aug 28.

DOI:10.1109/cloud62652.2024.00015
PMID:39493462
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11529753/
Abstract

Modern enterprises rely on data management systems to collect, store, and analyze vast amounts of data related to their operations. Nowadays, clusters and hardware accelerators (e.g., GPUs, TPUs) have become a necessity to scale with the data processing demands in many applications related to social media, bioinformatics, surveillance systems, remote sensing, and medical informatics. Given this new scenario, the architecture of data analytics engines must evolve to take advantage of these new technological trends. In this paper, we present ArcaDB: a disaggregated query engine that leverages container technology to place operators at compute nodes that fit their performance profile. In ArcaDB, a query plan is dispatched to worker nodes that have different computing characteristics. Each operator is annotated with the preferred type of compute node for execution, and ArcaDB ensures that the operator gets picked up by the appropriate workers. We have implemented a prototype version of ArcaDB using Java, Python, and Docker containers. We have also completed a preliminary performance study of this prototype, using images and scientific data. This study shows that ArcaDB can speed up query performance by a factor of 3.5x in comparison with a shared-nothing, symmetric arrangement.

摘要

现代企业依赖数据管理系统来收集、存储和分析与运营相关的大量数据。如今,集群和硬件加速器(如GPU、TPU)已成为满足社交媒体、生物信息学、监控系统、遥感和医学信息学等众多应用中数据处理需求的必要条件。在这种新情况下,数据分析引擎的架构必须不断演进,以利用这些新的技术趋势。在本文中,我们介绍了ArcaDB:一种分布式查询引擎,它利用容器技术将操作符放置在符合其性能配置文件的计算节点上。在ArcaDB中,查询计划被调度到具有不同计算特性的工作节点。每个操作符都被标注了执行所需的首选计算节点类型,并且ArcaDB确保该操作符能被合适的工作节点选中。我们使用Java、Python和Docker容器实现了ArcaDB的原型版本。我们还使用图像和科学数据对该原型进行了初步性能研究。该研究表明,与无共享对称架构相比,ArcaDB可将查询性能提高3.5倍。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c039/11529753/105c871b3bbd/nihms-2029447-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c039/11529753/63101caa9f24/nihms-2029447-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c039/11529753/3e7d8e336542/nihms-2029447-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c039/11529753/e33b19c5b935/nihms-2029447-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c039/11529753/314c44da2e38/nihms-2029447-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c039/11529753/c139b31e78b6/nihms-2029447-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c039/11529753/87dff1028bae/nihms-2029447-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c039/11529753/105c871b3bbd/nihms-2029447-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c039/11529753/63101caa9f24/nihms-2029447-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c039/11529753/3e7d8e336542/nihms-2029447-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c039/11529753/e33b19c5b935/nihms-2029447-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c039/11529753/314c44da2e38/nihms-2029447-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c039/11529753/c139b31e78b6/nihms-2029447-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c039/11529753/87dff1028bae/nihms-2029447-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c039/11529753/105c871b3bbd/nihms-2029447-f0007.jpg