• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

重新审视多查询优化:一种全查询代数方法。

Multi-Query Optimization Revisited: A Full-Query Algebraic Method.

作者信息

Tu Yicheng, Eslami Mehrad, Xu Zichen, Charkhgard Hadi

机构信息

Dept. of Computer Science, University of South Florida, Tampa, Florida, USA.

Jiaxing Neofelis, Technology Co. Ltd, Jiaxing, Zhejiang, China.

出版信息

Proc IEEE Int Conf Big Data. 2022 Dec;2022:252-261. doi: 10.1109/bigdata55660.2022.10020338. Epub 2023 Jan 26.

DOI:10.1109/bigdata55660.2022.10020338
PMID:37637192
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10460125/
Abstract

Sharing data and computation among concurrent queries has been an active research topic in database systems. While work in this area developed algorithms and systems that are shown to be effective, there is a lack of logical foundation for query processing and optimization. In this paper, we present PsiDB, a system model for processing a large number of database queries in a batch. The key idea is to generate a single query expression that returns a global relation containing all the data needed for individual queries. For that, we propose the use of a type of relational operators called -operators in combining the individual queries into the global expression. We tackle the algebraic optimization problem in PsiDB by developing equivalence rules to transform concurrent queries with the purpose of revealing query optimization opportunities. Centering around the -operator, our rules not only cover many optimization techniques adopted in existing batch processing systems, but also revealed new optimization opportunities. Experiments conducted on an early prototype of PsiDB show a performance improvement of up to 36X over a mainstream commercial DBMS.

摘要

在并发查询之间共享数据和计算一直是数据库系统中一个活跃的研究课题。虽然该领域的工作开发了一些被证明有效的算法和系统,但查询处理和优化缺乏逻辑基础。在本文中,我们提出了PsiDB,这是一种用于批量处理大量数据库查询的系统模型。关键思想是生成一个单一的查询表达式,该表达式返回一个包含各个查询所需所有数据的全局关系。为此,我们建议使用一种称为-运算符的关系运算符类型,将各个查询组合成全局表达式。我们通过开发等价规则来处理PsiDB中的代数优化问题,以转换并发查询,目的是揭示查询优化机会。围绕-运算符,我们的规则不仅涵盖了现有批处理系统中采用的许多优化技术,还揭示了新的优化机会。在PsiDB的早期原型上进行的实验表明,与主流商业数据库管理系统相比,性能提高了多达36倍。

相似文献

1
Multi-Query Optimization Revisited: A Full-Query Algebraic Method.重新审视多查询优化:一种全查询代数方法。
Proc IEEE Int Conf Big Data. 2022 Dec;2022:252-261. doi: 10.1109/bigdata55660.2022.10020338. Epub 2023 Jan 26.
2
LocationSpark: In-memory Distributed Spatial Query Processing and Optimization.位置Spark:内存中分布式空间查询处理与优化
Front Big Data. 2020 Oct 16;3:30. doi: 10.3389/fdata.2020.00030. eCollection 2020.
3
SOOM: Sort-Based Optimizer for Big Data Multi-Query.SOOM:大数据多查询的基于排序的优化器。
Big Data. 2020 Feb;8(1):38-61. doi: 10.1089/big.2019.0023. Epub 2020 Jan 30.
4
A technique for parallel query optimization using MapReduce framework and a semantic-based clustering method.一种使用MapReduce框架和基于语义的聚类方法进行并行查询优化的技术。
PeerJ Comput Sci. 2021 Jun 1;7:e580. doi: 10.7717/peerj-cs.580. eCollection 2021.
5
Exploiting Reused-Based Sharing Work Opportunities in Big Data Multiquery Optimization with Flink.利用 Flink 在大数据多查询优化中的基于重用的共享工作机会。
Big Data. 2021 Dec;9(6):454-479. doi: 10.1089/big.2020.0141. Epub 2021 Oct 6.
6
An adaptive spark-based framework for querying large-scale NoSQL and relational databases.一种适用于查询大规模 NoSQL 和关系型数据库的基于火花的自适应框架。
PLoS One. 2021 Aug 19;16(8):e0255562. doi: 10.1371/journal.pone.0255562. eCollection 2021.
7
DCMS: A data analytics and management system for molecular simulation.DCMS:一种用于分子模拟的数据分析与管理系统。
J Big Data. 2015;2(1):9. doi: 10.1186/s40537-014-0009-5. Epub 2014 Nov 26.
8
Benchmarking attribute cardinality maps for database systems using the TPC-D specifications.
IEEE Trans Syst Man Cybern B Cybern. 2003;33(6):913-24. doi: 10.1109/TSMCB.2003.810909.
9
Efficient processing of multiple nested event pattern queries over multi-dimensional event streams based on a triaxial hierarchical model.基于三轴层次模型对多维事件流中的多个嵌套事件模式查询进行高效处理。
Artif Intell Med. 2016 Sep;72:56-71. doi: 10.1016/j.artmed.2016.08.002. Epub 2016 Aug 19.
10
Implementation of a query interface for a generic record server.为通用记录服务器实现查询接口。
Int J Med Inform. 2008 Nov;77(11):754-64. doi: 10.1016/j.ijmedinf.2008.05.003. Epub 2008 Jul 9.