Castro-Medina Felipe, Rodríguez-Mazahua Lisbeth, López-Chau Asdrúbal, Cervantes Jair, Alor-Hernández Giner, Machorro-Cano Isaac
Tecnológico Nacional de México/I. T. Orizaba, Division of Research and Postgraduate Studies, Av. Oriente 9 852. Col. Emiliano Zapata, C.P. 94320 Orizaba, Mexico.
Universidad Autónoma del Estado de México, Centro Universitario UAEM Zumpango, Camino viejo a Jilotzingo continuación Calle Rayón, Valle Hermoso, C.P. 55600 Zumpango, Estado de México, Mexico.
Entropy (Basel). 2020 Nov 30;22(12):1352. doi: 10.3390/e22121352.
Fragmentation is a design technique widely used in multimedia databases, because it produces substantial benefits in reducing response times, causing lower execution costs in each operation performed. Multimedia databases include data whose main characteristic is their large size, therefore, database administrators face a challenge of great importance, since they must contemplate the different qualities of non-trivial data. These databases over time undergo changes in their access patterns. Different fragmentation techniques presented in related studies show adequate workflows, however, some do not contemplate changes in access patterns. This paper aims to provide an in-depth review of the literature related to dynamic fragmentation of multimedia databases, to identify the main challenges, technologies employed, types of fragmentation used, and characteristics of the cost model. This review provides valuable information for database administrators by showing essential characteristics to perform proper fragmentation and to improve the performance of fragmentation schemes. The reduction of costs in fragmentation methods is one of the most desired main properties. To fulfill this objective, the works include cost models, covering different qualities. In this analysis, a set of characteristics used in the cost models of each work is presented to facilitate the creation of a new cost model including the most used qualities. In addition, different data sets or reference points used in the testing stage of each work analyzed are presented.
碎片化是一种在多媒体数据库中广泛使用的设计技术,因为它在减少响应时间方面产生了显著的效益,使得每个执行操作的成本更低。多媒体数据库包含的主要特征是其数据量巨大,因此,数据库管理员面临着一项极为重要的挑战,因为他们必须考虑到非平凡数据的不同特性。随着时间的推移,这些数据库的访问模式会发生变化。相关研究中提出的不同碎片化技术展示了适当的工作流程,然而,有些技术并未考虑到访问模式的变化。本文旨在对与多媒体数据库动态碎片化相关的文献进行深入综述,以确定主要挑战、所采用的技术、使用的碎片化类型以及成本模型的特征。通过展示进行适当碎片化和提高碎片化方案性能的基本特征,该综述为数据库管理员提供了有价值的信息。碎片化方法中成本的降低是最期望的主要特性之一。为实现这一目标,相关研究工作包含了涵盖不同特性的成本模型。在本分析中,展示了每项研究工作的成本模型中使用的一组特征,以促进创建一个包含最常用特性的新成本模型。此外,还展示了在分析的每项研究工作的测试阶段使用的不同数据集或参考点。