• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

Hadoop MapReduce中的实验分析:深入探讨故障检测与恢复技术

Experimental Analysis in Hadoop MapReduce: A Closer Look at Fault Detection and Recovery Techniques.

作者信息

Saadoon Muntadher, Hamid Siti Hafizah Ab, Sofian Hazrina, Altarturi Hamza, Nasuha Nur, Azizul Zati Hakim, Sani Asmiza Abdul, Asemi Adeleh

机构信息

Department of Software Engineering, Faculty of Computer Science and Information Technology, University Malaya, Kuala Lumpur 50603, Malaysia.

出版信息

Sensors (Basel). 2021 May 31;21(11):3799. doi: 10.3390/s21113799.

DOI:10.3390/s21113799
PMID:34072632
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8199096/
Abstract

Hadoop MapReduce reactively detects and recovers faults after they occur based on the static heartbeat detection and the re-execution from scratch techniques. However, these techniques lead to excessive response time penalties and inefficient resource consumption during detection and recovery. Existing fault-tolerance solutions intend to mitigate the limitations without considering critical conditions such as fail-slow faults, the impact of faults at various infrastructure levels and the relationship between the detection and recovery stages. This paper analyses the response time under two main conditions: fail-stop and fail-slow, when they manifest with node, service, and the task at runtime. In addition, we focus on the relationship between the time for detecting and recovering faults. The experimental analysis is conducted on a real Hadoop cluster comprising MapReduce, YARN and HDFS frameworks. Our analysis shows that the recovery of a single fault leads to an average of 67.6% response time penalty. Even though the detection and recovery times are well-turned, data locality and resource availability must also be considered to obtain the optimum tolerance time and the lowest penalties.

摘要

Hadoop MapReduce基于静态心跳检测和从头重新执行技术,在故障发生后以反应式方式检测并恢复故障。然而,这些技术在检测和恢复期间会导致过长的响应时间惩罚以及资源消耗效率低下。现有的容错解决方案试图减轻这些限制,但未考虑诸如故障缓慢、不同基础设施级别故障的影响以及检测和恢复阶段之间的关系等关键情况。本文分析了在两种主要情况下的响应时间:故障停止和故障缓慢,这两种情况在运行时表现为节点、服务和任务故障。此外,我们关注故障检测时间和恢复时间之间的关系。实验分析是在一个包含MapReduce、YARN和HDFS框架的真实Hadoop集群上进行的。我们的分析表明,单个故障的恢复平均会导致67.6%的响应时间惩罚。即使检测和恢复时间调整得很好,为了获得最佳的容错时间和最低的惩罚,还必须考虑数据局部性和资源可用性。

相似文献

1
Experimental Analysis in Hadoop MapReduce: A Closer Look at Fault Detection and Recovery Techniques.Hadoop MapReduce中的实验分析:深入探讨故障检测与恢复技术
Sensors (Basel). 2021 May 31;21(11):3799. doi: 10.3390/s21113799.
2
Applications of the MapReduce programming framework to clinical big data analysis: current landscape and future trends.MapReduce 编程框架在临床大数据分析中的应用:现状与未来趋势。
BioData Min. 2014 Oct 29;7:22. doi: 10.1186/1756-0381-7-22. eCollection 2014.
3
Using Hadoop MapReduce for Parallel Genetic Algorithms: A Comparison of the Global, Grid and Island Models.使用Hadoop MapReduce实现并行遗传算法:全局模型、网格模型和孤岛模型的比较
Evol Comput. 2018 Winter;26(4):535-567. doi: 10.1162/evco_a_00213. Epub 2017 Jun 29.
4
STDADS: An Efficient Slow Task Detection Algorithm for Deadline Schedulers.STDADS:一种用于截止期调度器的高效慢速任务检测算法。
Big Data. 2020 Feb;8(1):62-69. doi: 10.1089/big.2019.0039. Epub 2020 Jan 29.
5
An overview of the Hadoop/MapReduce/HBase framework and its current applications in bioinformatics.Hadoop/MapReduce/HBase 框架概述及其在生物信息学中的当前应用。
BMC Bioinformatics. 2010 Dec 21;11 Suppl 12(Suppl 12):S1. doi: 10.1186/1471-2105-11-S12-S1.
6
High-Availability Computing Platform with Sensor Fault Resilience.具备传感器故障恢复能力的高可用性计算平台。
Sensors (Basel). 2021 Jan 13;21(2):542. doi: 10.3390/s21020542.
7
Design and development of a medical big data processing system based on Hadoop.基于Hadoop的医学大数据处理系统的设计与开发。
J Med Syst. 2015 Mar;39(3):23. doi: 10.1007/s10916-015-0220-8. Epub 2015 Feb 10.
8
Demonstration of Hadoop-GIS: A Spatial Data Warehousing System Over MapReduce.Hadoop-GIS演示:一种基于MapReduce的空间数据仓库系统
Proc ACM SIGSPATIAL Int Conf Adv Inf. 2013 Nov;2013:528-531. doi: 10.1145/2525314.2525320.
9
Leverage hadoop framework for large scale clinical informatics applications.利用Hadoop框架进行大规模临床信息学应用。
AMIA Jt Summits Transl Sci Proc. 2013 Mar 18;2013:53. eCollection 2013.
10
Fault Injection with Multiple Fault Patterns for Experimental Evaluation of Demand-Controlled Ventilation and Heating Systems.用于需求控制通风和加热系统实验评估的具有多种故障模式的故障注入
Sensors (Basel). 2022 Oct 25;22(21):8180. doi: 10.3390/s22218180.