
Load Balancing Algorithms for Hadoop Cluster in Unbalanced Environment.

Affiliations

School of Computer Science and Technology, China University of Mining and Technology, Xuzhou, Jiangsu 221116, China.

Jiangsu Vocational College of Finance and Economics, Huai'an, Jiangsu 223003, China.

Publication information

Comput Intell Neurosci. 2022 Oct 7;2022:1545024. doi: 10.1155/2022/1545024. eCollection 2022.

DOI: 10.1155/2022/1545024
PMID: 36248928
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9568311/
Abstract

In job scheduling, cluster load should be balanced in advance rather than remedied once it has become seriously unbalanced. This paper therefore analyzes the task scheduling flow of the Hadoop cluster in depth. A self-dividing algorithm is proposed for load balancing on the Hadoop platform, and an intelligent optimization algorithm is used to solve the load balancing problem. From the point of view of task scheduling, a dynamic feedback load balancing scheduling method is proposed. To address the shortcomings of the fair scheduling algorithm, the paper proposes two ways to improve the resource utilization and overall performance of Hadoop. When the map tasks are complete and the reduce tasks are assigned, the assignment is based on the performance of the nodes that will run the reduce tasks. The approach combines the advantages of the ant colony algorithm and the bee colony algorithm so that the fused algorithm can better handle load balancing. Three existing scheduling algorithms are then introduced in detail: single queue scheduling, capacity scheduling, and fair scheduling. On this basis, an improved task scheduling strategy based on a genetic algorithm is proposed to allocate and execute application tasks and reduce task completion time. Experiments verify the effectiveness of the algorithm. The LBNP algorithm greatly improves the efficiency of reduce task execution and job execution. The delay capacity scheduling algorithm ensures that most tasks achieve localized scheduling, which improves resource utilization and load balance and shortens job completion time.
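
The abstract states that reduce tasks are assigned according to the performance of the nodes that will run them, but it gives no formula or pseudocode. The following is only a minimal sketch of that general idea, not the paper's LBNP algorithm: it assumes each reduce task has a known input size and each node has a single relative speed score, and it greedily places the largest remaining task on the node that would finish it earliest. The function name `assign_reduce_tasks`, the task and node names, and the speed values are all hypothetical.

```python
from typing import Dict, List, Tuple


def assign_reduce_tasks(
    task_sizes: Dict[str, float],
    node_speeds: Dict[str, float],
) -> Tuple[Dict[str, List[str]], Dict[str, float]]:
    """Greedy, performance-aware placement of reduce tasks (illustrative only).

    task_sizes: assumed input size per reduce task (arbitrary units).
    node_speeds: assumed relative processing speed per node (units per second).
    Returns the task list per node and each node's estimated busy time.
    """
    assignment: Dict[str, List[str]] = {node: [] for node in node_speeds}
    busy_until: Dict[str, float] = {node: 0.0 for node in node_speeds}

    # Place large tasks first so that small tasks can even out the load later.
    ordered = sorted(task_sizes.items(), key=lambda kv: (-kv[1], kv[0]))
    for task, size in ordered:
        # Pick the node with the earliest estimated finish time for this task.
        # Iterating over sorted node names makes tie-breaking deterministic.
        best = min(
            sorted(node_speeds),
            key=lambda node: busy_until[node] + size / node_speeds[node],
        )
        assignment[best].append(task)
        busy_until[best] += size / node_speeds[best]

    return assignment, busy_until


if __name__ == "__main__":
    tasks = {"r1": 8.0, "r2": 4.0, "r3": 4.0, "r4": 2.0}
    nodes = {"fast": 2.0, "slow": 1.0}
    plan, busy = assign_reduce_tasks(tasks, nodes)
    print(plan)
    print(busy)
```

In this toy setup the faster node receives more work, so both nodes are estimated to finish at about the same time. A real scheduler would also need to account for data locality, node load measured at run time, and task failures, none of which are modeled here.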



Similar articles

1
Load Balancing Algorithms for Hadoop Cluster in Unbalanced Environment.
Comput Intell Neurosci. 2022 Oct 7;2022:1545024. doi: 10.1155/2022/1545024. eCollection 2022.
2
Cloud-Based Advanced Shuffled Frog Leaping Algorithm for Tasks Scheduling.
Big Data. 2024 Apr;12(2):110-126. doi: 10.1089/big.2022.0095. Epub 2023 Mar 3.
3
A deadline constrained scheduling algorithm for cloud computing system based on the driver of dynamic essential path.
PLoS One. 2019 Mar 8;14(3):e0213234. doi: 10.1371/journal.pone.0213234. eCollection 2019.
4
AdPSO: Adaptive PSO-Based Task Scheduling Approach for Cloud Computing.
Sensors (Basel). 2022 Jan 25;22(3):920. doi: 10.3390/s22030920.
5
GCWOAS2: Multiobjective Task Scheduling Strategy Based on Gaussian Cloud-Whale Optimization in Cloud Computing.
Comput Intell Neurosci. 2021 Jun 10;2021:5546758. doi: 10.1155/2021/5546758. eCollection 2021.
6
Load Balancing in Cloud Computing Environment Using Improved Weighted Round Robin Algorithm for Nonpreemptive Dependent Tasks.
ScientificWorldJournal. 2016;2016:3896065. doi: 10.1155/2016/3896065. Epub 2016 Feb 3.
7
A short-term operating room surgery scheduling problem integrating multiple nurses roster constraints.
Artif Intell Med. 2015 Feb;63(2):91-106. doi: 10.1016/j.artmed.2014.12.005. Epub 2014 Dec 12.
8
STDADS: An Efficient Slow Task Detection Algorithm for Deadline Schedulers.
Big Data. 2020 Feb;8(1):62-69. doi: 10.1089/big.2019.0039. Epub 2020 Jan 29.
9
EOTE-FSC: An efficient offloaded task execution for fog enabled smart cities.
PLoS One. 2024 Apr 5;19(4):e0298363. doi: 10.1371/journal.pone.0298363. eCollection 2024.
10
An Observation Scheduling Approach Based on Task Clustering for High-Altitude Airship.
Sensors (Basel). 2022 Mar 6;22(5):2050. doi: 10.3390/s22052050.
