• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

异构分布式环境中运行时任务执行时间的估计准确性

Estimation Accuracy on Execution Time of Run-Time Tasks in a Heterogeneous Distributed Environment.

作者信息

Liu Qi, Cai Weidong, Jin Dandan, Shen Jian, Fu Zhangjie, Liu Xiaodong, Linge Nigel

机构信息

Jiangsu Collaborative Innovation Centre of Atmospheric Environment and Equipment Technology (CICAEET), Nanjing University of Information Science & Technology, Nanjing 210044, China.

School of Computer & Software, Nanjing University of Information Science & Technology, Nanjing 210044, China.

出版信息

Sensors (Basel). 2016 Aug 30;16(9):1386. doi: 10.3390/s16091386.

DOI:10.3390/s16091386
PMID:27589753
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5038664/
Abstract

Distributed Computing has achieved tremendous development since cloud computing was proposed in 2006, and played a vital role promoting rapid growth of data collecting and analysis models, e.g., Internet of things, Cyber-Physical Systems, Big Data Analytics, etc. Hadoop has become a data convergence platform for sensor networks. As one of the core components, MapReduce facilitates allocating, processing and mining of collected large-scale data, where speculative execution strategies help solve straggler problems. However, there is still no efficient solution for accurate estimation on execution time of run-time tasks, which can affect task allocation and distribution in MapReduce. In this paper, task execution data have been collected and employed for the estimation. A two-phase regression (TPR) method is proposed to predict the finishing time of each task accurately. Detailed data of each task have drawn interests with detailed analysis report being made. According to the results, the prediction accuracy of concurrent tasks' execution time can be improved, in particular for some regular jobs.

摘要

自2006年云计算被提出以来,分布式计算取得了巨大发展,并在促进数据收集和分析模型(如物联网、信息物理系统、大数据分析等)的快速增长方面发挥了至关重要的作用。Hadoop已成为传感器网络的数据融合平台。作为核心组件之一,MapReduce有助于对收集到的大规模数据进行分配、处理和挖掘,其中推测执行策略有助于解决掉队者问题。然而,对于运行时任务的执行时间仍没有有效的准确估计解决方案,这可能会影响MapReduce中的任务分配和分布。在本文中,收集了任务执行数据并用于估计。提出了一种两阶段回归(TPR)方法来准确预测每个任务的完成时间。每个任务的详细数据通过详细的分析报告引起了关注。根据结果,可以提高并发任务执行时间的预测准确性,特别是对于一些常规作业。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbf0/5038664/7cc2cf84e571/sensors-16-01386-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbf0/5038664/530665c56c00/sensors-16-01386-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbf0/5038664/fe75cdd6bc9a/sensors-16-01386-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbf0/5038664/5feb60f63a39/sensors-16-01386-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbf0/5038664/0f99d7a59f14/sensors-16-01386-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbf0/5038664/4245d5c5ce1d/sensors-16-01386-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbf0/5038664/3f136b0e595f/sensors-16-01386-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbf0/5038664/a3e09cc765b4/sensors-16-01386-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbf0/5038664/7cc2cf84e571/sensors-16-01386-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbf0/5038664/530665c56c00/sensors-16-01386-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbf0/5038664/fe75cdd6bc9a/sensors-16-01386-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbf0/5038664/5feb60f63a39/sensors-16-01386-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbf0/5038664/0f99d7a59f14/sensors-16-01386-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbf0/5038664/4245d5c5ce1d/sensors-16-01386-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbf0/5038664/3f136b0e595f/sensors-16-01386-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbf0/5038664/a3e09cc765b4/sensors-16-01386-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fbf0/5038664/7cc2cf84e571/sensors-16-01386-g008.jpg

相似文献

1
Estimation Accuracy on Execution Time of Run-Time Tasks in a Heterogeneous Distributed Environment.异构分布式环境中运行时任务执行时间的估计准确性
Sensors (Basel). 2016 Aug 30;16(9):1386. doi: 10.3390/s16091386.
2
STDADS: An Efficient Slow Task Detection Algorithm for Deadline Schedulers.STDADS:一种用于截止期调度器的高效慢速任务检测算法。
Big Data. 2020 Feb;8(1):62-69. doi: 10.1089/big.2019.0039. Epub 2020 Jan 29.
3
Applications of the MapReduce programming framework to clinical big data analysis: current landscape and future trends.MapReduce 编程框架在临床大数据分析中的应用:现状与未来趋势。
BioData Min. 2014 Oct 29;7:22. doi: 10.1186/1756-0381-7-22. eCollection 2014.
4
Behavior life style analysis for mobile sensory data in cloud computing through MapReduce.基于MapReduce的云计算环境下移动感知数据行为生活方式分析。
Sensors (Basel). 2014 Nov 20;14(11):22001-20. doi: 10.3390/s141122001.
5
Parallel MapReduce: Maximizing Cloud Resource Utilization and Performance Improvement Using Parallel Execution Strategies.并行 MapReduce:利用并行执行策略最大化云资源利用率和提升性能。
Biomed Res Int. 2018 Oct 17;2018:7501042. doi: 10.1155/2018/7501042. eCollection 2018.
6
MRPack: Multi-Algorithm Execution Using Compute-Intensive Approach in MapReduce.MRPack:在MapReduce中使用计算密集型方法的多算法执行
PLoS One. 2015 Aug 25;10(8):e0136259. doi: 10.1371/journal.pone.0136259. eCollection 2015.
7
Long Read Alignment with Parallel MapReduce Cloud Platform.使用并行MapReduce云平台进行长读段比对
Biomed Res Int. 2015;2015:807407. doi: 10.1155/2015/807407. Epub 2015 Dec 29.
8
Scheduling-Guided Automatic Processing of Massive Hyperspectral Image Classification on Cloud Computing Architectures.云计算架构上大规模高光谱图像分类的调度引导自动处理。
IEEE Trans Cybern. 2021 Jul;51(7):3588-3601. doi: 10.1109/TCYB.2020.3026673. Epub 2021 Jun 23.
9
Reduced Time Compression in Big Data Using MapReduce Approach and Hadoop.使用 MapReduce 方法和 Hadoop 减少大数据的时间压缩。
J Med Syst. 2019 Jun 19;43(8):239. doi: 10.1007/s10916-019-1369-3.
10
A Novel Hadoop Security Model for Addressing Malicious Collusive Workers.一种用于解决恶意共谋工作者的新型 Hadoop 安全模型。
Comput Intell Neurosci. 2021 Jul 8;2021:5753948. doi: 10.1155/2021/5753948. eCollection 2021.

引用本文的文献

1
A Distributed Parallel Algorithm Based on Low-Rank and Sparse Representation for Anomaly Detection in Hyperspectral Images.基于低秩稀疏表示的高光谱图像异常检测分布式并行算法。
Sensors (Basel). 2018 Oct 25;18(11):3627. doi: 10.3390/s18113627.

本文引用的文献

1
Behavior life style analysis for mobile sensory data in cloud computing through MapReduce.基于MapReduce的云计算环境下移动感知数据行为生活方式分析。
Sensors (Basel). 2014 Nov 20;14(11):22001-20. doi: 10.3390/s141122001.