Suppr超能文献

预测多尺度计算的队列等待时间概率。

Predicting queue wait time probabilities for multi-scale computing.

机构信息

1 Leibniz Supercomputing Centre of the Bavarian Academy of Sciences and Humanities , Boltzmannstraße 1 , 85748 Garching near Munich , Germany.

2 Poznan Supercomputing and Networking Center , Institute of Bioorganic Chemistry of the Polish Academy of Sciences , ul Z. Noskowskiego 12/14 , 61-704 Poznan , Poland.

出版信息

Philos Trans A Math Phys Eng Sci. 2019 Apr 8;377(2142):20180151. doi: 10.1098/rsta.2018.0151.

Abstract

We describe a method for queue wait time prediction in supercomputing clusters. It was designed for use as a part of multi-criteria brokering mechanisms for resource selection in a multi-site High Performance Computing environment. The aim is to incorporate the time jobs stay queued in the scheduling system into the selection criteria. Our method can also be used by the end users to estimate the time to completion of their computing jobs. It uses historical data about the particular system to make predictions. It returns a list of probability estimates of the form ( t ,  p ), where p is the probability that the job will start before time t . Times t can be chosen more or less freely when deploying the system. Compared to regression methods that only return a single number as a queue wait time estimate (usually without error bars) our prediction system provides more useful information. The probability estimates are calculated using the Bayes theorem with the naive assumption that the attributes describing the jobs are independent. They are further calibrated to make sure they are as accurate as possible, given available data. We describe our service and its REST API and the underlying methods in detail and provide empirical evidence in support of the method's efficacy. This article is part of the theme issue 'Multiscale modelling, simulation and computing: from the desktop to the exascale'.

摘要

我们描述了一种用于超级计算集群中排队等待时间预测的方法。它旨在作为多站点高性能计算环境中资源选择的多标准中介机制的一部分使用。其目的是将作业在调度系统中排队的时间纳入选择标准。我们的方法也可以被终端用户用来估计他们的计算作业的完成时间。它使用有关特定系统的历史数据进行预测。它返回一个概率估计列表,形式为 ( t,  p ),其中 p 是作业将在时间 t 之前开始的概率。在部署系统时,可以更自由地选择时间 t 。与仅返回单个数字作为队列等待时间估计的回归方法(通常没有误差条)相比,我们的预测系统提供了更有用的信息。概率估计是使用贝叶斯定理计算的,假设描述作业的属性是独立的。进一步对它们进行校准,以确保在可用数据的基础上尽可能准确。我们详细描述了我们的服务及其 REST API 和底层方法,并提供了支持该方法有效性的经验证据。本文是“多尺度建模、模拟和计算:从桌面到 exascale”主题的一部分。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/02aa/6388012/357958c23042/rsta20180151-g1.jpg

相似文献

1
Predicting queue wait time probabilities for multi-scale computing.
Philos Trans A Math Phys Eng Sci. 2019 Apr 8;377(2142):20180151. doi: 10.1098/rsta.2018.0151.
2
Multiscale modelling, simulation and computing: from the desktop to the exascale.
Philos Trans A Math Phys Eng Sci. 2019 Apr 8;377(2142):20180355. doi: 10.1098/rsta.2018.0355.
3
Multiscale computing for science and engineering in the era of exascale performance.
Philos Trans A Math Phys Eng Sci. 2019 Apr 8;377(2142):20180144. doi: 10.1098/rsta.2018.0144.
4
Assessing the scales in numerical weather and climate predictions: will exascale be the rescue?
Philos Trans A Math Phys Eng Sci. 2019 Apr 8;377(2142):20180148. doi: 10.1098/rsta.2018.0148.
5
Mastering the scales: a survey on the benefits of multiscale computing software.
Philos Trans A Math Phys Eng Sci. 2019 Apr 8;377(2142):20180147. doi: 10.1098/rsta.2018.0147.
6
Application of the extreme scaling computing pattern on multiscale fusion plasma modelling.
Philos Trans A Math Phys Eng Sci. 2019 Apr 8;377(2142):20180152. doi: 10.1098/rsta.2018.0152.
7
Multi-scale high-performance computing in astrophysics: simulating clusters with stars, binaries and planets.
Philos Trans A Math Phys Eng Sci. 2019 Apr 8;377(2142):20180153. doi: 10.1098/rsta.2018.0153.
8
Big data: the end of the scientific method?
Philos Trans A Math Phys Eng Sci. 2019 Apr 8;377(2142):20180145. doi: 10.1098/rsta.2018.0145.
9
The heterogeneous multiscale method applied to inelastic polymer mechanics.
Philos Trans A Math Phys Eng Sci. 2019 Apr 8;377(2142):20180150. doi: 10.1098/rsta.2018.0150.
10
Mesoscale modelling of soft flowing crystals.
Philos Trans A Math Phys Eng Sci. 2019 Apr 8;377(2142):20180149. doi: 10.1098/rsta.2018.0149.

引用本文的文献

1
Prediction of outpatient waiting time: using machine learning in a tertiary children's hospital.
Transl Pediatr. 2023 Nov 28;12(11):2030-2043. doi: 10.21037/tp-23-58. Epub 2023 Nov 23.
2
Multiscale modelling, simulation and computing: from the desktop to the exascale.
Philos Trans A Math Phys Eng Sci. 2019 Apr 8;377(2142):20180355. doi: 10.1098/rsta.2018.0355.
3
Application of the extreme scaling computing pattern on multiscale fusion plasma modelling.
Philos Trans A Math Phys Eng Sci. 2019 Apr 8;377(2142):20180152. doi: 10.1098/rsta.2018.0152.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验