Lynker, Fort Collins, CO, USA.
University of California, Santa Barbara, USA.
Sci Data. 2023 Oct 20;10(1):725. doi: 10.1038/s41597-023-02316-7.
In 2016, the National Oceanic and Atmospheric Administration deployed the first iteration of an operational National Water Model (NWM) to forecast the water cycle in the continental United States. With many versions, an hourly, multi-decadal historic simulation is made available to the public. In all released to date, the files containing simulated streamflow contain a snapshot of model conditions across the entire domain for a single timestep which makes accessing time series a technical and resource-intensive challenge. In the most recent release, extracting a complete streamflow time series for a single location requires managing 367,920 files (~16.2 TB). In this work we describe a reproducable process for restructuring a sequential set of NWM steamflow files for efficient time series access and provide restructured datasets for versions 1.2 (1993-2018), 2.0 (1993-2020), and 2.1 (1979-2022). These datasets have been made accessible via an OPeNDAP enabled THREDDS data server for public use and a brief analysis highlights the latest version of the model should not be assumed best for all locations. Laslty, we describe an R package that expedites data retrieval with examples for multiple use-cases.
2016 年,美国国家海洋和大气管理局部署了首个运行中的国家水模型 (NWM) 版本,以预测美国大陆的水循环。该模型有多个版本,提供了一个每小时、多十年的历史模拟供公众使用。在迄今为止发布的所有版本中,包含模拟流量的文件包含整个模型域在单个时间步长的模型条件的快照,这使得访问时间序列成为一项技术和资源密集型挑战。在最新版本中,提取单个位置的完整流量时间序列需要管理 367,920 个文件(约 16.2 TB)。在这项工作中,我们描述了一种可重复的过程,用于对 NWM 蒸汽流文件进行重组,以便高效地访问时间序列,并提供了版本 1.2(1993-2018 年)、2.0(1993-2020 年)和 2.1(1979-2022 年)的重组数据集。这些数据集已通过启用 OPeNDAP 的 THREDDS 数据服务器供公众使用,并进行了简要分析,强调不应假设最新版本的模型适用于所有位置。最后,我们描述了一个 R 包,它通过多个用例的示例加快了数据检索。