Google Research, Vienna, Austria.
Google Research, Mountain View, CA, USA.
Sci Data. 2023 Jan 31;10(1):61. doi: 10.1038/s41597-023-01975-w.
High-quality datasets are essential to support hydrological science and modeling. Several CAMELS (Catchment Attributes and Meteorology for Large-sample Studies) datasets exist for specific countries or regions; however, these datasets lack standardization, which makes global studies difficult. This paper introduces a dataset called Caravan (a series of CAMELS) that standardizes and aggregates seven existing large-sample hydrology datasets. Caravan includes meteorological forcing data, streamflow data, and static catchment attributes (e.g., geophysical, sociological, climatological) for 6830 catchments. Most importantly, Caravan is both a dataset and open-source software that allows members of the hydrology community to extend the dataset to new locations by extracting forcing data and catchment attributes in the cloud. Our vision is for Caravan to democratize the creation and use of globally standardized large-sample hydrology datasets. Caravan is a truly global open-source community resource.