Efstathiou Christos I, Adams Elizabeth, Coats Carlie J, Zelt Robert, Reed Mark, McGee John, Foley Kristen M, Sidi Fahim I, Wong David C, Fine Steven, Arunachalam Saravanan
Institute for the Environment, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA.
Research Computing, Information Technology Services, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA.
Geosci Model Dev. 2024 Sep 19;17(18):7001-7027. doi: 10.5194/gmd-17-7001-2024.
The Community Multiscale Air Quality Model (CMAQ) is a local- to hemispheric-scale numerical air quality modeling system developed by the U.S. Environmental Protection Agency (USEPA) and supported by the Community Modeling and Analysis System (CMAS) center. CMAQ is used for regulatory purposes by the USEPA program offices and by state and local air agencies. It is also widely used by the global research community to simulate and understand complex air quality processes and for computational studies of environmental fate and transport and of climate and health impacts. Leveraging state-of-the-science cloud computing resources for high-performance computing (HPC) applications, CMAQ is now available as a fully tested, publicly available technology stack (HPC cluster and software stack) on two major cloud service providers (CSPs). Specifically, CMAQ configurations and supporting materials have been developed for use on these providers' HPC clusters, including extensive online documentation, tutorials, and guidelines for scaling and optimizing air quality simulations with their services. These resources allow modelers to rapidly bring together CMAQ, cloud-hosted datasets, and visualization and evaluation tools on ephemeral clusters that can be deployed quickly and reliably worldwide. This paper describes considerations for using CMAQ version 5.3.3 on the cloud and the supported resources for each CSP, presented through a benchmark application suite developed as an example of a typical simulation for testing and verifying components of the modeling system. The aims of this effort are to report findings from performing CMAQ simulations on the cloud using popular vendor-provided resources, to enable the user community to adapt this approach to their own needs, and to identify specific areas of potential optimization with respect to storage and compute architectures.