Pennino Michael J, Stamp Jen, Leppo Erik W, Gibbs David A, Bierwagen Britta G
U.S. Environmental Protection Agency, Office of Research and Development, Center for Public Health and Environmental Assessment, U.S. EPA, Ronald Reagan Building 71277, 1300 Pennsylvania Ave., NW, MC8623R, Washington, D.C. 20004, United States.
Tetra Tech Center for Ecological Sciences, United States.
SoftwareX. 2025 Mar 22;30:102124. doi: 10.1016/j.softx.2025.102124.
The ContDataQC R package is a free, open-source tool that was developed to help water quality monitoring programs perform quality control (QC) procedures on continuous sensor data. ContDataQC helps users speed up and standardize the QC process, minimize undetected data errors, and make full use of their sensor data. It has three main functions: generate QC reports to detect anomalies and erroneous data values, merge QC'd data files from different time periods, and generate time series plots and basic summary statistics. ContDataQC is currently configured to run on nine different parameters: air and water temperature, dissolved oxygen, conductivity, chlorophyll-a, air and water pressure, sensor depth, pH, turbidity, and salinity. Users can add new parameters and customize many of the requirements by editing a plain text configuration file. A web app version, through R Shiny, is available within the package or via a weblink. If accessed via the URL, it will not require the installation of R software. In this paper, we describe the main functions of ContDataQC and discuss how it is being applied in long-term regional monitoring networks for streams and lakes. Both the R Shiny web app and the R package are for users who have no existing workflow for sensor data and wish to adopt the approach of ContDataQC (which has a particular organizational scheme and sequential workflow). People without R coding experience can use the Shiny app, which has a more user-friendly interface, while users who are proficient in R may choose to use the code package.
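The three functions described above (flagging suspect values, merging QC'd files from different time periods, and summarizing the result) can be illustrated with a minimal Python sketch. This is not ContDataQC code and does not use its API. The parameter name `water_temp_c`, the threshold values, the `pass`/`suspect`/`fail` labels, and every function name below are hypothetical, chosen only to show the general shape of a configuration-driven range check.

```python
from statistics import mean

# Hypothetical thresholds standing in for entries in a plain text configuration file.
# The numbers are illustrative only and are NOT ContDataQC defaults.
THRESHOLDS = {
    "water_temp_c": {
        "fail_lo": -2.0, "fail_hi": 35.0,
        "suspect_lo": -0.5, "suspect_hi": 30.0,
    },
}


def flag_value(parameter, value, thresholds=THRESHOLDS):
    """Return 'fail', 'suspect', or 'pass' for one reading (simple range check)."""
    t = thresholds[parameter]
    if value < t["fail_lo"] or value > t["fail_hi"]:
        return "fail"
    if value < t["suspect_lo"] or value > t["suspect_hi"]:
        return "suspect"
    return "pass"


def qc_records(parameter, records, thresholds=THRESHOLDS):
    """Attach a flag to each (timestamp, value) record."""
    return [(ts, v, flag_value(parameter, v, thresholds)) for ts, v in records]


def merge_periods(*periods):
    """Combine QC'd records from several time periods, sorted by timestamp."""
    merged = [rec for period in periods for rec in period]
    return sorted(merged, key=lambda rec: rec[0])


def summarize(records):
    """Basic summary statistics over the readings that passed the check."""
    passed = [v for _, v, flag in records if flag == "pass"]
    if not passed:
        return {"n": 0}
    return {"n": len(passed), "min": min(passed), "max": max(passed), "mean": mean(passed)}


# Two made-up deployment periods; ISO 8601 timestamps sort correctly as strings.
spring = qc_records("water_temp_c", [("2024-04-01T00:00", 8.1), ("2024-04-01T01:00", 41.0)])
summer = qc_records("water_temp_c", [("2024-07-01T00:00", 21.4), ("2024-07-01T01:00", 31.2)])

merged = merge_periods(summer, spring)
for record in merged:
    print(record)
print(summarize(merged))
```

Keeping the thresholds in a separate mapping mirrors the idea of customizing requirements through a configuration file: adding a parameter means adding an entry, not changing the checking logic.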