ARCHI: A New R Package for Automated Imputation of Regionally Correlated Hydrologic Records.

Authors

Levy Zeno F, Glas Robin L, Stagnitta Timothy J, Terry Neil

Affiliations

U.S. Geological Survey, California Water Science Center, 6000 J Street, Placer Hall, Sacramento, CA, 95819, USA.

U.S. Geological Survey, New York Water Science Center, 425 Jordan Road, Troy, NY, 12180, USA.

Publication information

Ground Water. 2025 Jul-Aug;63(4):595-610. doi: 10.1111/gwat.13474. Epub 2025 Feb 28.

DOI: 10.1111/gwat.13474
PMID: 40019092
Abstract

Missing data in hydrological records can limit resource assessment, process understanding, and predictive modeling. Here, we present ARCHI (Automated Regional Correlation Analysis for Hydrologic Record Imputation), a new, open-source software package in R designed to aggregate, impute, cluster, and visualize regionally correlated hydrologic records. ARCHI imputes missing data in "target" records by linear regression using more complete "reference" records as predictors. Automated imputation is implemented using a novel, iterative algorithm that allows each site to be considered a target or reference for regression, growing the pool of complete references with each imputed record until viable gap-filling ceases. Users can limit artifacts from spurious correlations by specifying model-acceptance criteria and applying geospatial, correlation, and group-based filters to control reference selection. ARCHI provides additional functions for visualizing results, clustering records with similar correlation structures, evaluating holdout data, and interactive parameterization with an accessible and intuitive graphical user interface (GUI). This methods brief provides an overview of the ARCHI package, modeling guidelines, and benchmarking on two regional groundwater-level datasets from the Central Valley, CA and Long Island, NY. We evaluate ARCHI alongside widely used multivariate imputation software to highlight and contextualize its computational efficiency, imputation accuracy, and model transparency when applied to large, groundwater-level datasets.
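The iterative idea described in the abstract can be illustrated with a minimal sketch. This is not ARCHI's API or the authors' implementation: it is written in Python with NumPy rather than R, it uses a single reference per regression, and the function name `iterative_regression_impute` and the acceptance thresholds `min_r2` and `min_overlap` are hypothetical stand-ins for the model-acceptance criteria mentioned above. It assumes that each regression is fitted only on originally observed target values, while references may contain previously imputed values, so the pool of usable references grows with each pass.

```python
import numpy as np


def iterative_regression_impute(records, min_r2=0.8, min_overlap=5):
    """Fill NaN gaps by simple linear regression on the best-correlated site.

    records: 2-D array, rows = time steps, columns = sites, NaN = missing.
    min_r2, min_overlap: hypothetical model-acceptance criteria.
    """
    filled = np.asarray(records, dtype=float).copy()
    observed = ~np.isnan(filled)  # original observations, never changes
    n_sites = filled.shape[1]

    progress = True
    while progress:  # stop once a full pass fills nothing
        progress = False
        for target in range(n_sites):
            gaps = np.isnan(filled[:, target])
            if not gaps.any():
                continue
            best = None  # (r2, slope, intercept, reference index)
            for ref in range(n_sites):
                if ref == target:
                    continue
                ref_ok = ~np.isnan(filled[:, ref])
                # Fit on rows where the target was truly observed.
                fit_rows = observed[:, target] & ref_ok
                can_fill = (gaps & ref_ok).any()
                if fit_rows.sum() < min_overlap or not can_fill:
                    continue
                x, y = filled[fit_rows, ref], filled[fit_rows, target]
                if np.ptp(x) == 0 or np.ptp(y) == 0:
                    continue  # constant series: correlation undefined
                r2 = np.corrcoef(x, y)[0, 1] ** 2
                if r2 >= min_r2 and (best is None or r2 > best[0]):
                    slope, intercept = np.polyfit(x, y, 1)
                    best = (r2, slope, intercept, ref)
            if best is not None:
                _, slope, intercept, ref = best
                fillable = gaps & ~np.isnan(filled[:, ref])
                filled[fillable, target] = slope * filled[fillable, ref] + intercept
                progress = True
    return filled


# Synthetic example: three linearly related sites and one unrelated site.
t = np.arange(10, dtype=float)
demo = np.column_stack([t, 2 * t + 1, 5 - t, (-1.0) ** t])
demo[[3, 7], 1] = np.nan  # gaps in the second site
demo[0, 2] = np.nan       # gap in the third site
demo[4, 3] = np.nan       # gap in the unrelated site
result = iterative_regression_impute(demo)
```

In this synthetic example the gaps in the second and third sites are filled from correlated sites, while the gap in the unrelated fourth site stays missing because no candidate regression meets the `min_r2` threshold. The geospatial, correlation, and group-based reference filters described in the abstract are not modeled here.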
