• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Imputation methods for addressing missing data in short-term monitoring of air pollutants.用于解决短期空气污染物监测中缺失数据的插补方法。
Sci Total Environ. 2020 Aug 15;730:139140. doi: 10.1016/j.scitotenv.2020.139140. Epub 2020 May 3.
2
Selection of statistical technique for imputation of single site-univariate and multisite-multivariate methods for particulate pollutants time series data with long gaps and high missing percentage.单站点单变量和多站点多变量方法在长时间间隔和高缺失率的颗粒物污染物时间序列数据插补中的统计技术选择。
Environ Sci Pollut Res Int. 2023 Jun;30(30):75469-75488. doi: 10.1007/s11356-023-27659-x. Epub 2023 May 23.
3
Effects of missing data imputation methods on univariate blood pressure time series data analysis and forecasting with ARIMA and LSTM.缺失数据插补方法对使用ARIMA和LSTM进行单变量血压时间序列数据分析及预测的影响。
BMC Med Res Methodol. 2024 Dec 26;24(1):320. doi: 10.1186/s12874-024-02448-3.
4
Imputation and Missing Indicators for Handling Missing Longitudinal Data: Data Simulation Analysis Based on Electronic Health Record Data.处理纵向缺失数据的插补与缺失指示符:基于电子健康记录数据的模拟分析
JMIR Med Inform. 2025 Mar 13;13:e64354. doi: 10.2196/64354.
5
A novel scaling methodology to reduce the biases associated with missing data from commercial activity monitors.一种新颖的定标方法,可减少来自商业活动监测器的缺失数据相关的偏差。
PLoS One. 2020 Jun 24;15(6):e0235144. doi: 10.1371/journal.pone.0235144. eCollection 2020.
6
The performance of prognostic models depended on the choice of missing value imputation algorithm: a simulation study.预后模型的性能取决于缺失值插补算法的选择:一项模拟研究。
J Clin Epidemiol. 2024 Dec;176:111539. doi: 10.1016/j.jclinepi.2024.111539. Epub 2024 Sep 24.
7
Dealing with missing delirium assessments in prospective clinical studies of the critically ill: a simulation study and reanalysis of two delirium studies.处理危重症患者前瞻性临床研究中缺失的谵妄评估:一项模拟研究和两项谵妄研究的重新分析。
BMC Med Res Methodol. 2021 May 6;21(1):97. doi: 10.1186/s12874-021-01274-1.
8
Handling of Missing Outcome Data in Acute Stroke Trials: Advantages of Multiple Imputation Using Baseline and Postbaseline Variables.急性中风试验中缺失结局数据的处理:使用基线和基线后变量进行多重填补的优势
J Stroke Cerebrovasc Dis. 2018 Dec;27(12):3662-3669. doi: 10.1016/j.jstrokecerebrovasdis.2018.08.040. Epub 2018 Oct 6.
9
Spatial imputation for air pollutants data sets via low rank matrix completion algorithm.基于低秩矩阵补全算法的大气污染物数据集的空间插补。
Environ Int. 2020 Jun;139:105713. doi: 10.1016/j.envint.2020.105713. Epub 2020 Apr 11.
10
Outcome-sensitive multiple imputation: a simulation study.结果敏感多重填补:一项模拟研究。
BMC Med Res Methodol. 2017 Jan 9;17(1):2. doi: 10.1186/s12874-016-0281-5.

引用本文的文献

1
Filling gaps in PM2.5 time series: A broad evaluation from statistical to advanced neural network models.填补细颗粒物(PM2.5)时间序列中的空白:从统计模型到先进神经网络模型的全面评估
PLoS One. 2025 Aug 14;20(8):e0330211. doi: 10.1371/journal.pone.0330211. eCollection 2025.
2
Effects of Ambient Temperature During Pregnancy on Newborn Birthweight.孕期环境温度对新生儿出生体重的影响。
bioRxiv. 2025 Jun 15:2025.06.10.658978. doi: 10.1101/2025.06.10.658978.
3
The impact of data imputation on air quality prediction problem.数据插补对空气质量预测问题的影响。
PLoS One. 2024 Sep 12;19(9):e0306303. doi: 10.1371/journal.pone.0306303. eCollection 2024.
4
Age and mean platelet volume-based nomogram for predicting the therapeutic efficacy of metoprolol in Chinese pediatric patients with vasovagal syncope.基于年龄和平均血小板体积的列线图预测美托洛尔治疗中国儿童血管迷走性晕厥的疗效。
World J Pediatr. 2024 Sep;20(9):957-965. doi: 10.1007/s12519-024-00802-5. Epub 2024 Apr 13.
5
Technology-enhanced in vivo exposures in Prolonged Exposure for PTSD: A pilot randomized controlled trial.创伤后应激障碍延长暴露治疗中增强技术的体内暴露:一项初步随机对照试验。
J Psychiatr Res. 2022 Dec;156:467-475. doi: 10.1016/j.jpsychires.2022.10.056. Epub 2022 Nov 1.
6
Assessing temporal correlation in environmental risk factors to design efficient area-specific COVID-19 regulations: Delhi based case study.评估环境风险因素的时间相关性,以制定有效的特定区域 COVID-19 法规:基于德里的案例研究。
Sci Rep. 2022 Jul 28;12(1):12949. doi: 10.1038/s41598-022-16781-4.
7
Spatial Correlation of Air Pollution and Its Causes in Northeast China.中国东北地区空气污染及其成因的空间相关性。
Int J Environ Res Public Health. 2021 Oct 11;18(20):10619. doi: 10.3390/ijerph182010619.
8
Household and behavioral determinants of indoor PM in a rural solid fuel burning Native American community.农村地区固体燃料燃烧的美国原住民社区室内 PM 的家庭和行为决定因素。
Indoor Air. 2021 Nov;31(6):2008-2019. doi: 10.1111/ina.12904. Epub 2021 Jul 7.
9
Short-term exposure to sulphur dioxide (SO) and all-cause and respiratory mortality: A systematic review and meta-analysis.短期接触二氧化硫 (SO) 与全因和呼吸原因死亡率:系统评价和荟萃分析。
Environ Int. 2021 May;150:106434. doi: 10.1016/j.envint.2021.106434. Epub 2021 Feb 15.

本文引用的文献

1
A 'missing not at random' (MNAR) and 'missing at random' (MAR) growth model comparison with a buprenorphine/naloxone clinical trial.丁丙诺啡/纳洛酮临床试验中“非随机缺失”(MNAR)与“随机缺失”(MAR)增长模型的比较
Addiction. 2015 Jan;110(1):51-8. doi: 10.1111/add.12714. Epub 2014 Oct 16.
2
Review: a gentle introduction to imputation of missing values.综述:缺失值插补的简要介绍
J Clin Epidemiol. 2006 Oct;59(10):1087-91. doi: 10.1016/j.jclinepi.2006.01.014. Epub 2006 Jul 11.
3
Imputation of missing longitudinal data: a comparison of methods.缺失纵向数据的插补:方法比较
J Clin Epidemiol. 2003 Oct;56(10):968-76. doi: 10.1016/s0895-4356(03)00170-7.

用于解决短期空气污染物监测中缺失数据的插补方法。

Imputation methods for addressing missing data in short-term monitoring of air pollutants.

机构信息

The Mel and Enid Zuckerman College of Public Health, The University of Arizona, 1295 N. Martin Ave, Tucson, AZ 85724, USA.

The Mel and Enid Zuckerman College of Public Health, The University of Arizona, 1295 N. Martin Ave, Tucson, AZ 85724, USA.

出版信息

Sci Total Environ. 2020 Aug 15;730:139140. doi: 10.1016/j.scitotenv.2020.139140. Epub 2020 May 3.

DOI:10.1016/j.scitotenv.2020.139140
PMID:32402974
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7745257/
Abstract

Monitoring of environmental contaminants is a critical part of exposure sciences research and public health practice. Missing data are often encountered when performing short-term monitoring (<24 h) of air pollutants with real-time monitors, especially in resource-limited areas. Approaches for handling consecutive periods of missing and incomplete data in this context remain unclear. Our aim is to evaluate existing imputation methods for handling missing data for real-time monitors operating for short durations. In a current field-study, realtime PM2.5 monitors were placed outside of 20 households and ran for 24-hours. Missing data was simulated in these households at four consecutive periods of missingness (20%, 40%, 60%, 80%). Univariate (Mean, Median, Last Observation Carried Forward, Kalman Filter, Random, Markov) and multivariate time-series (Predictive Mean Matching, Row Mean Method) methods were used to impute missing concentrations, and performance was evaluated using five error metrics (Absolute Bias, Percent Absolute Error in Means, R2 Coefficient of Determination, Root Mean Square Error, Mean Absolute Error). Univariate methods of Markov, random, and mean imputations were the best performing methods that yielded 24-hour mean concentrations with the lowest error and highest R2 values across all levels of missingness. When evaluating error metrics minute-by-minute, Kalman filters, median, and Markov methods performed well at low levels of missingness (20-40%). However, at higher levels of missingness (60-80%), Markov, random, median, and mean imputation performed best on average. Multivariate methods were the worst performing imputation methods across all levels of missingness. Imputation using univariate methods may provide a reasonable solution to addressing missing data for short-term monitoring of air pollutants, especially in resource-limited areas. Further efforts are needed to evaluate imputation methods that are generalizable across a diverse range of study environments.

摘要

环境污染物监测是暴露科学研究和公共卫生实践的重要组成部分。使用实时监测器进行空气污染物的短期监测(<24 小时)时,经常会遇到缺失数据,尤其是在资源有限的地区。在这种情况下,处理连续缺失和不完整数据的方法仍不清楚。我们的目的是评估现有的用于处理实时监测器短期运行时缺失数据的插补方法。在当前的现场研究中,实时 PM2.5 监测器放置在 20 户家庭的外部,运行 24 小时。在这些家庭中,以四个连续缺失期(20%、40%、60%、80%)模拟缺失数据。使用单变量(均值、中位数、末次观测值延续、卡尔曼滤波、随机、马尔可夫)和多变量时间序列(预测均值匹配、行均值法)方法来插补缺失浓度,并使用五个误差度量(绝对偏差、均值的百分比绝对误差、R2 决定系数、均方根误差、平均绝对误差)来评估性能。在所有缺失水平下,马尔可夫、随机和均值插补的单变量方法是表现最好的方法,产生的 24 小时平均浓度误差最小,R2 值最高。在逐分钟评估误差度量时,卡尔曼滤波器、中位数和马尔可夫方法在缺失率较低(20-40%)时表现良好。然而,在更高的缺失率(60-80%)下,马尔可夫、随机、中位数和均值插补平均表现最佳。多变量方法在所有缺失水平下表现最差。使用单变量方法进行插补可能是解决空气污染物短期监测中缺失数据的合理方法,尤其是在资源有限的地区。需要进一步努力评估可推广到各种研究环境的插补方法。