• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

针对存在缺失值和低于阈值测量值的多变量数据的多重填补:北极地区污染物的时间序列浓度

Multiple imputation for multivariate data with missing and below-threshold measurements: time-series concentrations of pollutants in the Arctic.

作者信息

Hopke P K, Liu C, Rubin D B

机构信息

Department of Chemistry, Clarkson University, Potsdam, New York 13699, USA.

出版信息

Biometrics. 2001 Mar;57(1):22-33. doi: 10.1111/j.0006-341x.2001.00022.x.

DOI:10.1111/j.0006-341x.2001.00022.x
PMID:11252602
Abstract

Many chemical and environmental data sets are complicated by the existence of fully missing values or censored values known to lie below detection thresholds. For example, week-long samples of airborne particulate matter were obtained at Alert, NWT, Canada, between 1980 and 1991, where some of the concentrations of 24 particulate constituents were coarsened in the sense of being either fully missing or below detection limits. To facilitate scientific analysis, it is appealing to create complete data by filling in missing values so that standard complete-data methods can be applied. We briefly review commonly used strategies for handling missing values and focus on the multiple-imputation approach, which generally leads to valid inferences when faced with missing data. Three statistical models are developed for multiply imputing the missing values of airborne particulate matter. We expect that these models are useful for creating multiple imputations in a variety of incomplete multivariate time series data sets.

摘要

许多化学和环境数据集因存在完全缺失值或已知低于检测阈值的截尾值而变得复杂。例如,1980年至1991年期间在加拿大西北地区的阿勒特采集了为期一周的空气传播颗粒物样本,其中24种颗粒物成分的一些浓度在完全缺失或低于检测限的意义上被粗略化了。为便于科学分析,通过填补缺失值来创建完整数据很有吸引力,这样就可以应用标准的完整数据方法。我们简要回顾了处理缺失值的常用策略,并重点关注多重填补方法,当面对缺失数据时,该方法通常能得出有效的推断。开发了三种统计模型来多重填补空气传播颗粒物的缺失值。我们期望这些模型可用于在各种不完整的多元时间序列数据集中创建多重填补。

相似文献

1
Multiple imputation for multivariate data with missing and below-threshold measurements: time-series concentrations of pollutants in the Arctic.针对存在缺失值和低于阈值测量值的多变量数据的多重填补:北极地区污染物的时间序列浓度
Biometrics. 2001 Mar;57(1):22-33. doi: 10.1111/j.0006-341x.2001.00022.x.
2
A context-intensive approach to imputation of missing values in data sets from networks of environmental monitors.一种针对环境监测网络数据集中缺失值插补的上下文密集型方法。
J Air Waste Manag Assoc. 2016 Jan;66(1):38-52. doi: 10.1080/10962247.2015.1108251.
3
Selection of statistical technique for imputation of single site-univariate and multisite-multivariate methods for particulate pollutants time series data with long gaps and high missing percentage.单站点单变量和多站点多变量方法在长时间间隔和高缺失率的颗粒物污染物时间序列数据插补中的统计技术选择。
Environ Sci Pollut Res Int. 2023 Jun;30(30):75469-75488. doi: 10.1007/s11356-023-27659-x. Epub 2023 May 23.
4
Imputation of data values that are less than a detection limit.低于检测限的数据值的插补。
J Occup Environ Hyg. 2004 Jul;1(7):436-41. doi: 10.1080/15459620490462797.
5
Reproducibility and imputation of air toxics data.空气有毒物质数据的可重复性与插补
J Environ Monit. 2007 Dec;9(12):1358-72. doi: 10.1039/b709816b. Epub 2007 Oct 12.
6
Part 1. Statistical Learning Methods for the Effects of Multiple Air Pollution Constituents.第1部分. 多种空气污染成分影响的统计学习方法
Res Rep Health Eff Inst. 2015 Jun(183 Pt 1-2):5-50.
7
Multiple imputation for handling missing outcome data when estimating the relative risk.采用多重插补处理估计相对危险度时丢失的结局数据。
BMC Med Res Methodol. 2017 Sep 6;17(1):134. doi: 10.1186/s12874-017-0414-5.
8
Resolving the long-term trends of polycyclic aromatic hydrocarbons in the Canadian Arctic atmosphere.
Environ Sci Technol. 2006 May 15;40(10):3217-22. doi: 10.1021/es052346l.
9
Handling missing rows in multi-omics data integration: multiple imputation in multiple factor analysis framework.多组学数据整合中缺失行的处理:多因素分析框架下的多重填补
BMC Bioinformatics. 2016 Oct 3;17(1):402. doi: 10.1186/s12859-016-1273-5.
10
Multivariate outlier detection applied to multiply imputed laboratory data.应用于多重填补实验室数据的多变量异常值检测。
Stat Med. 1999 Jul 30;18(14):1879-95; dicussion 1897. doi: 10.1002/(sici)1097-0258(19990730)18:14<1879::aid-sim225>3.0.co;2-6.

引用本文的文献

1
Time Series of Counts under Censoring: A Bayesian Approach.删失情形下计数的时间序列:一种贝叶斯方法。
Entropy (Basel). 2023 Mar 23;25(4):549. doi: 10.3390/e25040549.
2
Infinite hidden Markov models for multiple multivariate time series with missing data.无限隐马尔可夫模型用于具有缺失数据的多个多元时间序列。
Biometrics. 2023 Sep;79(3):2592-2604. doi: 10.1111/biom.13715. Epub 2022 Jul 22.
3
Strategies for Imputation of High-Resolution Environmental Data in Clinical Randomized Controlled Trials.
Int J Environ Res Public Health. 2022 Jan 24;19(3):1307. doi: 10.3390/ijerph19031307.
4
Right-Censored Time Series Modeling by Modified Semi-Parametric A-Spline Estimator.基于修正半参数A样条估计器的右删失时间序列建模
Entropy (Basel). 2021 Nov 27;23(12):1586. doi: 10.3390/e23121586.
5
A Model-Based Approach to Detection Limits in Studying Environmental Exposures and Human Fecundity.一种基于模型的方法用于研究环境暴露与人类生育力中的检测限
Stat Biosci. 2019 Dec;11(3):524-547. doi: 10.1007/s12561-019-09243-5. Epub 2019 Jun 7.
6
Optimal pseudolikelihood estimation in the analysis of multivariate missing data with nonignorable nonresponse.具有不可忽视的无应答的多元缺失数据分析中的最优伪似然估计。
Biometrika. 2018 Jun;105(2):479-486. doi: 10.1093/biomet/asy007. Epub 2018 Feb 28.
7
Statistical methods for characterizing transfusion-related changes in regional oxygenation using near-infrared spectroscopy (NIRS) in preterm infants.使用近红外光谱法(NIRS)表征早产儿输血相关区域氧合变化的统计方法。
Stat Methods Med Res. 2019 Sep;28(9):2710-2723. doi: 10.1177/0962280218786302. Epub 2018 Jul 12.
8
A Bayesian Approach for Summarizing and Modeling Time-Series Exposure Data with Left Censoring.贝叶斯方法在左删失时间序列暴露数据中的总结与建模。
Ann Work Expo Health. 2017 Aug 1;61(7):773-783. doi: 10.1093/annweh/wxx046.
9
Developing standardized corticosteroid treatment for Duchenne muscular dystrophy.开发针对杜氏肌营养不良症的标准化皮质类固醇治疗方法。
Contemp Clin Trials. 2017 Jul;58:34-39. doi: 10.1016/j.cct.2017.04.008. Epub 2017 Apr 24.
10
Effect of Coenzyme Q on Biomarkers of Oxidative Stress and Cardiac Function in Hemodialysis Patients: The CoQ Biomarker Trial.辅酶Q对血液透析患者氧化应激生物标志物和心脏功能的影响:辅酶Q生物标志物试验
Am J Kidney Dis. 2017 Mar;69(3):389-399. doi: 10.1053/j.ajkd.2016.08.041. Epub 2016 Dec 4.