Suppr超能文献

人类免疫缺陷病毒(HIV)队列和电子健康记录数据中多变量的错误:统计挑战与机遇

Errors in multiple variables in human immunodeficiency virus (HIV) cohort and electronic health record data: statistical challenges and opportunities.

作者信息

Shepherd Bryan E, Shaw Pamela A

机构信息

Biostatistics, Vanderbilt University, 2525 West End, Suite 11000, 37203Nashville, Tennessee, USA.

Biostatistics, Epidemiology, and Informatics, University of Pennsylvania, Philadelphia, Pennsylvania, USA.

出版信息

Stat Commun Infect Dis. 2020 Oct 7;12(Suppl1):20190015. doi: 10.1515/scid-2019-0015. eCollection 2020 Sep 1.

Abstract

Observational data derived from patient electronic health records (EHR) data are increasingly used for human immunodeficiency virus/acquired immunodeficiency syndrome (HIV/AIDS) research. There are challenges to using these data, in particular with regards to data quality; some are recognized, some unrecognized, and some recognized but ignored. There are great opportunities for the statistical community to improve inference by incorporating validation subsampling into analyses of EHR data. Methods to address measurement error, misclassification, and missing data are relevant, as are sampling designs such as two-phase sampling. However, many of the existing statistical methods for measurement error, for example, only address relatively simple settings, whereas the errors seen in these datasets span multiple variables (both predictors and outcomes), are correlated, and even affect who is included in the study. We will discuss some preliminary methods in this area with a particular focus on time-to-event outcomes and outline areas of future research.

摘要

从患者电子健康记录(EHR)数据中获取的观察性数据越来越多地用于人类免疫缺陷病毒/获得性免疫缺陷综合征(HIV/AIDS)研究。使用这些数据存在挑战,尤其是在数据质量方面;有些挑战已被认识到,有些未被认识到,还有些虽被认识到但被忽视了。统计界有很大的机会通过将验证子抽样纳入EHR数据分析来改进推断。解决测量误差、错误分类和缺失数据的方法很重要,诸如两阶段抽样等抽样设计也很重要。然而,许多现有的测量误差统计方法,例如,仅适用于相对简单的情况,而这些数据集中出现的误差跨越多个变量(预测变量和结果变量),相互关联,甚至会影响研究的纳入对象。我们将讨论该领域的一些初步方法,特别关注事件发生时间结局,并概述未来的研究领域。

相似文献

本文引用的文献

4
Optimal Designs of Two-Phase Studies.两阶段研究的最优设计
J Am Stat Assoc. 2020;115(532):1946-1959. doi: 10.1080/01621459.2019.1671200. Epub 2019 Oct 29.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验