Suppr超能文献

评估从医院提取的临床和行政数据的质量:综合内科住院患者倡议(GEMINI)的经验。

Assessing the quality of clinical and administrative data extracted from hospitals: the General Medicine Inpatient Initiative (GEMINI) experience.

机构信息

Li Ka Shing Knowledge Institute, St. Michael's Hospital, Toronto, Ontario, Canada.

Department of Medicine, University of Toronto, Toronto, Ontario, Canada.

出版信息

J Am Med Inform Assoc. 2021 Mar 1;28(3):578-587. doi: 10.1093/jamia/ocaa225.

Abstract

OBJECTIVE

Large clinical databases are increasingly used for research and quality improvement. We describe an approach to data quality assessment from the General Medicine Inpatient Initiative (GEMINI), which collects and standardizes administrative and clinical data from hospitals.

METHODS

The GEMINI database contained 245 559 patient admissions at 7 hospitals in Ontario, Canada from 2010 to 2017. We performed 7 computational data quality checks and iteratively re-extracted data from hospitals to correct problems. Thereafter, GEMINI data were compared to data that were manually abstracted from the hospital's electronic medical record for 23 419 selected data points on a sample of 7488 patients.

RESULTS

Computational checks flagged 103 potential data quality issues, which were either corrected or documented to inform future analysis. For example, we identified the inclusion of canceled radiology tests, a time shift of transfusion data, and mistakenly processing the chemical symbol for sodium ("Na") as a missing value. Manual validation identified 1 important data quality issue that was not detected by computational checks: transfusion dates and times at 1 site were unreliable. Apart from that single issue, across all data tables, GEMINI data had high overall accuracy (ranging from 98%-100%), sensitivity (95%-100%), specificity (99%-100%), positive predictive value (93%-100%), and negative predictive value (99%-100%) compared to the gold standard.

DISCUSSION AND CONCLUSION

Computational data quality checks with iterative re-extraction facilitated reliable data collection from hospitals but missed 1 critical quality issue. Combining computational and manual approaches may be optimal for assessing the quality of large multisite clinical databases.

摘要

目的

越来越多的大型临床数据库被用于研究和质量改进。我们描述了一种从综合内科住院患者倡议(GEMINI)中评估数据质量的方法,该倡议从医院收集和标准化管理及临床数据。

方法

GEMINI 数据库包含了 2010 年至 2017 年期间加拿大安大略省 7 家医院的 245559 例患者入院记录。我们进行了 7 项计算数据质量检查,并从医院迭代地重新提取数据以纠正问题。此后,将 GEMINI 数据与从医院电子病历中手动提取的 7488 名患者中 23419 个选定数据点的样本数据进行了比较。

结果

计算检查标记了 103 个潜在的数据质量问题,这些问题要么得到纠正,要么被记录下来,以便为未来的分析提供信息。例如,我们发现包括已取消的放射学检查、输血数据的时间移位以及将化学符号“Na”错误处理为缺失值。手动验证发现了一个计算检查未检测到的重要数据质量问题:1 个地点的输血日期和时间不可靠。除了这一个问题外,在所有数据表中,GEMINI 数据的总体准确性(范围为 98%-100%)、敏感性(95%-100%)、特异性(99%-100%)、阳性预测值(93%-100%)和阴性预测值(99%-100%)都高于金标准。

讨论和结论

具有迭代重新提取功能的计算数据质量检查有助于从医院可靠地收集数据,但错过了 1 个关键质量问题。结合计算和手动方法可能是评估大型多站点临床数据库质量的最佳方法。

相似文献

引用本文的文献

本文引用的文献

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验