• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

提高临床研究信息学工具中的数据质量

Improving Data Quality in Clinical Research Informatics Tools.

作者信息

AbuHalimeh Ahmed

机构信息

Information Science Department, University of Arkansas at Little Rock, Little Rock, AR, United States.

出版信息

Front Big Data. 2022 Apr 29;5:871897. doi: 10.3389/fdata.2022.871897. eCollection 2022.

DOI:10.3389/fdata.2022.871897
PMID:35574572
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9102971/
Abstract

Maintaining data quality is a fundamental requirement for any successful and long-term data management. Providing high-quality, reliable, and statistically sound data is a primary goal for clinical research informatics. In addition, effective data governance and management are essential to ensuring accurate data counts, reports, and validation. As a crucial step of the clinical research process, it is important to establish and maintain organization-wide standards for data quality management to ensure consistency across all systems designed primarily for cohort identification, allowing users to perform an enterprise-wide search on a clinical research data repository to determine the existence of a set of patients meeting certain inclusion or exclusion criteria. Some of the clinical research tools are referred to as de-identified data tools. Assessing and improving the quality of data used by clinical research informatics tools are both important and difficult tasks. For an increasing number of users who rely on information as one of their most important assets, enforcing high data quality levels represents a strategic investment to preserve the value of the data. In clinical research informatics, better data quality translates into better research results and better patient care. However, achieving high-quality data standards is a major task because of the variety of ways that errors might be introduced in a system and the difficulty of correcting them systematically. Problems with data quality tend to fall into two categories. The first category is related to inconsistency among data resources such as format, syntax, and semantic inconsistencies. The second category is related to poor ETL and data mapping processes. In this paper, we describe a real-life case study on assessing and improving the data quality at one of healthcare organizations. This paper compares between the results obtained from two de-identified data systems i2b2, and Epic Slicedicer, and discuss the data quality dimensions' specific to the clinical research informatics context, and the possible data quality issues between the de-identified systems. This work in paper aims to propose steps/rules for maintaining the data quality among different systems to help data managers, information systems teams, and informaticists at any health care organization to monitor and sustain data quality as part of their business intelligence, data governance, and data democratization processes.

摘要

维护数据质量是任何成功且长期的数据管理的基本要求。提供高质量、可靠且统计合理的数据是临床研究信息学的主要目标。此外,有效的数据治理和管理对于确保准确的数据计数、报告和验证至关重要。作为临床研究过程的关键步骤,建立并维护全组织范围的数据质量管理标准很重要,以确保主要用于队列识别的所有系统之间的一致性,允许用户在临床研究数据存储库上进行全企业范围的搜索,以确定是否存在符合某些纳入或排除标准的一组患者。一些临床研究工具被称为去标识化数据工具。评估和提高临床研究信息学工具所使用数据的质量既是重要任务也是困难任务。对于越来越多将信息视为其最重要资产之一的用户而言,实施高数据质量水平是一项保护数据价值的战略投资。在临床研究信息学中,更好的数据质量转化为更好的研究结果和更好的患者护理。然而,实现高质量数据标准是一项重大任务,因为系统中可能引入错误的方式多种多样,且系统地纠正这些错误存在困难。数据质量问题往往分为两类。第一类与数据资源之间的不一致有关,如格式、语法和语义不一致。第二类与不良的ETL和数据映射过程有关。在本文中,我们描述了一个关于评估和提高一家医疗保健机构数据质量的实际案例研究。本文比较了从两个去标识化数据系统i2b2和Epic Slicedicer获得的结果,并讨论了临床研究信息学背景下特定的数据质量维度,以及去标识化系统之间可能存在的数据质量问题。本文的工作旨在提出在不同系统之间维护数据质量的步骤/规则,以帮助任何医疗保健机构的数据管理人员、信息系统团队和信息学家将监测和维持数据质量作为其商业智能、数据治理和数据民主化过程的一部分。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b20/9102971/0d2be9c63cf9/fdata-05-871897-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b20/9102971/0d2be9c63cf9/fdata-05-871897-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9b20/9102971/0d2be9c63cf9/fdata-05-871897-g0001.jpg

相似文献

1
Improving Data Quality in Clinical Research Informatics Tools.提高临床研究信息学工具中的数据质量
Front Big Data. 2022 Apr 29;5:871897. doi: 10.3389/fdata.2022.871897. eCollection 2022.
2
The future of Cochrane Neonatal.考克兰新生儿协作网的未来。
Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.
3
Critical Care Network in the State of Qatar.卡塔尔国重症监护网络。
Qatar Med J. 2019 Nov 7;2019(2):2. doi: 10.5339/qmj.2019.qccc.2. eCollection 2019.
4
The Effectiveness of Integrated Care Pathways for Adults and Children in Health Care Settings: A Systematic Review.综合护理路径在医疗环境中对成人和儿童的有效性:一项系统评价。
JBI Libr Syst Rev. 2009;7(3):80-129. doi: 10.11124/01938924-200907030-00001.
5
How has the impact of 'care pathway technologies' on service integration in stroke care been measured and what is the strength of the evidence to support their effectiveness in this respect?“护理路径技术”对卒中护理服务整合的影响是如何衡量的,以及有哪些证据支持其在这方面的有效性?
Int J Evid Based Healthc. 2008 Mar;6(1):78-110. doi: 10.1111/j.1744-1609.2007.00098.x.
6
[A proposal for reforming psychologists' training in France and in the European Union].[关于法国及欧盟心理学家培训改革的一项提议]
Encephale. 2009 Feb;35(1):18-24. doi: 10.1016/j.encep.2007.11.008. Epub 2008 Apr 2.
7
Tuberculosis结核病
8
Ethical Use of Electronic Health Record Data and Artificial Intelligence: Recommendations of the Primary Care Informatics Working Group of the International Medical Informatics Association.电子健康记录数据和人工智能的伦理使用:国际医学信息学协会初级保健信息学工作组的建议。
Yearb Med Inform. 2020 Aug;29(1):51-57. doi: 10.1055/s-0040-1701980. Epub 2020 Apr 17.
9
The patient experience of patient-centered communication with nurses in the hospital setting: a qualitative systematic review protocol.医院环境中患者与护士以患者为中心的沟通体验:一项定性系统评价方案
JBI Database System Rev Implement Rep. 2015 Jan;13(1):76-87. doi: 10.11124/jbisrir-2015-1072.
10
Six Sigma: not for the faint of heart.六西格玛:并非胆小者所能驾驭。
Radiol Manage. 2003 Mar-Apr;25(2):40-53.

引用本文的文献

1
Enhancing Gen3 for clinical trial time series analytics and data discovery: a data commons framework for NIH clinical trials.增强Gen3用于临床试验时间序列分析和数据发现:美国国立卫生研究院临床试验的数据共享框架
Front Digit Health. 2025 Jul 23;7:1570009. doi: 10.3389/fdgth.2025.1570009. eCollection 2025.
2
Examine frameworks policies and strategies for effective information governance in healthcare organizations.审视医疗保健组织中有效信息治理的框架、政策和策略。
PLoS One. 2025 Jul 11;20(7):e0327496. doi: 10.1371/journal.pone.0327496. eCollection 2025.
3
Discrepancies in Aggregate Patient Data between Two Sources with Data Originating from the Same Electronic Health Record: A Case Study.
来自同一电子健康记录的两个数据源之间患者总体数据的差异:一项案例研究。
Appl Clin Inform. 2025 Jan;16(1):137-144. doi: 10.1055/a-2441-3677. Epub 2025 Feb 12.
4
Impact of COVID-19 in pregnancy on maternal and perinatal outcomes during the Delta variant period: a comparison of the Delta and pre-delta time periods, 2020-2021.2020 - 2021年德尔塔变异株流行期间,新冠病毒感染孕妇对母婴结局的影响:德尔塔变异株与德尔塔变异株流行前时期的比较
Matern Health Neonatol Perinatol. 2024 Oct 1;10(1):20. doi: 10.1186/s40748-024-00189-1.
5
Frameworks, Dimensions, Definitions of Aspects, and Assessment Methods for the Appraisal of Quality of Health Data for Secondary Use: Comprehensive Overview of Reviews.二次使用健康数据质量评估的框架、维度、方面定义及评估方法:综述的全面概述
JMIR Med Inform. 2024 Mar 6;12:e51560. doi: 10.2196/51560.
6
Clinical Informatics needs to be a competency for Intensive care training.临床信息学应成为重症监护培训的一项技能要求。
Crit Care Resusc. 2023 May 20;25(1):6-8. doi: 10.1016/j.ccrj.2023.04.003. eCollection 2023 Mar.