一个交互式的适用性数据完整性工具，用于评估活动跟踪器数据。

An interactive fitness-for-use data completeness tool to assess activity tracker data.

机构信息

Department of Biomedical Informatics, Columbia University, New York, New York, USA.

Department of Artificial Intelligence and Human Health, Icahn School of Medicine, New York, New York, USA.

出版信息

J Am Med Inform Assoc. 2022 Nov 14;29(12):2032-2040. doi: 10.1093/jamia/ocac166.

DOI:10.1093/jamia/ocac166

PMID:36173371

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9667174/

Abstract

OBJECTIVE

To design and evaluate an interactive data quality (DQ) characterization tool focused on fitness-for-use completeness measures to support researchers' assessment of a dataset.

MATERIALS AND METHODS

Design requirements were identified through a conceptual framework on DQ, literature review, and interviews. The prototype of the tool was developed based on the requirements gathered and was further refined by domain experts. The Fitness-for-Use Tool was evaluated through a within-subjects controlled experiment comparing it with a baseline tool that provides information on missing data based on intrinsic DQ measures. The tools were evaluated on task performance and perceived usability.

RESULTS

The Fitness-for-Use Tool allows users to define data completeness by customizing the measures and its thresholds to fit their research task and provides a data summary based on the customized definition. Using the Fitness-for-Use Tool, study participants were able to accurately complete fitness-for-use assessment in less time than when using the Intrinsic DQ Tool. The study participants perceived that the Fitness-for-Use Tool was more useful in determining the fitness-for-use of a dataset than the Intrinsic DQ Tool.

DISCUSSION

Incorporating fitness-for-use measures in a DQ characterization tool could provide data summary that meets researchers needs. The design features identified in this study has potential to be applied to other biomedical data types.

CONCLUSION

A tool that summarizes a dataset in terms of fitness-for-use dimensions and measures specific to a research question supports dataset assessment better than a tool that only presents information on intrinsic DQ measures.

摘要

目的

设计和评估一种交互式数据质量（DQ）特征描述工具，该工具专注于可用性完整性措施，以支持研究人员对数据集的评估。

材料与方法

通过 DQ 概念框架、文献回顾和访谈确定了设计要求。根据收集到的要求开发了工具原型，并由领域专家进一步改进。通过一项内部对照实验对可用性工具进行了评估，该实验将其与提供基于内在 DQ 措施的缺失数据信息的基线工具进行了比较。在任务绩效和感知可用性方面评估了这些工具。

结果

可用性工具允许用户通过自定义措施及其阈值来定义数据完整性，以适应其研究任务，并根据自定义定义提供数据摘要。使用可用性工具，研究参与者能够在比使用内在 DQ 工具更短的时间内准确完成可用性评估。研究参与者认为，可用性工具比内在 DQ 工具更有助于确定数据集的可用性。

讨论

在 DQ 特征描述工具中纳入可用性措施可以提供满足研究人员需求的数据摘要。本研究中确定的设计特点有可能应用于其他生物医学数据类型。

结论

一种能够根据特定研究问题的可用性维度和措施总结数据集的工具，比仅提供内在 DQ 措施信息的工具更能支持数据集评估。

相似文献

An interactive fitness-for-use data completeness tool to assess activity tracker data.一个交互式的适用性数据完整性工具，用于评估活动跟踪器数据。

J Am Med Inform Assoc. 2022 Nov 14;29(12):2032-2040. doi: 10.1093/jamia/ocac166.

Identifying Data Quality Dimensions for Person-Generated Wearable Device Data: Multi-Method Study.确定个人生成的可穿戴设备数据的数据质量维度：多方法研究。

JMIR Mhealth Uhealth. 2021 Dec 23;9(12):e31618. doi: 10.2196/31618.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区，服用抗叶酸抗疟药物的人群中，叶酸补充剂与疟疾易感性和严重程度的关系。

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

DQAgui: a graphical user interface for the MIRACUM data quality assessment tool.DQAgui：MIRACUM 数据质量评估工具的图形用户界面。

BMC Med Inform Decis Mak. 2022 Aug 11;22(1):213. doi: 10.1186/s12911-022-01961-z.

Assessing the practice of data quality evaluation in a national clinical data research network through a systematic scoping review in the era of real-world data.在真实世界数据时代，通过系统的范围综述评估国家临床数据研究网络中的数据质量评估实践。

J Am Med Inform Assoc. 2020 Dec 9;27(12):1999-2010. doi: 10.1093/jamia/ocaa245.

Consumer-Based Activity Trackers as a Tool for Physical Activity Monitoring in Epidemiological Studies During the COVID-19 Pandemic: Development and Usability Study.基于消费者的活动追踪器在 COVID-19 大流行期间作为流行病学研究中身体活动监测工具的开发和可用性研究。

JMIR Public Health Surveill. 2021 Apr 23;7(4):e23806. doi: 10.2196/23806.

Quality assessment of real-world data repositories across the data life cycle: A literature review.贯穿数据生命周期的真实世界数据存储库质量评估：文献综述。

J Am Med Inform Assoc. 2021 Jul 14;28(7):1591-1599. doi: 10.1093/jamia/ocaa340.

The relationship between electronic health records user interface features and data quality of patient clinical information: an integrative review.电子健康记录用户界面特征与患者临床信息数据质量之间的关系：综合述评。

J Am Med Inform Assoc. 2023 Dec 22;31(1):240-255. doi: 10.1093/jamia/ocad188.

Physical Activity Trend eXtraction: A Framework for Extracting Moderate-Vigorous Physical Activity Trends From Wearable Fitness Tracker Data.体力活动趋势提取：从可穿戴健身追踪器数据中提取中等至剧烈体力活动趋势的框架。

JMIR Mhealth Uhealth. 2019 Mar 12;7(3):e11075. doi: 10.2196/11075.

引用本文的文献

Evaluating and Enhancing the Fitness-for-Purpose of Electronic Health Record Data: Qualitative Study on Current Practices and Pathway to an Automated Approach Within the Medical Informatics for Research and Care in University Medicine Consortium.评估和提高电子健康记录数据的适用性：大学医学联合会研究与护理医学信息学中当前实践及自动化方法途径的定性研究

JMIR Med Inform. 2024 Aug 19;12:e57153. doi: 10.2196/57153.

本文引用的文献

Identifying Data Quality Dimensions for Person-Generated Wearable Device Data: Multi-Method Study.确定个人生成的可穿戴设备数据的数据质量维度：多方法研究。

JMIR Mhealth Uhealth. 2021 Dec 23;9(12):e31618. doi: 10.2196/31618.

Factors Affecting the Quality of Person-Generated Wearable Device Data and Associated Challenges: Rapid Systematic Review.影响可穿戴设备数据质量的因素及相关挑战：快速系统综述。

JMIR Mhealth Uhealth. 2021 Mar 19;9(3):e20738. doi: 10.2196/20738.

Harnessing wearable device data to improve state-level real-time surveillance of influenza-like illness in the USA: a population-based study.利用可穿戴设备数据改善美国州级实时流感样疾病监测：一项基于人群的研究。

Lancet Digit Health. 2020 Feb;2(2):e85-e93. doi: 10.1016/S2589-7500(19)30222-5. Epub 2020 Jan 16.

Confidence intervals for difference in proportions for matched pairs compatible with exact McNemar's or sign tests.适用于精确 McNemar 或符号检验的配对设计率差值的置信区间。

Stat Med. 2021 Feb 28;40(5):1147-1159. doi: 10.1002/sim.8829. Epub 2020 Dec 1.

Wearable sensor data and self-reported symptoms for COVID-19 detection.可穿戴传感器数据和自我报告症状用于 COVID-19 检测。

Nat Med. 2021 Jan;27(1):73-77. doi: 10.1038/s41591-020-1123-x. Epub 2020 Oct 29.

The "All of Us" Research Program.“All of Us”研究计划。

N Engl J Med. 2019 Aug 15;381(7):668-676. doi: 10.1056/NEJMsr1809937.

Reporting Data Quality Assessment Results: Identifying Individual and Organizational Barriers and Solutions.报告数据质量评估结果：识别个人和组织层面的障碍及解决方案。

EGEMS (Wash DC). 2017 Sep 4;5(1):16. doi: 10.5334/egems.214.

A Framework for Data Quality Assessment in Clinical Research Datasets.临床研究数据集数据质量评估框架

AMIA Annu Symp Proc. 2018 Apr 16;2017:1080-1089. eCollection 2017.

Beyond fitness tracking: The use of consumer-grade wearable data from normal volunteers in cardiovascular and lipidomics research.超越健身追踪：正常志愿者的消费级可穿戴数据在心血管和脂质组学研究中的应用。

PLoS Biol. 2018 Feb 27;16(2):e2004285. doi: 10.1371/journal.pbio.2004285. eCollection 2018 Feb.

Real World Home Blood Pressure Variability in Over 56,000 Individuals With Nearly 17 Million Measurements.超过 56000 人、近 1700 万次测量的真实世界家庭血压变异性。

Am J Hypertens. 2018 Apr 13;31(5):566-573. doi: 10.1093/ajh/hpx221.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验