Shang Ning, Weng Chunhua, Hripcsak George
Department of Biomedical Informatics, Columbia University, New York, NY, USA.
J Am Med Inform Assoc. 2018 Mar 1;25(3):248-258. doi: 10.1093/jamia/ocx095.
To contribute a conceptual framework for evaluating data suitability to satisfy the research needs of observational studies.
Suitability considerations were derived from a systematic literature review on researchers' common data needs in observational studies and a scoping review on frequent clinical database design considerations, and were harmonized to construct a suitability conceptual framework using a bottom-up approach. The relationships among the suitability categories are explored from the perspective of 4 facets of data: intrinsic, contextual, representational, and accessible. A web-based national survey of domain experts was conducted to validate the framework.
Data suitability for observational studies hinges on the following key categories: Explicitness of Policy and Data Governance, Relevance, Availability of Descriptive Metadata and Provenance Documentation, Usability, and Quality. We describe 16 measures and 33 sub-measures. The survey uncovered the relevance of all categories, with a 5-point Likert importance score of 3.9 ± 1.0 for Explicitness of Policy and Data Governance, 4.1 ± 1.0 for Relevance, 3.9 ± 0.9 for Availability of Descriptive Metadata and Provenance Documentation, 4.2 ± 1.0 for Usability, and 4.0 ± 0.9 for Quality.
The suitability framework evaluates a clinical data source's fitness for research use. Its construction reflects both researchers' points of view and data custodians' design features. The feedback from domain experts rated Usability, Relevance, and Quality categories as the most important considerations.
构建一个概念框架,用于评估数据的适用性,以满足观察性研究的研究需求。
适用性考量源于对观察性研究中研究人员常见数据需求的系统文献综述以及对常见临床数据库设计考量的范围综述,并采用自下而上的方法进行协调,以构建适用性概念框架。从数据的四个方面:内在、背景、表示和可访问性,探讨了适用性类别之间的关系。对领域专家进行了基于网络的全国性调查,以验证该框架。
观察性研究的数据适用性取决于以下关键类别:政策与数据治理的明确性、相关性、描述性元数据和来源文档的可用性、可用性和质量。我们描述了16项措施和33项子措施。调查揭示了所有类别的相关性,政策与数据治理明确性的5点李克特重要性评分为3.9±1.0,相关性为4.1±1.0,描述性元数据和来源文档可用性为3.9±0.9,可用性为4.2±1.0,质量为4.0±0.9。
适用性框架评估临床数据源用于研究的适用性。其构建既反映了研究人员的观点,也反映了数据保管人的设计特点。领域专家的反馈将可用性、相关性和质量类别评为最重要的考量因素。