观察性研究中基于常规收集数据来填补健康状况的算法的开发、验证和评估指南（DEVELOP-RCD）。

Guidance of development, validation, and evaluation of algorithms for populating health status in observational studies of routinely collected data (DEVELOP-RCD).

机构信息

Institute of Integrated Traditional Chinese and Western Medicine, Chinese Evidence-Based Medicine and Cochrane China Center, West China Hospital, Sichuan University, Chengdu, 610041, China.

NMPA Key Laboratory for Real World Data Research and Evaluation in Hainan, Chengdu, 610041, China.

出版信息

Mil Med Res. 2024 Aug 6;11(1):52. doi: 10.1186/s40779-024-00559-y.

DOI:10.1186/s40779-024-00559-y

PMID:39107834

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11302358/

Abstract

BACKGROUND

In recent years, there has been a growing trend in the utilization of observational studies that make use of routinely collected healthcare data (RCD). These studies rely on algorithms to identify specific health conditions (e.g. diabetes or sepsis) for statistical analyses. However, there has been substantial variation in the algorithm development and validation, leading to frequently suboptimal performance and posing a significant threat to the validity of study findings. Unfortunately, these issues are often overlooked.

METHODS

We systematically developed guidance for the development, validation, and evaluation of algorithms designed to identify health status (DEVELOP-RCD). Our initial efforts involved conducting both a narrative review and a systematic review of published studies on the concepts and methodological issues related to algorithm development, validation, and evaluation. Subsequently, we conducted an empirical study on an algorithm for identifying sepsis. Based on these findings, we formulated specific workflow and recommendations for algorithm development, validation, and evaluation within the guidance. Finally, the guidance underwent independent review by a panel of 20 external experts who then convened a consensus meeting to finalize it.

RESULTS

A standardized workflow for algorithm development, validation, and evaluation was established. Guided by specific health status considerations, the workflow comprises four integrated steps: assessing an existing algorithm's suitability for the target health status; developing a new algorithm using recommended methods; validating the algorithm using prescribed performance measures; and evaluating the impact of the algorithm on study results. Additionally, 13 good practice recommendations were formulated with detailed explanations. Furthermore, a practical study on sepsis identification was included to demonstrate the application of this guidance.

CONCLUSIONS

The establishment of guidance is intended to aid researchers and clinicians in the appropriate and accurate development and application of algorithms for identifying health status from RCD. This guidance has the potential to enhance the credibility of findings from observational studies involving RCD.

摘要

背景

近年来，利用常规收集的医疗保健数据（RCD）进行观察性研究的趋势日益增长。这些研究依赖于算法来识别特定的健康状况（例如糖尿病或败血症）进行统计分析。然而，算法的开发和验证存在很大差异，导致性能经常不理想，并对研究结果的有效性构成重大威胁。不幸的是，这些问题往往被忽视。

方法

我们系统地制定了用于开发、验证和评估旨在识别健康状况的算法的指南（DEVELOP-RCD）。我们的初步工作包括对与算法开发、验证和评估相关的概念和方法学问题进行叙述性综述和系统综述。随后，我们对用于识别败血症的算法进行了实证研究。基于这些发现，我们在指南中制定了算法开发、验证和评估的具体工作流程和建议。最后，该指南由 20 名外部专家组成的小组进行了独立审查，然后召开了共识会议对其进行了最终确定。

结果

建立了算法开发、验证和评估的标准化工作流程。在特定健康状况考虑因素的指导下，工作流程包括四个集成步骤：评估现有算法对目标健康状况的适用性；使用推荐方法开发新算法；使用规定的性能指标验证算法；以及评估算法对研究结果的影响。此外，还制定了 13 条良好实践建议，并附有详细说明。此外，还包括一项关于败血症识别的实际研究，以展示该指南的应用。

结论

该指南的建立旨在帮助研究人员和临床医生适当地、准确地开发和应用从 RCD 中识别健康状况的算法。该指南有可能提高涉及 RCD 的观察性研究结果的可信度。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1910/11302358/329041931f77/40779_2024_559_Fig1_HTML.jpg

相似文献

Guidance of development, validation, and evaluation of algorithms for populating health status in observational studies of routinely collected data (DEVELOP-RCD).观察性研究中基于常规收集数据来填补健康状况的算法的开发、验证和评估指南（DEVELOP-RCD）。

Mil Med Res. 2024 Aug 6;11(1):52. doi: 10.1186/s40779-024-00559-y.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区，服用抗叶酸抗疟药物的人群中，叶酸补充剂与疟疾易感性和严重程度的关系。

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

Validation and impact of algorithms for identifying variables in observational studies of routinely collected data.常规收集数据的观察性研究中变量识别算法的验证与影响

J Clin Epidemiol. 2024 Feb;166:111232. doi: 10.1016/j.jclinepi.2023.111232. Epub 2023 Dec 1.

Development and Evaluation of the Algorithm CErtaInty Tool (ACE-IT) to Assess Electronic Medical Record and Claims-based Algorithms' Fit for Purpose for Safety Outcomes.开发和评估算法确定性工具（ACE-IT），以评估电子病历和基于索赔的算法在安全性结果方面的适用性。

Drug Saf. 2023 Jan;46(1):87-97. doi: 10.1007/s40264-022-01254-4. Epub 2022 Nov 17.

The reporting of studies using routinely collected health data was often insufficient.使用常规收集的健康数据进行研究的报告往往不够充分。

J Clin Epidemiol. 2016 Nov;79:104-111. doi: 10.1016/j.jclinepi.2016.06.005. Epub 2016 Jun 23.

The future of Cochrane Neonatal.考克兰新生儿协作网的未来。

Early Hum Dev. 2020 Nov;150:105191. doi: 10.1016/j.earlhumdev.2020.105191. Epub 2020 Sep 12.

A systematic review of fall prediction models for community-dwelling older adults: comparison between models based on research cohorts and models based on routinely collected data.社区居住的老年人跌倒预测模型的系统评价：基于研究队列的模型与基于常规收集数据的模型之间的比较。

Age Ageing. 2024 Jul 2;53(7). doi: 10.1093/ageing/afae131.

Real-world research and the role of observational data in the field of gynaecology - a practical review.真实世界研究及观察性数据在妇科领域的作用——实践综述

Eur J Contracept Reprod Health Care. 2017 Aug;22(4):250-259. doi: 10.1080/13625187.2017.1361528. Epub 2017 Aug 17.

引用本文的文献

Evaluating the agreement between sensitivity and primary analyses in observational studies using routinely collected healthcare data: a meta-epidemiology study.使用常规收集的医疗保健数据评估观察性研究中敏感性分析与主要分析之间的一致性：一项元流行病学研究。

BMC Med. 2025 Jul 1;23(1):393. doi: 10.1186/s12916-025-04199-4.

Validity and accuracy of artificial intelligence-based dietary intake assessment methods: a systematic review.基于人工智能的膳食摄入量评估方法的有效性和准确性：一项系统综述。

Br J Nutr. 2025 May 14;133(9):1241-1253. doi: 10.1017/S0007114525000522. Epub 2025 Apr 10.

Methods for identifying health status from routinely collected health data: An overview.从常规收集的健康数据中识别健康状况的方法：概述。

Integr Med Res. 2025 Mar;14(1):101100. doi: 10.1016/j.imr.2024.101100. Epub 2024 Nov 15.

Air pollution and risk of 32 health conditions: outcome-wide analyses in a population-based prospective cohort in Southwest China.空气污染与 32 种健康状况的风险：中国西南部基于人群的前瞻性队列研究中的全结局分析。

BMC Med. 2024 Sep 11;22(1):370. doi: 10.1186/s12916-024-03596-5.

本文引用的文献

Validation and impact of algorithms for identifying variables in observational studies of routinely collected data.常规收集数据的观察性研究中变量识别算法的验证与影响

J Clin Epidemiol. 2024 Feb;166:111232. doi: 10.1016/j.jclinepi.2023.111232. Epub 2023 Dec 1.

Quantitative bias analysis of prevalence under misclassification: evaluation indicators, calculation method and case analysis.定量偏倚分析在错误分类下的患病率：评价指标、计算方法及案例分析。

Int J Epidemiol. 2023 Jun 6;52(3):942-951. doi: 10.1093/ije/dyac239.

Core concepts in pharmacoepidemiology: Validation of health outcomes of interest within real-world healthcare databases.药物流行病学的核心概念：在真实医疗保健数据库中验证感兴趣的健康结果。

Pharmacoepidemiol Drug Saf. 2023 Jan;32(1):1-8. doi: 10.1002/pds.5537. Epub 2022 Sep 14.

Prospective, multi-site study of patient outcomes after implementation of the TREWS machine learning-based early warning system for sepsis.采用 TREWS 机器学习为基础的脓毒症早期预警系统后，对患者预后的前瞻性、多中心研究。

Nat Med. 2022 Jul;28(7):1455-1460. doi: 10.1038/s41591-022-01894-0. Epub 2022 Jul 21.

Studies of diagnostic test accuracy: Partial verification bias and test result-based sampling.诊断试验准确性研究：部分验证偏倚和基于检验结果的抽样。

J Clin Epidemiol. 2022 May;145:179-182. doi: 10.1016/j.jclinepi.2022.01.022. Epub 2022 Feb 3.

Global, regional, and national burden of kidney, bladder, and prostate cancers and their attributable risk factors, 1990-2019.全球、区域和国家的肾脏、膀胱和前列腺癌负担及其归因风险因素，1990-2019 年。

Mil Med Res. 2021 Nov 24;8(1):60. doi: 10.1186/s40779-021-00354-z.

Monte Carlo Simulation Approaches for Quantitative Bias Analysis: A Tutorial.蒙特卡罗模拟方法在定量偏倚分析中的应用：教程。

Epidemiol Rev. 2022 Jan 14;43(1):106-117. doi: 10.1093/epirev/mxab012.

Drug exposure misclassification in pharmacoepidemiology: Sources and relative impact.药物暴露错误分类在药物流行病学中的来源和相对影响。

Pharmacoepidemiol Drug Saf. 2021 Dec;30(12):1703-1715. doi: 10.1002/pds.5346. Epub 2021 Sep 7.

Data mining in clinical big data: the frequently used databases, steps, and methodological models.临床大数据中的数据挖掘：常用数据库、步骤和方法学模型。

Mil Med Res. 2021 Aug 11;8(1):44. doi: 10.1186/s40779-021-00338-z.

A systematic review of quantitative bias analysis applied to epidemiological research.对应用于流行病学研究的定量偏倚分析的系统评价。

Int J Epidemiol. 2021 Nov 10;50(5):1708-1730. doi: 10.1093/ije/dyab061.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

观察性研究中基于常规收集数据来填补健康状况的算法的开发、验证和评估指南（DEVELOP-RCD）。

Guidance of development, validation, and evaluation of algorithms for populating health status in observational studies of routinely collected data (DEVELOP-RCD).

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献