PheValuator 2.0：用于半自动化表型算法评估的 PheValuator 方法的方法学改进。

PheValuator 2.0: Methodological improvements for the PheValuator approach to semi-automated phenotype algorithm evaluation.

机构信息

Janssen Research and Development, Titusville, NJ, USA; Observational Health Data Sciences and Informatics (OHDSI), New York, NY.

出版信息

J Biomed Inform. 2022 Nov;135:104177. doi: 10.1016/j.jbi.2022.104177. Epub 2022 Aug 19.

DOI:10.1016/j.jbi.2022.104177

PMID:35995107

Abstract

PURPOSE

Phenotype algorithms are central to performing analyses using observational data. These algorithms translate the clinical idea of a health condition into an executable set of rules allowing for queries of data elements from a database. PheValuator, a software package in the Observational Health Data Sciences and Informatics (OHDSI) tool stack, provides a method to assess the performance characteristics of these algorithms, namely, sensitivity, specificity, and positive and negative predictive value. It uses machine learning to develop predictive models for determining a probabilistic gold standard of subjects for assessment of cases and non-cases of health conditions. PheValuator was developed to complement or even replace the traditional approach of algorithm validation, i.e., by expert assessment of subject records through chart review. Results in our first PheValuator paper suggest a systematic underestimation of the PPV compared to previous results using chart review. In this paper we evaluate modifications made to the method designed to improve its performance.

METHODS

The major changes to PheValuator included allowing all diagnostic conditions, clinical observations, drug prescriptions, and laboratory measurements to be included as predictors within the modeling process whereas in the prior version there were significant restrictions on the included predictors. We also have allowed for the inclusion of the temporal relationships of the predictors in the model. To evaluate the performance of the new method, we compared the results from the new and original methods against results found from the literature using traditional validation of algorithms for 19 phenotypes. We performed these tests using data from five commercial databases.

RESULTS

In the assessment aggregating all phenotype algorithms, the median difference between the PheValuator estimate and the gold standard estimate for PPV was reduced from -21 (IQR -34, -3) in Version 1.0 to 4 (IQR -3, 15) using Version 2.0. We found a median difference in specificity of 3 (IQR 1, 4.25) for Version 1.0 and 3 (IQR 1, 4) for Version 2.0. The median difference between the two versions of PheValuator and the gold standard for estimates of sensitivity was reduced from -39 (-51, -20) to -16 (-34, -6).

CONCLUSION

PheValuator 2.0 produces estimates for the performance characteristics for phenotype algorithms that are significantly closer to estimates from traditional validation through chart review compared to version 1.0. With this tool in researcher's toolkits, methods, such as quantitative bias analysis, may now be used to improve the reliability and reproducibility of research studies using observational data.

摘要

目的

表型算法是使用观察数据进行分析的核心。这些算法将健康状况的临床概念转化为可执行的规则集，允许对数据库中的数据元素进行查询。在观察性健康数据科学和信息学（OHDSI）工具集中，PheValuator 软件包提供了一种评估这些算法性能特征的方法，即敏感性、特异性、阳性和阴性预测值。它使用机器学习为确定健康状况病例和非病例的受试者概率金标准开发预测模型。PheValuator 的开发旨在补充甚至替代传统的算法验证方法，即通过专家对病历进行图表审查来评估。我们的第一篇 PheValuator 论文中的结果表明，与使用图表审查的先前结果相比，PPV 存在系统性低估。在本文中，我们评估了为提高其性能而对方法进行的修改。

方法

对 PheValuator 的主要更改包括允许将所有诊断条件、临床观察、药物处方和实验室测量作为预测因子包含在建模过程中，而在以前的版本中，对包含的预测因子有重大限制。我们还允许在模型中包含预测因子的时间关系。为了评估新方法的性能，我们将新方法和原始方法的结果与使用传统算法验证的 19 种表型的文献结果进行了比较。我们使用来自五个商业数据库的数据进行了这些测试。

结果

在评估汇总所有表型算法的结果时，使用新版本 2.0 时，PPV 的 PheValuator 估计值与黄金标准估计值之间的中位数差异从版本 1.0 的-21（IQR-34，-3）减少到 4（IQR-3，15）。我们发现版本 1.0 的特异性中位数差异为 3（IQR1,4.25），版本 2.0 的特异性中位数差异为 3（IQR1,4）。与旧版本相比，PheValuator 新版本 2.0 与黄金标准之间的敏感性估计值的中位数差异从-39（-51，-20）减少到-16（-34，-6）。

结论

与版本 1.0 相比，PheValuator 2.0 生成的表型算法性能特征估计值与通过图表审查进行的传统验证估计值更为接近。有了这个工具在研究人员的工具包中，现在可以使用定量偏差分析等方法来提高使用观察数据的研究的可靠性和可重复性。

相似文献

PheValuator 2.0: Methodological improvements for the PheValuator approach to semi-automated phenotype algorithm evaluation.PheValuator 2.0：用于半自动化表型算法评估的 PheValuator 方法的方法学改进。

J Biomed Inform. 2022 Nov;135:104177. doi: 10.1016/j.jbi.2022.104177. Epub 2022 Aug 19.

PheValuator: Development and evaluation of a phenotype algorithm evaluator.PheValuator：表型算法评估器的开发与评估。

J Biomed Inform. 2019 Sep;97:103258. doi: 10.1016/j.jbi.2019.103258. Epub 2019 Jul 29.

Comparing broad and narrow phenotype algorithms: differences in performance characteristics and immortal time incurred.比较宽表型和窄表型算法：性能特征差异和所导致的永恒时间。

J Pharm Pharm Sci. 2024 Jan 3;26:12095. doi: 10.3389/jpps.2023.12095. eCollection 2023.

Performance characteristics of code-based algorithms to identify urinary tract infections in large United States administrative claims databases.基于代码算法在大型美国行政索赔数据库中识别尿路感染的性能特征。

Pharmacoepidemiol Drug Saf. 2022 Sep;31(9):953-962. doi: 10.1002/pds.5492. Epub 2022 Jul 4.

Phenotype Algorithms to Identify Hidradenitis Suppurativa Using Real-World Data: Development and Validation Study.利用真实世界数据识别化脓性汗腺炎的表型算法：开发与验证研究

JMIR Dermatol. 2022 Nov 30;5(4):e38783. doi: 10.2196/38783.

Using a data-driven approach for the development and evaluation of phenotype algorithms for systemic lupus erythematosus.运用数据驱动方法开发和评估系统性红斑狼疮表型算法。

PLoS One. 2023 Feb 16;18(2):e0281929. doi: 10.1371/journal.pone.0281929. eCollection 2023.

Identifying lupus patients in electronic health records: Development and validation of machine learning algorithms and application of rule-based algorithms.在电子健康记录中识别狼疮患者：机器学习算法的开发和验证以及基于规则算法的应用。

Semin Arthritis Rheum. 2019 Aug;49(1):84-90. doi: 10.1016/j.semarthrit.2019.01.002. Epub 2019 Jan 4.

Automated feature selection of predictors in electronic medical records data.电子病历数据中预测指标的自动特征选择

Biometrics. 2019 Mar;75(1):268-277. doi: 10.1111/biom.12987. Epub 2019 Apr 2.

Adapting electronic health records-derived phenotypes to claims data: Lessons learned in using limited clinical data for phenotyping.使源自电子健康记录的表型适应理赔数据：利用有限临床数据进行表型分析的经验教训。

J Biomed Inform. 2020 Feb;102:103363. doi: 10.1016/j.jbi.2019.103363. Epub 2019 Dec 19.

Automatic generation of case-detection algorithms to identify children with asthma from large electronic health record databases.从大型电子健康记录数据库中自动生成病例检测算法以识别哮喘儿童。

Pharmacoepidemiol Drug Saf. 2013 Aug;22(8):826-33. doi: 10.1002/pds.3438. Epub 2013 Apr 17.

引用本文的文献

Semaglutide and Nonarteritic Anterior Ischemic Optic Neuropathy.司美格鲁肽与非动脉炎性前部缺血性视神经病变

JAMA Ophthalmol. 2025 Apr 1;143(4):304-314. doi: 10.1001/jamaophthalmol.2024.6555.

CohortDiagnostics: Phenotype evaluation across a network of observational data sources using population-level characterization.队列诊断：使用人群水平特征在观察性数据源网络中进行表型评估。

PLoS One. 2025 Jan 16;20(1):e0310634. doi: 10.1371/journal.pone.0310634. eCollection 2025.

Objective study validity diagnostics: a framework requiring pre-specified, empirical verification to increase trust in the reliability of real-world evidence.客观研究有效性诊断：一个需要预先指定的实证验证的框架，以增强对真实世界证据可靠性的信任。

J Am Med Inform Assoc. 2025 Mar 1;32(3):518-525. doi: 10.1093/jamia/ocae317.

The necessity of validity diagnostics when drawing causal inferences from observational data: lessons from a multi-database evaluation of the risk of non-infectious uveitis among patients exposed to Remicade.从观察性数据得出因果推断时进行有效性诊断的必要性：来自一项针对接受类克治疗的患者发生非感染性葡萄膜炎风险的多数据库评估的经验教训。

BMC Med Res Methodol. 2024 Dec 27;24(1):322. doi: 10.1186/s12874-024-02428-7.

Authors' Response to Huang et al.'s Comment on "Serially Combining Epidemiological Designs Does Not Improve Overall Signal Detection in Vaccine Safety Surveillance".作者对黄等人就《在疫苗安全性监测中连续结合流行病学设计并不能提高总体信号检测能力》一文的评论的回应。

Drug Saf. 2024 Apr;47(4):403-404. doi: 10.1007/s40264-024-01411-x. Epub 2024 Mar 5.

Health-Analytics Data to Evidence Suite (HADES): Open-Source Software for Observational Research.健康分析数据到证据集 (HADES)：用于观察性研究的开源软件。

Stud Health Technol Inform. 2024 Jan 25;310:966-970. doi: 10.3233/SHTI231108.

J Pharm Pharm Sci. 2024 Jan 3;26:12095. doi: 10.3389/jpps.2023.12095. eCollection 2023.

Semi-supervised ROC analysis for reliable and streamlined evaluation of phenotyping algorithms.半监督 ROC 分析用于可靠且精简的表型算法评估。

J Am Med Inform Assoc. 2024 Feb 16;31(3):640-650. doi: 10.1093/jamia/ocad226.

JMIR Dermatol. 2022 Nov 30;5(4):e38783. doi: 10.2196/38783.

Using a data-driven approach for the development and evaluation of phenotype algorithms for systemic lupus erythematosus.运用数据驱动方法开发和评估系统性红斑狼疮表型算法。

PLoS One. 2023 Feb 16;18(2):e0281929. doi: 10.1371/journal.pone.0281929. eCollection 2023.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

PheValuator 2.0：用于半自动化表型算法评估的 PheValuator 方法的方法学改进。

PheValuator 2.0: Methodological improvements for the PheValuator approach to semi-automated phenotype algorithm evaluation.

机构信息

出版信息

PURPOSE

METHODS

RESULTS

CONCLUSION

目的

方法

结果

结论

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献