• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

队列诊断:使用人群水平特征在观察性数据源网络中进行表型评估。

CohortDiagnostics: Phenotype evaluation across a network of observational data sources using population-level characterization.

作者信息

Rao Gowtham A, Shoaibi Azza, Makadia Rupa, Hardin Jill, Swerdel Joel, Weaver James, Voss Erica A, Conover Mitchell M, Fortin Stephen, Sena Anthony G, Knoll Chris, Hughes Nigel, Gilbert James P, Blacketer Clair, Andryc Alan, DeFalco Frank, Molinaro Anthony, Reps Jenna, Schuemie Martijn J, Ryan Patrick B

机构信息

Observational Health Data Analytics, Janssen Research and Development, LLC, Titusville, NJ, United States of America.

OHDSI Collaborators, Observational Health Data Sciences and Informatics (OHDSI), New York, NY, United States of America.

出版信息

PLoS One. 2025 Jan 16;20(1):e0310634. doi: 10.1371/journal.pone.0310634. eCollection 2025.

DOI:10.1371/journal.pone.0310634
PMID:39820599
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11737733/
Abstract

OBJECTIVE

This paper introduces a novel framework for evaluating phenotype algorithms (PAs) using the open-source tool, Cohort Diagnostics.

MATERIALS AND METHODS

The method is based on several diagnostic criteria to evaluate a patient cohort returned by a PA. Diagnostics include estimates of incidence rate, index date entry code breakdown, and prevalence of all observed clinical events prior to, on, and after index date. We test our framework by evaluating one PA for systemic lupus erythematosus (SLE) and two PAs for Alzheimer's disease (AD) across 10 different observational data sources.

RESULTS

By utilizing CohortDiagnostics, we found that the population-level characteristics of individuals in the cohort of SLE closely matched the disease's anticipated clinical profile. Specifically, the incidence rate of SLE was consistently higher in occurrence among females. Moreover, expected clinical events like laboratory tests, treatments, and repeated diagnoses were also observed. For AD, although one PA identified considerably fewer patients, absence of notable differences in clinical characteristics between the two cohorts suggested similar specificity.

DISCUSSION

We provide a practical and data-driven approach to evaluate PAs, using two clinical diseases as examples, across a network of OMOP data sources. Cohort Diagnostics can ensure the subjects identified by a specific PA align with those intended for inclusion in a research study.

CONCLUSION

Diagnostics based on large-scale population-level characterization can offer insights into the misclassification errors of PAs.

摘要

目的

本文介绍一种使用开源工具“队列诊断”(Cohort Diagnostics)评估表型算法(PAs)的新框架。

材料与方法

该方法基于多个诊断标准来评估PA返回的患者队列。诊断内容包括发病率估计、索引日期录入代码细分,以及索引日期之前、当日和之后所有观察到的临床事件的患病率。我们通过在10个不同的观察性数据源中评估一种系统性红斑狼疮(SLE)的PA和两种阿尔茨海默病(AD)的PA来测试我们的框架。

结果

通过使用队列诊断,我们发现SLE队列中个体的人群水平特征与该疾病预期的临床概况密切匹配。具体而言,SLE的发病率在女性中始终较高。此外,还观察到了如实验室检查、治疗和重复诊断等预期的临床事件。对于AD,尽管一种PA识别出的患者数量少得多,但两个队列在临床特征上没有显著差异,表明特异性相似。

讨论

我们以两种临床疾病为例,在OMOP数据源网络中提供了一种实用的、数据驱动的方法来评估PA。队列诊断可以确保特定PA识别的受试者与研究中打算纳入的受试者一致。

结论

基于大规模人群水平特征的诊断可以深入了解PA的错误分类误差。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f7/11737733/44e7252dd504/pone.0310634.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f7/11737733/c708e5192a5f/pone.0310634.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f7/11737733/2c7b13434ab5/pone.0310634.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f7/11737733/8d1749eba0e8/pone.0310634.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f7/11737733/44e7252dd504/pone.0310634.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f7/11737733/c708e5192a5f/pone.0310634.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f7/11737733/2c7b13434ab5/pone.0310634.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f7/11737733/8d1749eba0e8/pone.0310634.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c4f7/11737733/44e7252dd504/pone.0310634.g004.jpg

相似文献

1
CohortDiagnostics: Phenotype evaluation across a network of observational data sources using population-level characterization.队列诊断:使用人群水平特征在观察性数据源网络中进行表型评估。
PLoS One. 2025 Jan 16;20(1):e0310634. doi: 10.1371/journal.pone.0310634. eCollection 2025.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
Using a data-driven approach for the development and evaluation of phenotype algorithms for systemic lupus erythematosus.运用数据驱动方法开发和评估系统性红斑狼疮表型算法。
PLoS One. 2023 Feb 16;18(2):e0281929. doi: 10.1371/journal.pone.0281929. eCollection 2023.
4
A population-based assessment of systemic lupus erythematosus incidence and prevalence--results and implications of using administrative data for epidemiological studies.基于人群的系统性红斑狼疮发病率和患病率评估——使用行政数据进行流行病学研究的结果及意义。
Rheumatology (Oxford). 2007 Dec;46(12):1814-8. doi: 10.1093/rheumatology/kem233.
5
Incidence and prevalence of adult systemic lupus erythematosus in a large US managed-care population.美国大型管理式医疗人群中成人系统性红斑狼疮的发病率和患病率。
Lupus. 2013 Jan;22(1):99-105. doi: 10.1177/0961203312463110. Epub 2012 Oct 5.
6
Validating claims-based algorithms for a systemic lupus erythematosus diagnosis in Medicare data for informed use of the Lupus Index: a tool for geospatial research.验证医疗保险数据中基于索赔的系统性红斑狼疮诊断算法,以便明智地使用狼疮指数:一种用于地理空间研究的工具。
Lupus Sci Med. 2024 Oct 14;11(2):e001329. doi: 10.1136/lupus-2024-001329.
7
Mortality and cardiovascular burden of systemic lupus erythematosus in a US population-based cohort.美国人群队列中系统性红斑狼疮的死亡率和心血管负担
J Rheumatol. 2014 Apr;41(4):680-7. doi: 10.3899/jrheum.130874. Epub 2014 Feb 15.
8
Mortality and disease related comorbidities in systemic lupus erythematosus: Data from an Egyptian cohort.系统性红斑狼疮患者的死亡率及相关合并症:来自埃及队列的数据。
Lupus. 2022 Apr;31(5):628-636. doi: 10.1177/09612033221081691. Epub 2022 Mar 20.
9
Systemic lupus erythematosus in Denmark: clinical and epidemiological characterization of a county-based cohort.丹麦的系统性红斑狼疮:基于一个县的队列的临床和流行病学特征
Scand J Rheumatol. 1998;27(2):98-105. doi: 10.1080/030097498440958.
10
Association of systemic lupus erythematosus and sleep disorders: a nationwide population-based cohort study.系统性红斑狼疮与睡眠障碍的关联:一项基于全国人口的队列研究。
Lupus. 2016 Apr;25(4):382-8. doi: 10.1177/0961203315617843. Epub 2015 Nov 18.

引用本文的文献

1
Multi-domain rule-based phenotyping algorithms enable improved GWAS signal.基于多领域规则的表型分析算法可增强全基因组关联研究(GWAS)信号。
NPJ Digit Med. 2025 Aug 2;8(1):499. doi: 10.1038/s41746-025-01815-8.
2
Variation in Phenotype Definitions in Observational Clinical Research: A Review of Three Conditions.观察性临床研究中表型定义的差异:三种疾病的综述
AMIA Annu Symp Proc. 2025 May 22;2024:1021-1029. eCollection 2024.
3
Objective study validity diagnostics: a framework requiring pre-specified, empirical verification to increase trust in the reliability of real-world evidence.

本文引用的文献

1
Standardised and Reproducible Phenotyping Using Distributed Analytics and Tools in the Data Analysis and Real World Interrogation Network (DARWIN EU).使用数据分析和真实世界探究网络(DARWIN EU)中的分布式分析和工具进行标准化和可重现的表型分析。
Pharmacoepidemiol Drug Saf. 2024 Nov;33(11):e70042. doi: 10.1002/pds.70042.
2
Phenotyping in distributed data networks: selecting the right codes for the right patients.分布式数据网络中的表型分析:为合适的患者选择合适的编码。
AMIA Annu Symp Proc. 2023 Apr 29;2022:826-835. eCollection 2022.
3
Reproducible variability: assessing investigator discordance across 9 research teams attempting to reproduce the same observational study.
客观研究有效性诊断:一个需要预先指定的实证验证的框架,以增强对真实世界证据可靠性的信任。
J Am Med Inform Assoc. 2025 Mar 1;32(3):518-525. doi: 10.1093/jamia/ocae317.
可重现的变异性:评估 9 个研究团队在试图重现相同的观察性研究时的调查员意见分歧。
J Am Med Inform Assoc. 2023 Apr 19;30(5):859-868. doi: 10.1093/jamia/ocad009.
4
Using a data-driven approach for the development and evaluation of phenotype algorithms for systemic lupus erythematosus.运用数据驱动方法开发和评估系统性红斑狼疮表型算法。
PLoS One. 2023 Feb 16;18(2):e0281929. doi: 10.1371/journal.pone.0281929. eCollection 2023.
5
PheValuator 2.0: Methodological improvements for the PheValuator approach to semi-automated phenotype algorithm evaluation.PheValuator 2.0:用于半自动化表型算法评估的 PheValuator 方法的方法学改进。
J Biomed Inform. 2022 Nov;135:104177. doi: 10.1016/j.jbi.2022.104177. Epub 2022 Aug 19.
6
Characterising the background incidence rates of adverse events of special interest for covid-19 vaccines in eight countries: multinational network cohort study.描述 8 个国家/地区新冠病毒疫苗特殊关注不良事件的背景发生率:跨国网络队列研究。
BMJ. 2021 Jun 14;373:n1435. doi: 10.1136/bmj.n1435.
7
Phenotype Algorithms for the Identification and Characterization of Vaccine-Induced Thrombotic Thrombocytopenia in Real World Data: A Multinational Network Cohort Study.基于真实世界数据的疫苗诱导性血栓性血小板减少症表型算法的鉴定和特征描述:一项多国网络队列研究。
Drug Saf. 2022 Jun;45(6):685-698. doi: 10.1007/s40264-022-01187-y. Epub 2022 Jun 2.
8
Unraveling COVID-19: A Large-Scale Characterization of 4.5 Million COVID-19 Cases Using CHARYBDIS.解析新冠病毒:使用CHARYBDIS对450万例新冠病例进行大规模特征分析。
Clin Epidemiol. 2022 Mar 22;14:369-384. doi: 10.2147/CLEP.S323292. eCollection 2022.
9
Validation of Claims Algorithms to Identify Alzheimer's Disease and Related Dementias.阿尔茨海默病及相关痴呆症理赔算法验证。
J Gerontol A Biol Sci Med Sci. 2022 Jun 1;77(6):1261-1271. doi: 10.1093/gerona/glab373.
10
Desiderata for the development of next-generation electronic health record phenotype libraries.下一代电子健康记录表型库发展的要点。
Gigascience. 2021 Sep 11;10(9). doi: 10.1093/gigascience/giab059.