• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

手动评估四个真实世界数据集中文档链接算法性能。

Manual Evaluation of Record Linkage Algorithm Performance in Four Real-World Datasets.

机构信息

Indiana University School of Medicine, Indianapolis, Indiana, United States.

Department of Biostatistics, Indiana University School of Medicine, Indianapolis, Indiana, United States.

出版信息

Appl Clin Inform. 2024 May;15(3):620-628. doi: 10.1055/a-2291-1391. Epub 2024 Mar 20.

DOI:10.1055/a-2291-1391
PMID:38508580
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11290950/
Abstract

OBJECTIVES

Patient data are fragmented across multiple repositories, yielding suboptimal and costly care. Record linkage algorithms are widely accepted solutions for improving completeness of patient records. However, studies often fail to fully describe their linkage techniques. Further, while many frameworks evaluate record linkage methods, few focus on producing gold standard datasets. This highlights a need to assess these frameworks and their real-world performance. We use real-world datasets and expand upon previous frameworks to evaluate a consistent approach to the manual review of gold standard datasets and measure its impact on algorithm performance.

METHODS

We applied the framework, which includes elements for data description, reviewer training and adjudication, and software and reviewer descriptions, to four datasets. Record pairs were formed and between 15,000 and 16,500 records were randomly sampled from these pairs. After training, two reviewers determined match status for each record pair. If reviewers disagreed, a third reviewer was used for final adjudication.

RESULTS

Between the four datasets, the percent discordant rate ranged from 1.8 to 13.6%. While reviewers' discordance rate typically ranged between 1 and 5%, one exhibited a 59% discordance rate, showing the importance of the third reviewer. The original analysis was compared with three sensitivity analyses. The original analysis most often exhibited the highest predictive values compared with the sensitivity analyses.

CONCLUSION

Reviewers vary in their assessment of a gold standard, which can lead to variances in estimates for matching performance. Our analysis demonstrates how a multireviewer process can be applied to create gold standards, identify reviewer discrepancies, and evaluate algorithm performance.

摘要

目的

患者数据分散在多个存储库中,导致治疗效果不佳且成本高昂。记录链接算法是提高患者记录完整性的广泛接受的解决方案。然而,研究往往未能充分描述其链接技术。此外,虽然许多框架评估记录链接方法,但很少有框架专注于生成黄金标准数据集。这凸显了评估这些框架及其实际性能的必要性。我们使用真实数据集并扩展以前的框架,以评估一种一致的方法来手动审查黄金标准数据集,并衡量其对算法性能的影响。

方法

我们将框架应用于四个数据集,该框架包括用于数据描述、评审员培训和裁决以及软件和评审员描述的元素。从这些对中形成记录对,并随机抽取 15000 到 16500 条记录。培训后,两名评审员为每一对记录确定匹配状态。如果评审员意见不一致,则使用第三名评审员进行最终裁决。

结果

在这四个数据集之间,不一致率的百分比范围为 1.8%到 13.6%。虽然评审员的不一致率通常在 1%到 5%之间,但有一位评审员的不一致率达到 59%,这表明第三位评审员的重要性。原始分析与三种敏感性分析进行了比较。与敏感性分析相比,原始分析通常表现出最高的预测值。

结论

评审员对黄金标准的评估存在差异,这可能导致匹配性能的估计值存在差异。我们的分析演示了如何应用多评审员流程来创建黄金标准、识别评审员差异以及评估算法性能。

相似文献

1
Manual Evaluation of Record Linkage Algorithm Performance in Four Real-World Datasets.手动评估四个真实世界数据集中文档链接算法性能。
Appl Clin Inform. 2024 May;15(3):620-628. doi: 10.1055/a-2291-1391. Epub 2024 Mar 20.
2
Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.社区居住的老年人跌倒预防干预措施:系统评价和荟萃分析的益处、危害以及患者的价值观和偏好。
Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.
3
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
4
Automated devices for identifying peripheral arterial disease in people with leg ulceration: an evidence synthesis and cost-effectiveness analysis.用于识别下肢溃疡患者外周动脉疾病的自动化设备:证据综合和成本效益分析。
Health Technol Assess. 2024 Aug;28(37):1-158. doi: 10.3310/TWCG3912.
5
Search strategies to identify diagnostic accuracy studies in MEDLINE and EMBASE.在MEDLINE和EMBASE中识别诊断准确性研究的检索策略。
Cochrane Database Syst Rev. 2013 Sep 11;2013(9):MR000022. doi: 10.1002/14651858.MR000022.pub3.
6
Artificial intelligence for diagnosing exudative age-related macular degeneration.人工智能在渗出性年龄相关性黄斑变性诊断中的应用。
Cochrane Database Syst Rev. 2024 Oct 17;10(10):CD015522. doi: 10.1002/14651858.CD015522.pub2.
7
Education support services for improving school engagement and academic performance of children and adolescents with a chronic health condition.改善患有慢性病的儿童和青少年的学校参与度和学业成绩的教育支持服务。
Cochrane Database Syst Rev. 2023 Feb 8;2(2):CD011538. doi: 10.1002/14651858.CD011538.pub2.
8
Sexual Harassment and Prevention Training性骚扰与预防培训
9
Do peer reviewers comment on reporting items as instructed by the journal? A secondary analysis of two randomized trials.同行评审员是否按照期刊的要求对报告项目进行评论?两项随机试验的二次分析。
J Clin Epidemiol. 2025 May 8;183:111818. doi: 10.1016/j.jclinepi.2025.111818.
10
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of topotecan for ovarian cancer.拓扑替康治疗卵巢癌的临床有效性和成本效益的快速系统评价。
Health Technol Assess. 2001;5(28):1-110. doi: 10.3310/hta5280.

本文引用的文献

1
A framework for a consistent and reproducible evaluation of manual review for patient matching algorithms.用于对患者匹配算法的人工审核进行一致且可重现的评估的框架。
J Am Med Inform Assoc. 2022 Nov 14;29(12):2105-2109. doi: 10.1093/jamia/ocac175.
2
Real-World Matching Performance of Deidentified Record-Linking Tokens.真实世界中去标识记录链接标记的匹配性能。
Appl Clin Inform. 2022 Aug;13(4):865-873. doi: 10.1055/a-1910-4154. Epub 2022 Jul 27.
3
Validating the Matching of Patients in the Linkage of a Large Hospital System's EHR with State and National Death Databases.验证大型医院系统的电子健康记录与州和国家死亡数据库的患者匹配。
Appl Clin Inform. 2021 Jan;12(1):82-89. doi: 10.1055/s-0040-1722220. Epub 2021 Feb 10.
4
Accuracy of an Electronic Health Record Patient Linkage Module Evaluated between Neighboring Academic Health Care Centers.电子健康记录患者链接模块在相邻学术医疗中心间的准确性评估。
Appl Clin Inform. 2020 Oct;11(5):725-732. doi: 10.1055/s-0040-1718374. Epub 2020 Nov 4.
5
Data linkage in pharmacoepidemiology: A call for rigorous evaluation and reporting.药物流行病学中的数据链接:呼吁进行严格的评估和报告。
Pharmacoepidemiol Drug Saf. 2020 Jan;29(1):9-17. doi: 10.1002/pds.4924. Epub 2019 Nov 17.
6
Simple Workflow Changes Enable Effective Patient Identity Matching in Poison Control.简单的工作流程变更可实现中毒控制中有效的患者身份匹配。
Appl Clin Inform. 2018 Jul;9(3):553-557. doi: 10.1055/s-0038-1667000. Epub 2018 Jul 25.
7
Linkage of Indiana State Cancer Registry and Indiana Network for Patient Care Data.印第安纳州癌症登记处与印第安纳州患者护理数据网络的关联。
J Registry Manag. 2016 Winter;43(4):174-8.
8
A guide to evaluating linkage quality for the analysis of linked data.用于分析关联数据的链接质量评估指南。
Int J Epidemiol. 2017 Oct 1;46(5):1699-1710. doi: 10.1093/ije/dyx177.
9
GUILD: GUidance for Information about Linking Data sets.GUILD:数据集链接信息指南。
J Public Health (Oxf). 2018 Mar 1;40(1):191-198. doi: 10.1093/pubmed/fdx037.
10
Measuring Preventive Care Delivery: Comparing Rates Across Three Data Sources.衡量预防保健服务的提供:比较三个数据源的比率。
Am J Prev Med. 2016 Nov;51(5):752-761. doi: 10.1016/j.amepre.2016.07.004. Epub 2016 Aug 10.