Evaluating latent class models with conditional dependence in record linkage.

Authors

Daggy Joanne, Xu Huiping, Hui Siu, Grannis Shaun

Affiliation

Department of Biostatistics, Indiana University School of Medicine, Indianapolis, IN, 46202, U.S.A.

Publication information

Stat Med. 2014 Oct 30;33(24):4250-65. doi: 10.1002/sim.6230. Epub 2014 Jun 17.

DOI: 10.1002/sim.6230
PMID: 24935712

Abstract

Record linkage methods commonly use a traditional latent class model to classify record pairs from different sources as true matches or non-matches. This approach was first formally described by Fellegi and Sunter and assumes that the agreement in fields is independent conditional on the latent class. Consequences of violating the conditional independence assumption include bias in parameter estimates from the model. We sought to further characterize the impact of conditional dependence on the overall misclassification rate, sensitivity, and positive predictive value in the record linkage problem when the conditional independence assumption is violated. Additionally, we evaluate various methods to account for the conditional dependence. These methods include loglinear models with appropriate interaction terms identified through the correlation residual plot as well as Gaussian random effects models. The proposed models are used to link newborn screening data obtained from a health information exchange. On the basis of simulations, loglinear models with interaction terms demonstrated the best misclassification rate, although this type of model cannot accommodate other data features such as continuous measures for agreement. Results indicate that Gaussian random effects models, which can handle additional data features, perform better than assuming conditional independence and in some situations perform as well as the loglinear model with interaction terms.
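
As a rough illustration of the baseline the abstract describes, the sketch below fits a two-class latent class model under the conditional independence assumption using EM, then scores the resulting classification by misclassification rate, sensitivity, and positive predictive value. It is only a minimal sketch: the function names (`simulate_pairs`, `match_posterior`, `fit_em`, `linkage_metrics`), the starting values, and the 0.5 posterior threshold are assumptions made for illustration and are not taken from the paper. The loglinear and Gaussian random effects models that the authors evaluate are not implemented here.

```python
import math
import random


def simulate_pairs(n_pairs, p_match, m, u, seed=0):
    """Simulate binary field-agreement patterns under conditional independence.

    m[j] = P(field j agrees | true match), u[j] = P(field j agrees | non-match).
    """
    rng = random.Random(seed)
    patterns, is_match = [], []
    for _ in range(n_pairs):
        matched = rng.random() < p_match
        probs = m if matched else u
        patterns.append([1 if rng.random() < q else 0 for q in probs])
        is_match.append(matched)
    return patterns, is_match


def match_posterior(pattern, p, m, u):
    """Posterior probability that a record pair is a true match."""
    like_m = p * math.prod(mj if a else 1.0 - mj for a, mj in zip(pattern, m))
    like_u = (1.0 - p) * math.prod(uj if a else 1.0 - uj for a, uj in zip(pattern, u))
    return like_m / (like_m + like_u)


def fit_em(patterns, n_iter=100, p=0.1, m_init=0.9, u_init=0.1, eps=1e-6):
    """EM for the two-class model; starting values are illustrative assumptions."""
    n, k = len(patterns), len(patterns[0])
    m, u = [m_init] * k, [u_init] * k

    def clamp(x):
        # Keep probabilities strictly inside (0, 1) to avoid degenerate likelihoods.
        return min(max(x, eps), 1.0 - eps)

    for _ in range(n_iter):
        # E-step: posterior match probability for every record pair.
        post = [match_posterior(g, p, m, u) for g in patterns]
        total = sum(post)
        # M-step: posterior-weighted agreement frequencies per field.
        p = clamp(total / n)
        m = [clamp(sum(w * g[j] for w, g in zip(post, patterns)) / total)
             for j in range(k)]
        u = [clamp(sum((1.0 - w) * g[j] for w, g in zip(post, patterns)) / (n - total))
             for j in range(k)]
    return p, m, u


def linkage_metrics(patterns, is_match, p, m, u, threshold=0.5):
    """Misclassification rate, sensitivity, and PPV against known truth."""
    tp = fp = fn = tn = 0
    for g, truth in zip(patterns, is_match):
        predicted = match_posterior(g, p, m, u) >= threshold
        if predicted and truth:
            tp += 1
        elif predicted and not truth:
            fp += 1
        elif truth:
            fn += 1
        else:
            tn += 1
    return {
        "misclassification": (fp + fn) / len(patterns),
        "sensitivity": tp / (tp + fn) if tp + fn else float("nan"),
        "ppv": tp / (tp + fp) if tp + fp else float("nan"),
    }
```

A typical use would be to call `simulate_pairs` with chosen agreement probabilities, pass the patterns to `fit_em`, and compare the estimates and `linkage_metrics` output against the simulated truth. Because `simulate_pairs` draws fields independently within each class, it does not reproduce the conditional dependence the paper studies; introducing correlated fields into the simulation would be the natural next step for observing the bias the abstract mentions.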

Similar articles

1. Evaluating latent class models with conditional dependence in record linkage.
   Stat Med. 2014 Oct 30;33(24):4250-65. doi: 10.1002/sim.6230. Epub 2014 Jun 17.
2. A practical approach for incorporating dependence among fields in probabilistic record linkage.
   BMC Med Inform Decis Mak. 2013 Aug 30;13:97. doi: 10.1186/1472-6947-13-97.
3. The Data-Adaptive Fellegi-Sunter Model for Probabilistic Record Linkage: Algorithm Development and Validation for Incorporating Missing Data and Field Selection.
   J Med Internet Res. 2022 Sep 29;24(9):e33775. doi: 10.2196/33775.
4. Random effects models in latent class analysis for evaluating accuracy of diagnostic tests.
   Biometrics. 1996 Sep;52(3):797-810.
5. Estimating sensitivity and specificity of diagnostic tests using latent class models that account for conditional dependence between tests: a simulation study.
   BMC Med Res Methodol. 2023 Mar 10;23(1):58. doi: 10.1186/s12874-023-01873-0.
6. A probit latent class model with general correlation structures for evaluating accuracy of diagnostic tests.
   Biometrics. 2009 Dec;65(4):1145-55. doi: 10.1111/j.1541-0420.2008.01194.x.
7. Automated linkage of patient records from disparate sources.
   Stat Methods Med Res. 2018 Jan;27(1):172-184. doi: 10.1177/0962280215626180. Epub 2016 Jul 20.
8. A new computationally efficient algorithm for record linkage with field dependency and missing data imputation.
   Int J Med Inform. 2018 Jan;109:70-75. doi: 10.1016/j.ijmedinf.2017.10.021. Epub 2017 Nov 6.
9. Variable selection for latent class analysis in the presence of missing data with application to record linkage.
   Stat Methods Med Res. 2024 Jun;33(6):966-980. doi: 10.1177/09622802241242317. Epub 2024 Apr 9.
10. Latent variable modeling of diagnostic accuracy.
    Biometrics. 1997 Sep;53(3):948-58.

Cited by

1. Detecting departures from the conditional independence assumption in diagnostic latent class models: a simulation study.
   BMC Med Res Methodol. 2024 Dec 5;24(1):299. doi: 10.1186/s12874-024-02432-x.
2. A simple two-step procedure using the Fellegi-Sunter model for frequency-based record linkage.
   J Appl Stat. 2021 May 4;49(11):2789-2804. doi: 10.1080/02664763.2021.1922615. eCollection 2022.
3. Evaluation of real-world referential and probabilistic patient matching to advance patient identification strategy.
   J Am Med Inform Assoc. 2022 Jul 12;29(8):1409-1415. doi: 10.1093/jamia/ocac068.
4. Evaluating the effect of data standardization and validation on patient matching accuracy.
   J Am Med Inform Assoc. 2019 May 1;26(5):447-456. doi: 10.1093/jamia/ocy191.
5. Embracing the Sparse, Noisy, and Interrelated Aspects of Patient Demographics for use in Clinical Medical Record Linkage.
   AMIA Jt Summits Transl Sci Proc. 2015 Mar 25;2015:425-9. eCollection 2015.