• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

[流行病学中的记录链接程序:一项意大利多中心研究]

[Record-linkage procedures in epidemiology: an Italian multicentre study].

作者信息

Fornari Carla, Madotto Fabiana, Demaria Moreno, Romanelli Anna, Pepe Pasquale, Raciti Mauro, Tancioni Valeria, Chini Francesco, Trerotoli Paolo, Bartolomeo Nicola, Serio Gabriella, Cesana Giancarlo, Corrao Giovanni

机构信息

Centro di studio e ricerca sulla patologia cronico-degenerativa negli ambienti di lavoro, Dipartimento di medicina clinica e prevenzione, Facoltà di medicina e chirurgia, Università degli studi di Milano Bicocca, Italy.

出版信息

Epidemiol Prev. 2008 May-Jun;32(3 Suppl):79-88.

PMID:18928241
Abstract

OBJECTIVE

To compare record linkage (RL) procedures adopted in several Italian settings and a standard probabilistic RL procedure for matching data from electronic health care databases.

DESIGN

Two health care archives are matched: the hospital discharges (HD) archive and the population registry of four Italian areas. Exact deterministic, stepwise deterministic techniques and a standard probabilistic RL procedure are applied to match HD for acute myocardial infarction (AMI) and diabetes mellitus. Sensitivity and specificity for RL procedures are estimated after manual review. Age and gender standardized annual hospitalization rates for AMI and diabetes are computed using different RL procedures and compared.

SETTING

Municipalities of Pisa and Roma, and Regions of Puglia and Piemonte.

PARTICIPANTS

Residents in the considered areas on 31 December 2003 and corresponding episodes of hospitalization in the same areas during 2004.

MAIN OUTCOME MEASURES

Measures of accuracy of RL procedures to match health care administrative databases.

RESULTS

Data quality varies among archives and affects the decision rule of the probabilistic procedure. A unique decision rule was therefore adopted by means of choosing a positive predictive value of at least 98% for all the considered areas. The number of matched pairs identified with the probabilistic procedure is on average more then 11% greater than the number identified with the deterministic procedure. Sensitivity of probabilistic RL is similar or greater than that of other procedures. Differences between annual standardized hospitalization rates computed with stepwise deterministic RL and the standard probabilistic RL procedure vary among areas.

CONCLUSION

Exact deterministic RL works well when unique identifiers and high quality data are available. The probabilistic procedure here proposed works as well as semi-deterministic RL when the latter implements a quality control of data or a manual review of final results. Otherwise, deterministic or semi-deterministic procedures imply classification errors of unknown size and direction.

摘要

目的

比较意大利多个地区采用的记录链接(RL)程序以及用于匹配电子医疗数据库数据的标准概率性RL程序。

设计

匹配两个医疗档案:医院出院(HD)档案和意大利四个地区的人口登记册。应用精确确定性、逐步确定性技术以及标准概率性RL程序来匹配急性心肌梗死(AMI)和糖尿病的HD数据。在人工审核后估计RL程序的敏感性和特异性。使用不同的RL程序计算AMI和糖尿病的年龄和性别标准化年度住院率并进行比较。

地点

比萨市和罗马市以及普利亚大区和皮埃蒙特大区。

参与者

2003年12月31日各相关地区的居民以及2004年同一地区相应的住院病例。

主要观察指标

匹配医疗管理数据库的RL程序的准确性指标。

结果

档案之间的数据质量各不相同,并且会影响概率性程序的决策规则。因此,通过为所有相关地区选择至少98%的阳性预测值,采用了一个统一的决策规则。概率性程序识别出的匹配对数量平均比确定性程序识别出的数量多11%以上。概率性RL的敏感性与其他程序相似或更高。逐步确定性RL和标准概率性RL程序计算出的年度标准化住院率之间的差异因地区而异。

结论

当有唯一标识符和高质量数据时,精确确定性RL效果良好。当半确定性RL实施数据质量控制或对最终结果进行人工审核时,这里提出的概率性程序与半确定性RL效果相当。否则,确定性或半确定性程序会导致大小和方向未知的分类错误。

相似文献

1
[Record-linkage procedures in epidemiology: an Italian multicentre study].[流行病学中的记录链接程序:一项意大利多中心研究]
Epidemiol Prev. 2008 May-Jun;32(3 Suppl):79-88.
2
[Objectives, tools and methods for an epidemiological use of electronic health archives in various areas of Italy].[意大利不同地区电子健康档案流行病学应用的目标、工具与方法]
Epidemiol Prev. 2008 May-Jun;32(3 Suppl):5-14.
3
[Diabetes prevalence estimated using a standard algorithm based on electronic health data in various areas of Italy].[采用基于意大利不同地区电子健康数据的标准算法估算的糖尿病患病率]
Epidemiol Prev. 2008 May-Jun;32(3 Suppl):15-21.
4
[Acute myocardial infarction incidence estimated using a standard algorithm based on electronic health data in different areas of Italy].[采用基于意大利不同地区电子健康数据的标准算法估算急性心肌梗死发病率]
Epidemiol Prev. 2008 May-Jun;32(3 Suppl):30-7.
5
[Exploiting electronic health archives for epidemiological purposes. An experience using a standardized approach to estimate diseases in various areas of Italy. Forward].[利用电子健康档案进行流行病学研究。一项采用标准化方法估计意大利各地区疾病情况的经验。前言]
Epidemiol Prev. 2008 May-Jun;32(3 Suppl):3.
6
A hybrid approach to record linkage using a combination of deterministic and probabilistic methodology.一种使用确定性和概率性方法相结合的混合记录链接方法。
J Am Med Inform Assoc. 2020 Apr 1;27(4):505-513. doi: 10.1093/jamia/ocz232.
7
[Inclusion of a deterministic post-processing stage to increase the performance of probabilistic record linkage].[纳入确定性后处理阶段以提高概率性记录链接的性能]
Cad Saude Publica. 2018 Jun 21;34(6):e00088117. doi: 10.1590/0102-311X00088117.
8
Linking mothers and infants within electronic health records: a comparison of deterministic and probabilistic algorithms.在电子健康记录中关联母婴:确定性算法与概率性算法的比较
Pharmacoepidemiol Drug Saf. 2015 Jan;24(1):45-51. doi: 10.1002/pds.3728. Epub 2014 Nov 18.
9
When to conduct probabilistic linkage vs. deterministic linkage? A simulation study.何时进行概率性连锁分析与确定性连锁分析?一项模拟研究。
J Biomed Inform. 2015 Aug;56:80-6. doi: 10.1016/j.jbi.2015.05.012. Epub 2015 May 22.
10
Combining Different Privacy-Preserving Record Linkage Methods for Hospital Admission Data.结合不同的隐私保护记录链接方法用于医院入院数据
Stud Health Technol Inform. 2017;235:161-165.

引用本文的文献

1
The natural history of idiopathic pulmonary fibrosis in a large European population: the role of age, sex and comorbidities.特发性肺纤维化在欧洲大人群中的自然史:年龄、性别和合并症的作用。
Intern Emerg Med. 2021 Oct;16(7):1793-1802. doi: 10.1007/s11739-021-02651-w. Epub 2021 Feb 14.
2
Cardiovascular diseases monitoring: lessons from population-based registries to address future opportunities and challenges in Europe.心血管疾病监测:基于人群的登记处带来的经验教训,以应对欧洲未来的机遇和挑战。
Arch Public Health. 2018 Jun 28;76:31. doi: 10.1186/s13690-018-0283-3. eCollection 2018.
3
Epidemiology of Idiopathic Pulmonary Fibrosis in Northern Italy.
意大利北部特发性肺纤维化的流行病学
PLoS One. 2016 Feb 3;11(2):e0147072. doi: 10.1371/journal.pone.0147072. eCollection 2016.
4
Burden of diabetes mellitus estimated with a longitudinal population-based study using administrative databases.使用行政数据库通过一项基于人群的纵向研究估算糖尿病负担。
PLoS One. 2014 Dec 3;9(12):e113741. doi: 10.1371/journal.pone.0113741. eCollection 2014.
5
Cardiorespiratory treatments as modifiers of the relationship between particulate matter and health: a case-only analysis on hospitalized patients in Italy.心肺治疗作为颗粒物与健康关系的调节因素:对意大利住院患者的单病例分析。
Environ Res. 2015 Jan;136:491-9. doi: 10.1016/j.envres.2014.09.007. Epub 2014 Nov 26.
6
Long-term prediction of major coronary or ischaemic stroke event in a low-incidence Southern European population: model development and evaluation of clinical utility.在低发南欧人群中预测主要冠脉或缺血性卒中事件的长期风险:模型建立和临床实用性评估。
BMJ Open. 2013 Nov 12;3(11):e003630. doi: 10.1136/bmjopen-2013-003630.