• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

中等收入国家两个大型行政数据库的记录关联评估:巴西的死产与孕期登革热通报情况

Evaluation of record linkage of two large administrative databases in a middle income country: stillbirths and notifications of dengue during pregnancy in Brazil.

作者信息

Paixão Enny S, Harron Katie, Andrade Kleydson, Teixeira Maria Glória, Fiaccone Rosemeire L, Costa Maria da Conceição N, Rodrigues Laura C

机构信息

London School of Hygiene and Tropical Medicine, Keppel St, Bloomsbury, London, WC1E 7HT, UK.

Instituto de Saúde Coletiva, Rua Basílio da Gama, s/n.Canela, Salvador, Bahia, CEP 40110040, Brazil.

出版信息

BMC Med Inform Decis Mak. 2017 Jul 17;17(1):108. doi: 10.1186/s12911-017-0506-5.

DOI:10.1186/s12911-017-0506-5
PMID:28716074
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5513351/
Abstract

BACKGROUND

Due to the increasing availability of individual-level information across different electronic datasets, record linkage has become an efficient and important research tool. High quality linkage is essential for producing robust results. The objective of this study was to describe the process of preparing and linking national Brazilian datasets, and to compare the accuracy of different linkage methods for assessing the risk of stillbirth due to dengue in pregnancy.

METHODS

We linked mothers and stillbirths in two routinely collected datasets from Brazil for 2009-2010: for dengue in pregnancy, notifications of infectious diseases (SINAN); for stillbirths, mortality (SIM). Since there was no unique identifier, we used probabilistic linkage based on maternal name, age and municipality. We compared two probabilistic approaches, each with two thresholds: 1) a bespoke linkage algorithm; 2) a standard linkage software widely used in Brazil (ReclinkIII), and used manual review to identify further links. Sensitivity and positive predictive value (PPV) were estimated using a subset of gold-standard data created through manual review. We examined the characteristics of false-matches and missed-matches to identify any sources of bias.

RESULTS

From records of 678,999 dengue cases and 62,373 stillbirths, the gold-standard linkage identified 191 cases. The bespoke linkage algorithm with a conservative threshold produced 131 links, with sensitivity = 64.4% (68 missed-matches) and PPV = 92.5% (8 false-matches). Manual review of uncertain links identified an additional 37 links, increasing sensitivity to 83.7%. The bespoke algorithm with a relaxed threshold identified 132 true matches (sensitivity = 69.1%), but introduced 61 false-matches (PPV = 68.4%). ReclinkIII produced lower sensitivity and PPV than the bespoke linkage algorithm. Linkage error was not associated with any recorded study variables.

CONCLUSION

Despite a lack of unique identifiers for linking mothers and stillbirths, we demonstrate a high standard of linkage of large routine databases from a middle income country. Probabilistic linkage and manual review were essential for accurately identifying cases for a case-control study, but this approach may not be feasible for larger databases or for linkage of more common outcomes.

摘要

背景

由于不同电子数据集中个体层面信息的可得性不断提高,记录链接已成为一种高效且重要的研究工具。高质量的链接对于得出可靠结果至关重要。本研究的目的是描述巴西国家数据集的准备和链接过程,并比较不同链接方法在评估孕期登革热导致死产风险方面的准确性。

方法

我们将巴西2009 - 2010年两个常规收集的数据集中的母亲和死产记录进行了链接:关于孕期登革热,传染病通报(SINAN);关于死产,死亡率(SIM)。由于没有唯一标识符,我们基于母亲姓名、年龄和市采用概率链接。我们比较了两种概率方法,每种方法有两个阈值:1)一种定制链接算法;2)巴西广泛使用的标准链接软件(ReclinkIII),并通过人工审核来识别更多链接。使用通过人工审核创建的金标准数据子集估计敏感性和阳性预测值(PPV)。我们检查了错误匹配和漏匹配的特征,以识别任何偏差来源。

结果

从678,999例登革热病例和62,373例死产记录中,金标准链接识别出191例病例。具有保守阈值的定制链接算法产生了131个链接,敏感性 = 64.4%(68个漏匹配),PPV = 92.5%(8个错误匹配)。对不确定链接的人工审核又识别出37个链接,使敏感性提高到83.7%。具有宽松阈值的定制算法识别出132个真实匹配(敏感性 = 69.1%),但引入了61个错误匹配(PPV = 68.4%)。ReclinkIII产生的敏感性和PPV低于定制链接算法。链接错误与任何记录的研究变量均无关联。

结论

尽管缺乏用于链接母亲和死产的唯一标识符,但我们展示了一个中等收入国家大型常规数据库的高标准链接。概率链接和人工审核对于准确识别病例对照研究中的病例至关重要,但这种方法对于更大的数据库或更常见结局的链接可能不可行。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3310/5513351/1692035604ab/12911_2017_506_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3310/5513351/1692035604ab/12911_2017_506_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3310/5513351/1692035604ab/12911_2017_506_Fig1_HTML.jpg

相似文献

1
Evaluation of record linkage of two large administrative databases in a middle income country: stillbirths and notifications of dengue during pregnancy in Brazil.中等收入国家两个大型行政数据库的记录关联评估:巴西的死产与孕期登革热通报情况
BMC Med Inform Decis Mak. 2017 Jul 17;17(1):108. doi: 10.1186/s12911-017-0506-5.
2
Validating linkage of multiple population-based administrative databases in Brazil.验证巴西多个基于人群的行政数据库的关联性。
PLoS One. 2019 Mar 28;14(3):e0214050. doi: 10.1371/journal.pone.0214050. eCollection 2019.
3
Linking mothers and infants within electronic health records: a comparison of deterministic and probabilistic algorithms.在电子健康记录中关联母婴:确定性算法与概率性算法的比较
Pharmacoepidemiol Drug Saf. 2015 Jan;24(1):45-51. doi: 10.1002/pds.3728. Epub 2014 Nov 18.
4
Impact of linkage quality on inferences drawn from analyses using data with high rates of linkage errors in rural Tanzania.坦桑尼亚农村地区高连锁错误率数据的分析中,连锁质量对推论的影响。
BMC Med Res Methodol. 2018 Dec 10;18(1):165. doi: 10.1186/s12874-018-0632-5.
5
Record linkage under suboptimal conditions for data-intensive evaluation of primary care in Rio de Janeiro, Brazil.在条件不理想的情况下进行记录链接,以对巴西里约热内卢的初级保健进行数据密集型评估。
BMC Med Inform Decis Mak. 2021 Jun 15;21(1):190. doi: 10.1186/s12911-021-01550-6.
6
Examining the quality of record linkage process using nationwide Brazilian administrative databases to build a large birth cohort.利用全国性巴西行政数据库检查记录链接过程的质量,以建立一个大型出生队列。
BMC Med Inform Decis Mak. 2020 Jul 25;20(1):173. doi: 10.1186/s12911-020-01192-0.
7
Accuracy of Probabilistic Linkage Using the Enhanced Matching System for Public Health and Epidemiological Studies.使用公共卫生与流行病学研究增强匹配系统的概率性链接的准确性
PLoS One. 2015 Aug 24;10(8):e0136179. doi: 10.1371/journal.pone.0136179. eCollection 2015.
8
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
9
Comparing Methods for Record Linkage for Public Health Action: Matching Algorithm Validation Study.比较公共卫生行动记录链接的方法:匹配算法验证研究。
JMIR Public Health Surveill. 2020 Apr 30;6(2):e15917. doi: 10.2196/15917.
10
[Probabilistic record linkage: recovery of data on infant deaths and stillbirths in Maranhão State, Brazil].[概率性记录链接:巴西马拉尼昂州婴儿死亡和死产数据的恢复]
Cad Saude Publica. 2011 Jul;27(7):1371-9. doi: 10.1590/s0102-311x2011000700012.

引用本文的文献

1
A cohort of patients in New York State with an alcohol use disorder and subsequent treatment information - A merging of two administrative data sources.纽约州患有酒精使用障碍及后续治疗信息的患者队列 - 两个行政数据源的合并。
J Biomed Inform. 2023 Aug;144:104443. doi: 10.1016/j.jbi.2023.104443. Epub 2023 Jul 16.
2
Linkage of multiple electronic health record datasets using a 'spine linkage' approach compared with all 'pairwise linkages'.采用“脊柱链接”方法对多个电子健康记录数据集进行链接,与所有“两两链接”相比。
Int J Epidemiol. 2023 Feb 8;52(1):214-226. doi: 10.1093/ije/dyac130.
3
The Centre for Data and Knowledge Integration for Health (CIDACS): Linking Health and Social Data in Brazil.

本文引用的文献

1
Symptomatic dengue infection during pregnancy and the risk of stillbirth in Brazil, 2006-12: a matched case-control study.2006 - 2012年巴西孕期有症状的登革热感染与死产风险:一项配对病例对照研究
Lancet Infect Dis. 2017 Sep;17(9):957-964. doi: 10.1016/S1473-3099(17)30366-3.
2
Linking Data for Mothers and Babies in De-Identified Electronic Health Data.在去识别化电子健康数据中关联母婴数据
PLoS One. 2016 Oct 20;11(10):e0164667. doi: 10.1371/journal.pone.0164667. eCollection 2016.
3
Accuracy of probabilistic and deterministic record linkage: the case of tuberculosis.
巴西健康数据与知识整合中心(CIDACS):连接巴西的健康与社会数据
Int J Popul Data Sci. 2019 Nov 20;4(2):1140. doi: 10.23889/ijpds.v4i2.1140.
4
Probabilistic linkage without personal information successfully linked national clinical datasets.无需个人信息的概率链接成功链接了国家临床数据集。
J Clin Epidemiol. 2021 Aug;136:136-145. doi: 10.1016/j.jclinepi.2021.04.015. Epub 2021 Apr 28.
5
Linkage of maternity hospital episode statistics birth records to birth registration and notification records for births in England 2005-2006: quality assurance of linkage.2005 - 2006年英格兰产妇医院病历统计出生记录与出生登记及出生通知记录的关联:关联的质量保证
BMJ Open. 2020 Oct 26;10(10):e037885. doi: 10.1136/bmjopen-2020-037885.
6
Examining the quality of record linkage process using nationwide Brazilian administrative databases to build a large birth cohort.利用全国性巴西行政数据库检查记录链接过程的质量,以建立一个大型出生队列。
BMC Med Inform Decis Mak. 2020 Jul 25;20(1):173. doi: 10.1186/s12911-020-01192-0.
7
Nutrient Estimation from 24-Hour Food Recalls Using Machine Learning and Database Mapping: A Case Study with Lactose.基于机器学习和数据库映射的 24 小时食物回顾法中的营养素估计:以乳糖为例的案例研究。
Nutrients. 2019 Dec 13;11(12):3045. doi: 10.3390/nu11123045.
8
Dengue during pregnancy and live birth outcomes: a cohort of linked data from Brazil.妊娠合并登革热与活产结局:来自巴西的队列关联数据。
BMJ Open. 2019 Jul 24;9(7):e023529. doi: 10.1136/bmjopen-2018-023529.
9
Validating linkage of multiple population-based administrative databases in Brazil.验证巴西多个基于人群的行政数据库的关联性。
PLoS One. 2019 Mar 28;14(3):e0214050. doi: 10.1371/journal.pone.0214050. eCollection 2019.
10
Challenges in administrative data linkage for research.研究中行政数据链接的挑战。
Big Data Soc. 2017 Dec 5;4(2):2053951717745678. doi: 10.1177/2053951717745678.
概率性和确定性记录链接的准确性:以结核病为例。
Rev Saude Publica. 2016 Aug 22;50:49. doi: 10.1590/S1518-8787.2016050006327.
4
Prenatal parental depression and preterm birth: a national cohort study.产前父母抑郁与早产:一项全国队列研究。
BJOG. 2016 Nov;123(12):1973-1982. doi: 10.1111/1471-0528.13891. Epub 2016 Jan 19.
5
Probabilistic record linkage.概率性记录链接
Int J Epidemiol. 2016 Jun;45(3):954-64. doi: 10.1093/ije/dyv322. Epub 2015 Dec 20.
6
Fetal Gender of the First Born and the Recurrent Risk of Spontaneous Preterm Birth.头胎胎儿性别与复发性自然早产风险
Am J Perinatol. 2015 Dec;32(14):1305-10. doi: 10.1055/s-0035-1563716. Epub 2015 Sep 9.
7
Cardiovascular mortality among a cohort of hypertensive and normotensives in Rio de Janeiro - Brazil - 1991-2009.1991年至2009年巴西里约热内卢一组高血压患者和血压正常者的心血管死亡率。
BMC Public Health. 2015 Jul 8;15:623. doi: 10.1186/s12889-015-1999-4.
8
A new method for assessing how sensitivity and specificity of linkage studies affects estimation.一种评估连锁研究的敏感性和特异性如何影响估计的新方法。
PLoS One. 2014 Jul 28;9(7):e103690. doi: 10.1371/journal.pone.0103690. eCollection 2014.
9
Evaluating bias due to data linkage error in electronic healthcare records.评估电子医疗记录中因数据链接错误导致的偏差。
BMC Med Res Methodol. 2014 Mar 5;14:36. doi: 10.1186/1471-2288-14-36.
10
Linkage, evaluation and analysis of national electronic healthcare data: application to providing enhanced blood-stream infection surveillance in paediatric intensive care.国家电子医疗数据的关联、评估与分析:在儿科重症监护中用于加强血流感染监测的应用
PLoS One. 2013 Dec 20;8(12):e85278. doi: 10.1371/journal.pone.0085278. eCollection 2013.