

A comparison of accuracy and computational feasibility of two record linkage algorithms in retrieving vital status information from HIV/AIDS patients registered in Brazilian public databases.

Author information

PROCC/FIOCRUZ, Avenida Brasil, 4365, Rio de Janeiro, Brazil.

Rio de Janeiro State Health Secretariat, Rua México, 128, Rio de Janeiro, Brazil.

Publication information

Int J Med Inform. 2018 Jun;114:45-51. doi: 10.1016/j.ijmedinf.2018.03.005. Epub 2018 Mar 20.

Abstract

BACKGROUND AND OBJECTIVE

While cross-referencing information on people living with HIV/AIDS (PLWHA) against the official mortality database is a critical step in monitoring the HIV/AIDS epidemic in Brazil, inaccuracies in the linkage routine may compromise the validity of the final database, yielding biased epidemiological estimates. We compared the accuracy and total runtime of two linkage algorithms applied to retrieve vital status information for PLWHA from Brazilian public databases.

METHODS

Name-identified records of PLWHA were obtained from three distinct government databases. Linkage routines included an algorithm written in Python (PLA) and Reclink software (RlS), a probabilistic linkage program widely used in Brazil. Records of PLWHA known to be alive were added to those of patients reported as deceased. These records were then searched for in the mortality system. Scenarios in which 5% and 50% of patients were actually dead were simulated, considering both complete records and records with 20% of maternal names missing.
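The simulated scenarios described above can be illustrated with a minimal sketch. The abstract does not say how records were sampled or how maternal names were removed, so everything here is an assumption: the function name `build_scenario`, the field names `mother_name` and `dead`, the choice to keep all deceased records, and the purely random blanking of maternal names are hypothetical, not the authors' procedure.

```python
import random

def build_scenario(alive, dead, death_proportion, missing_mother_rate, seed=0):
    """Mix known-alive and known-dead records so that `death_proportion`
    of the result is dead, then blank the maternal name in a random
    `missing_mother_rate` share of the records.

    Assumption: every dead record is kept and alive records are
    subsampled to reach the target proportion.
    """
    rng = random.Random(seed)
    # Alive records needed so that dead / (dead + alive) == death_proportion.
    n_alive = round(len(dead) * (1 - death_proportion) / death_proportion)
    if n_alive > len(alive):
        raise ValueError("not enough alive records for this proportion")
    # Copy each record so the input lists are left untouched.
    sample = [dict(r, dead=False) for r in rng.sample(alive, n_alive)]
    sample += [dict(r, dead=True) for r in dead]
    rng.shuffle(sample)
    # Assumption: maternal names go missing completely at random.
    n_missing = round(len(sample) * missing_mother_rate)
    for record in rng.sample(sample, n_missing):
        record["mother_name"] = None
    return sample

# Toy data: 200 alive and 5 dead records with made-up names.
alive = [{"name": f"alive {i}", "mother_name": f"mother a{i}"} for i in range(200)]
dead = [{"name": f"dead {i}", "mother_name": f"mother d{i}"} for i in range(5)]
scenario = build_scenario(alive, dead, death_proportion=0.05,
                          missing_mother_rate=0.20)
```

Because the true vital status travels with each record in the `dead` field, any linkage output run on such a scenario can later be scored against a known ground truth.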

RESULTS

When complete information was available, both algorithms had comparable accuracy. In the scenario with 20% of maternal names missing, PLA and RlS had sensitivities of 94.5% and 94.6% (p > 0.5), respectively; after manual review, PLA sensitivity increased to 98.4% (96.6-100.0), exceeding that of RlS (p < 0.01). PLA had a higher positive predictive value in the 5% death proportion scenario. RlS intrinsically required manual review for up to 14% of the records of people actually dead, whereas the corresponding proportion for PLA ranged from 1.5% to 2%. Omitting manual inspection did not alter PLA sensitivity when complete information was available. With incomplete data, manual review raised PLA sensitivity from 94.5% to 98.4%, exceeding that of RlS (94.6%, p < 0.05). RlS required considerably less processing time than PLA.
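The two accuracy measures reported above follow the standard definitions: sensitivity is the share of truly dead patients that the linkage finds, and positive predictive value (PPV) is the share of linked patients who are truly dead. The sketch below shows the arithmetic only; the function name `linkage_accuracy` and the use of patient identifiers in sets are assumptions for illustration, and the numbers are toy values, not study data.

```python
def linkage_accuracy(true_dead, linked_dead):
    """Return (sensitivity, ppv) for a linkage run.

    true_dead   -- identifiers of patients who are actually dead
    linked_dead -- identifiers the linkage matched to a death record
    """
    true_dead, linked_dead = set(true_dead), set(linked_dead)
    tp = len(true_dead & linked_dead)   # dead and correctly linked
    fn = len(true_dead - linked_dead)   # dead but missed by the linkage
    fp = len(linked_dead - true_dead)   # alive but wrongly linked
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    ppv = tp / (tp + fp) if (tp + fp) else float("nan")
    return sensitivity, ppv

# Toy example: 4 true deaths, 3 found, plus 2 false links.
sensitivity, ppv = linkage_accuracy({1, 2, 3, 4}, {1, 2, 3, 9, 10})
```

PPV depends on the proportion of true deaths in the searched population: with few true deaths, the same number of false links lowers PPV more, which is why the low death proportion scenario is the harder test for this measure.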

CONCLUSION

Both linkage algorithms showed comparable accuracy in retrieving vital status data for PLWHA. RlS had a considerably shorter runtime but intrinsically required manual review of a burdensome proportion of the matched records. PLA, on the other hand, took considerably longer to run but avoided manual review at no cost to accuracy.

