Suppr超能文献

用于流行病学随访数据保密的自动记录哈希编码与链接

Automatic record hash coding and linkage for epidemiological follow-up data confidentiality.

作者信息

Quantin C, Bouzelat H, Allaert F A, Benhamiche A M, Faivre J, Dusserre L

机构信息

Department of Medical Informatics, Teaching Hospital of Dijon, France.

出版信息

Methods Inf Med. 1998 Sep;37(3):271-7.

PMID:9787628
Abstract

A protocol is proposed to allow linkage of anonymous medical information within the framework of epidemiological follow-up studies. The protocol is composed of two steps; the first concerns the irreversible transformation of identification data, using a one-way hash function which is used after spelling processing. To avoid dictionary attacks, two large random files of keys, called pads, are introduced. The second step consists in the linkage of files rendered anonymous. The weight given to each linkage field is estimated by a mixture model, the likelihood of which being maximized with the Expectation and Maximization (EM) algorithm. The performance of this method has been assessed by comparing record linkage, based on exclusive use of the automatic procedure, with a manual linkage, obtained by the Burgundy Registry of Digestive Cancers. The result of the linkage of a file of 2,847 cancers with a file of 388,614 hospitalization stays in the Dijon university hospital showed a sensitivity of 97% and a specificity of 93%.

摘要

提出了一种方案,以允许在流行病学随访研究框架内对匿名医疗信息进行关联。该方案由两个步骤组成;第一步涉及使用单向哈希函数对识别数据进行不可逆转换,该函数在拼写处理后使用。为避免字典攻击,引入了两个称为填充的大型随机密钥文件。第二步是对已匿名化的文件进行关联。每个关联字段的权重由混合模型估计,该模型的似然性通过期望最大化(EM)算法最大化。通过将仅使用自动程序的记录关联与勃艮第消化癌登记处获得的手动关联进行比较,评估了该方法的性能。在第戎大学医院,将2847例癌症病例文件与388614例住院记录文件进行关联的结果显示,灵敏度为97%,特异性为93%。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验