Fett M J
Int J Epidemiol. 1984 Sep;13(3):351-55. doi: 10.1093/ije/13.3.351.
The hypothesis that pairs of records with identical surnames, given names and birth dates represent the same person was tested by compiling a frequency distribution of the number of birth date digits in common when the names contained in two registers were matched. This distribution was compared with a computer simulation of the distribution which would be expected if the paired records represented different people. The divergence of the two distributions in the region of five and six birth date digits in common confirmed the hypothesis. Where surname, two given names and at least four date of birth digits matched, only 0.012% of the matching records represented different people. Where surname, two given initials and six date of birth digits matched, 0.1% of the matched records represented different people.
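The comparison described in the abstract depends on knowing how many birth date digits two different people would share by chance. The following is a minimal sketch of such a simulation, not the author's program. It assumes that dates are written as six digits in DDMMYY order, that digits are compared position by position, and that birth dates are uniformly distributed between 1900 and 1983. None of these details are stated in the abstract, and all function names are hypothetical.

```python
import random
from collections import Counter
from datetime import date, timedelta


def random_birth_date(rng, start=date(1900, 1, 1), end=date(1983, 12, 31)):
    """Draw a birth date uniformly between start and end (assumed range)."""
    span = (end - start).days
    return start + timedelta(days=rng.randint(0, span))


def date_digits(d):
    """Six-digit DDMMYY representation (assumed format)."""
    return d.strftime("%d%m%y")


def digits_in_common(a, b):
    """Count positions at which the two six-digit dates agree."""
    return sum(x == y for x, y in zip(date_digits(a), date_digits(b)))


def simulate_distribution(n_pairs, seed=0):
    """Relative frequency of 0..6 digits in common for unrelated pairs."""
    rng = random.Random(seed)
    counts = Counter()
    for _ in range(n_pairs):
        a = random_birth_date(rng)
        b = random_birth_date(rng)
        counts[digits_in_common(a, b)] += 1
    return {k: counts.get(k, 0) / n_pairs for k in range(7)}


if __name__ == "__main__":
    for k, share in simulate_distribution(100_000).items():
        print(f"{k} digits in common: {share:.4%}")
```

Under these assumptions, the observed distribution from name-matched records can be set beside the simulated one. An excess of observed pairs at five and six digits in common, relative to the simulated shares, corresponds to the divergence the abstract reports.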