Galanti M Rosaria, Siliquini Roberta, Cuomo Luca, Melero Juan Carlos, Panella Massimiliano, Faggiano Fabrizio
Stockholm Center for Public Health/Tobacco Prevention, Box 17533, 118 91 Stockholm, Sweden.
Prev Med. 2007 Feb;44(2):174-7. doi: 10.1016/j.ypmed.2006.07.019. Epub 2006 Sep 18.
To study the feasibility of an anonymous coding procedure linking longitudinal information in a multi-center trial of substance abuse prevention among adolescents.
A school-based survey with a re-test procedure was conducted among 485 students (mean age 13.8 years) from three countries at four study centers to assess the accuracy and repeatability of a self-generated anonymous code.
Errors affected 18% of codes and 3% of all digits required for code generation, with the highest error rates for two of the seven generation items. Sixty-one percent of the codes generated at the test were repeated identically at the re-test. Seventy-six percent of the codes could be linked after excluding the two digits with the highest error rate in code generation, while 92% were linked using the best combination of the remaining seven or six digits. Results varied substantially between the centers.
Self-generation of anonymous codes is a feasible but not very efficient procedure for linking longitudinal data among adolescents. Easy derivation of the code and iterative matching procedures are crucial for achieving high efficiency with this type of anonymous linkage.
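The abstract does not describe the matching algorithm itself, so the following is only a minimal sketch of what an iterative matching procedure could look like, not the authors' method. It assumes that each code is a fixed-length string, that exact matches are linked first, and that the remaining codes are then compared on progressively smaller subsets of positions, accepting a link only when it is unambiguous. The function name `link_codes`, the parameter `min_digits`, and the example codes are all hypothetical.

```python
from itertools import combinations


def link_codes(test_codes, retest_codes, min_digits):
    """Link test and re-test codes (hypothetical helper, illustration only).

    Pass 1 compares all positions; later passes drop positions one at a
    time until only `min_digits` positions are compared. A pair is linked
    only if exactly one unlinked test code and exactly one unlinked
    re-test code agree on the compared positions.
    Returns a dict mapping test index -> re-test index.
    """
    if not test_codes or not retest_codes:
        return {}

    n = len(test_codes[0])                      # assumed fixed code length
    links = {}
    free_test = set(range(len(test_codes)))
    free_retest = set(range(len(retest_codes)))

    for k in range(n, min_digits - 1, -1):      # n, n-1, ..., min_digits
        for positions in combinations(range(n), k):
            # Group the still-unlinked codes by their value on `positions`.
            by_key_test, by_key_retest = {}, {}
            for i in free_test:
                key = tuple(test_codes[i][p] for p in positions)
                by_key_test.setdefault(key, []).append(i)
            for j in free_retest:
                key = tuple(retest_codes[j][p] for p in positions)
                by_key_retest.setdefault(key, []).append(j)

            # Accept only one-to-one agreements to avoid ambiguous links.
            for key, tests in by_key_test.items():
                retests = by_key_retest.get(key, [])
                if len(tests) == 1 and len(retests) == 1:
                    links[tests[0]] = retests[0]
                    free_test.discard(tests[0])
                    free_retest.discard(retests[0])
    return links


# Made-up five-character codes; the second re-test code differs in one position.
example_test = ["AB12C", "XY34Z", "MN56Q"]
example_retest = ["XY34Z", "AB13C", "MN56Q"]

exact_only = link_codes(example_test, example_retest, min_digits=5)
relaxed = link_codes(example_test, example_retest, min_digits=4)
```

In this toy example, exact matching links two of the three codes, and allowing one position to be ignored links the third. Lowering `min_digits` raises the linkage rate but also the risk of false links, which is why the sketch rejects any agreement that is not one-to-one.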