Clinical Epidemiology, Leibniz Institute for Prevention Research and Epidemiology - BIPS, Bremen, Germany.
Health Services Research, IGES Institut GmbH, Berlin, Germany.
BMJ Open. 2019 Jul 2;9(7):e028223. doi: 10.1136/bmjopen-2018-028223.
Claims data need to be validated to assess their use for epidemiological research. This study aimed to examine the validity of mortality information in the German Pharmacoepidemiological Research Database (GePaRD).
Validation study, secondary data, medical claims.
Claims data of two German nationwide acting statutory health insurance providers (SHIs) contributing data for GePaRD; record linkage with epidemiological cancer registry providing individual official mortality information.
All women insured with the two SHIs whose insurance coverage ended in the period 2006-2013 and who were residents of North Rhine Westphalia.
Descriptive statistics were used to analyse the performance of the linkage procedure. Further, we calculated measures of agreement between the official and the GePaRD-based vital status and assessed differences between the official and the GePaRD-based date of death.
Of the 256 111 women of the linkage sample, 25 528 were classified as 'deceased' in GePaRD and the others as 'alive'. Compared with the official data, the GePaRD-based vital status showed a sensitivity of 95.9% and a specificity of 99.4%. The negative predictive value was 99.6% and the positive predictive value 94.3%. The date of death agreed in 96.3% between both data sources.
The vital status recorded in GePaRD was of high accuracy and discrepancies between dates of death in GePaRD and official dates were rare. This underlines the potential of the database for conducting large cohort studies with mortality as the endpoint.
为了评估其在流行病学研究中的应用,需要对索赔数据进行验证。本研究旨在检验德国药物流行病学研究数据库(GePaRD)中死亡率信息的有效性。
验证研究,二级数据,医疗索赔。
为 GePaRD 提供数据的两家德国全国性法定健康保险公司(SHI)的索赔数据;与提供个人官方死亡率信息的流行病学癌症登记处进行记录链接。
两家 SHI 的所有保险范围在 2006-2013 年期间结束的女性被保险人,且居住在北莱茵-威斯特法伦州。
使用描述性统计来分析链接程序的性能。此外,我们计算了官方和 GePaRD 基于的生存状态之间的一致性度量,并评估了官方和 GePaRD 基于的死亡日期之间的差异。
在链接样本的 256111 名女性中,25528 人被归类为 GePaRD 中的“死亡”,其余人被归类为“存活”。与官方数据相比,基于 GePaRD 的生存状态具有 95.9%的敏感性和 99.4%的特异性。阴性预测值为 99.6%,阳性预测值为 94.3%。两个数据源之间的死亡日期一致率为 96.3%。
GePaRD 中记录的生存状态具有很高的准确性,并且 GePaRD 和官方死亡日期之间的差异很少。这强调了该数据库在以死亡率为终点的大型队列研究中的潜力。