Laboratory of Clinical Pharmacy, Tohoku University Graduate School of Pharmaceutical Sciences, Sendai, Japan.
Division of Preventive Medicine and Epidemiology, Tohoku University Tohoku Medical Megabank Organization, Sendai, Japan.
Curr Med Res Opin. 2022 Sep;38(9):1651-1654. doi: 10.1080/03007995.2022.2101817. Epub 2022 Jul 26.
When using administrative data, validation is essential since these data are not collected for research purposes and misclassification can occur. Thus, this study aimed to develop algorithms identifying pregnancy and to evaluate the validity of administrative claims data in Japan.
All females who visited the Tohoku University Hospital Department of Obstetrics in 2018 were included. The diagnosis, medical procedure, medication, and medical service addition fee data were utilized to identify pregnancy, with the electronic medical records set as the gold standard. Combination algorithms were developed using predefined pregnancy-related claims data with a positive predictive value (PPV) ≥80%. Sensitivity (SE), specificity (SP), PPV, and negative predictive value (NPV) with their corresponding 95% confidence intervals (CIs) were calculated for these combination algorithms.
This study included 1757 females with a mean age of 32.8 (standard deviation: 5.9) years. In general, the individual claims data were able to identify pregnancy with a PPV ≥80%; however, the number of pregnancies identified using a single claims data was limited. Based on the combination algorithm with all of the categories, including diagnosis, medical procedure, medication, and medical service addition, the calculated SE, SP, PPV, and NPV were 73.4% (95% CI: 71.2%-75.4%), 96.9% (95% CI: 89.3%-99.6%), 99.8%,(95% CI: 99.4%-100.0%), and 12.3% (95% CI: 9.6%-15.4%), respectively.
The combination algorithm to identify pregnancy demonstrated a high PPV and moderate SE. The algorithm validated in this study is expected to accelerate future studies that aim to identify pregnancies and evaluate pregnancy outcome.
在使用行政数据时,验证是必不可少的,因为这些数据不是为研究目的而收集的,并且可能会发生分类错误。因此,本研究旨在开发识别妊娠的算法,并评估日本行政索赔数据的有效性。
纳入 2018 年在东北大学医院妇产科就诊的所有女性。使用诊断、医疗程序、药物和医疗服务附加费数据来识别妊娠,以电子病历作为金标准。使用具有阳性预测值(PPV)≥80%的预定义妊娠相关索赔数据开发组合算法。计算这些组合算法的敏感性(SE)、特异性(SP)、PPV 和阴性预测值(NPV)及其相应的 95%置信区间(CI)。
本研究纳入了 1757 名女性,平均年龄为 32.8(标准差:5.9)岁。一般来说,单个索赔数据能够以 PPV≥80%识别妊娠;但是,使用单一索赔数据识别的妊娠数量有限。基于包括诊断、医疗程序、药物和医疗服务附加在内的所有类别组合算法,计算出的 SE、SP、PPV 和 NPV 分别为 73.4%(95%CI:71.2%-75.4%)、96.9%(95%CI:89.3%-99.6%)、99.8%(95%CI:99.4%-100.0%)和 12.3%(95%CI:9.6%-15.4%)。
识别妊娠的组合算法具有较高的 PPV 和中等 SE。本研究验证的算法有望加速未来旨在识别妊娠和评估妊娠结局的研究。