Oracle Health Sciences, Burlington, Massachusetts, USA,
Drug Saf. 2013 Oct;36 Suppl 1:S123-32. doi: 10.1007/s40264-013-0106-y.
To evaluate the performance of a disproportionality design, commonly used for analysis of spontaneous reports data such as the FDA Adverse Event Reporting System database, as a potential analytical method for an adverse drug reaction risk identification system using healthcare data.
We tested the disproportionality design in 5 real observational healthcare databases and 6 simulated datasets, retrospectively studying the predictive accuracy of the method when applied to a collection of 165 positive controls and 234 negative controls across 4 outcomes: acute liver injury, acute myocardial infarction, acute kidney injury, and upper gastrointestinal bleeding.
We estimate how well the method can be expected to identify true effects and discriminate from false findings and explore the statistical properties of the estimates the design generates. The primary measure was the area under the curve (AUC) of the receiver operating characteristic (ROC) curve.
For each combination of 4 outcomes and 5 databases, 48 versions of disproportionality analysis (DPA) were carried out and the AUC computed. The majority of the AUC values were in the range of 0.35 < AUC < 0.6, which is considered to be poor predictive accuracy, since the value AUC = 0.5 would be expected from mere random assignment. Several DPA versions achieved AUC of about 0.7 for the outcome Acute Renal Failure within the GE database. The overall highest DPA version across all 20 outcome-database combinations was the Bayesian Information Component method with no stratification by age and gender, using first occurrence of outcome and with assumed time-at-risk equal to duration of exposure + 30 d, but none were uniformly optimal. The relative risk estimates for the negative control drug-event combinations were very often biased either upward or downward by a factor of 2 or more. Coverage probabilities of confidence intervals from all methods were far below nominal.
The disproportionality methods that we evaluated did not discriminate true positives from true negatives using healthcare data as they seem to do using spontaneous report data.
评估不匀称性设计的性能,该设计常用于分析自发报告数据,如 FDA 不良事件报告系统数据库,作为使用医疗保健数据的药物不良反应风险识别系统的潜在分析方法。
我们在 5 个真实的观察性医疗保健数据库和 6 个模拟数据集上测试了不匀称性设计,回顾性地研究了该方法在应用于包含 165 个阳性对照和 234 个阴性对照的 4 个结果(急性肝损伤、急性心肌梗死、急性肾损伤和上消化道出血)时的预测准确性。
我们估计该方法能够识别真实效果的程度,并与虚假发现进行区分,并探索设计生成的估计值的统计特性。主要衡量标准是接收者操作特性(ROC)曲线下的面积(AUC)。
对于 4 个结果和 5 个数据库的每种组合,进行了 48 次不匀称性分析(DPA)并计算了 AUC。大多数 AUC 值在 0.35 < AUC < 0.6 的范围内,这被认为是预测准确性差,因为仅随机分配的 AUC = 0.5。在 GE 数据库中,几种 DPA 版本对急性肾衰竭的结果达到了 AUC 约为 0.7。在所有 20 个结果-数据库组合中,整体最高的 DPA 版本是贝叶斯信息分量方法,没有按年龄和性别分层,使用结果的首次发生,并假设风险时间等于暴露持续时间+ 30 天,但没有一种方法是统一最优的。阴性对照药物-事件组合的相对风险估计值经常向上或向下偏倚 2 倍或更多。所有方法的置信区间覆盖概率远低于名义值。
我们评估的不匀称性方法在使用医疗保健数据时,并未像在使用自发报告数据时那样,区分真阳性和真阴性。