Uno Satoshi, Tango Toshiro
The Graduate University for Advanced Studies (SOKENDAI), Tachikawa-Shi, Tokyo, Japan.
Center of Medical Statistics, Minato-Ku, Tokyo, Japan.
Ann Clin Epidemiol. 2024 Jul 18;6(4):77-86. doi: 10.37737/ace.24012. eCollection 2024 Oct 1.
Large electronic databases have been widely used in recent years; however, they can be susceptible to bias due to incomplete information. To address this, validation studies have been conducted to assess the accuracy of disease diagnoses defined in databases. However, such studies may be constrained by potential misclassification in references and the interdependence between diagnoses from the same data source.
This study employs latent class modeling with Bayesian inference to estimate the sensitivity, specificity, and positive/negative predictive values of different diagnostic definitions. Four models are defined with/without assumptions of the gold standard and conditional independence, and then compared with breast cancer study data as a motivating example. Additionally, simulations that generated data under various true values are used to compare the performance of each model with bias, Pearson-type goodness-of-fit statistics, and widely applicable information criterion.
The model assuming conditional dependence and non-gold standard references exhibited the best predictive performance among the four models in the motivating example data analysis. The disease prevalence was slightly higher than that in previous findings, and the sensitivities were significantly lower than those of the other models. Additionally, bias evaluation showed that the Bayesian models with more assumptions and the frequentist model performed better under the true value conditions. The Bayesian model with fewer assumptions performed well in terms of goodness of fit and widely applicable information criteria.
The current assessments of outcome validation can introduce bias. The proposed approach can be adopted broadly as a valuable method for validation studies.
近年来,大型电子数据库得到了广泛应用;然而,由于信息不完整,它们可能容易产生偏差。为了解决这个问题,已经开展了验证研究来评估数据库中定义的疾病诊断的准确性。然而,此类研究可能会受到参考文献中潜在的错误分类以及来自同一数据源的诊断之间的相互依赖性的限制。
本研究采用贝叶斯推断的潜在类别模型来估计不同诊断定义的敏感性、特异性以及阳性/阴性预测值。定义了四个模型,分别有无金标准假设和条件独立性假设,然后以乳腺癌研究数据为例进行比较。此外,使用在各种真实值下生成数据的模拟来通过偏差、Pearson 型拟合优度统计量和广泛适用的信息准则比较每个模型的性能。
在激励示例数据分析中,假设条件依赖性和非金标准参考文献的模型在四个模型中表现出最佳的预测性能。疾病患病率略高于先前的研究结果,并且敏感性显著低于其他模型。此外,偏差评估表明,假设更多的贝叶斯模型和频率主义模型在真实值条件下表现更好。假设较少的贝叶斯模型在拟合优度和广泛适用的信息准则方面表现良好。
目前对结果验证的评估可能会引入偏差。所提出的方法可以广泛用作验证研究的一种有价值的方法。