Pepe Margaret Sullivan, Janes Holly
Department of Biostatistics, University of Washington and Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA.
Biostatistics. 2007 Apr;8(2):474-84. doi: 10.1093/biostatistics/kxl038. Epub 2006 Nov 3.
Latent class analysis is used to assess diagnostic test accuracy when a gold standard assessment of disease is not available but results of multiple imperfect tests are. We consider the simplest setting, where 3 tests are observed and conditional independence (CI) is assumed. Closed-form expressions for maximum likelihood parameter estimates are derived. They show explicitly how observed 2- and 3-way associations between test results are used to infer disease prevalence and test true- and false-positive rates. Although interesting and reasonable under CI, the estimators clearly have no basis when it fails. Intuition for bias induced by conditional dependence follows from the analytic expressions. Further intuition derives from an Expectation Maximization (EM) approach to calculating the estimates. We discuss implications of our results and related work for settings where more than 3 tests are available. We conclude that careful justification of assumptions about the dependence between tests in diseased and nondiseased subjects is necessary in order to ensure unbiased estimates of prevalence and test operating characteristics and to provide these estimates clinical interpretations. Such justification must be based in part on a clear clinical definition of disease and biological knowledge about mechanisms giving rise to test results.
当无法获得疾病的金标准评估,但有多个不完善检测的结果时,潜在类别分析用于评估诊断检测的准确性。我们考虑最简单的情况,即观察到3项检测,并假设条件独立性(CI)成立。推导了最大似然参数估计的闭式表达式。它们明确显示了如何利用检测结果之间观察到的二元和三元关联来推断疾病患病率以及检测的真阳性率和假阳性率。虽然在CI假设下这些估计很有趣且合理,但当CI不成立时,这些估计显然没有依据。由条件依赖性引起的偏差的直观理解来自于解析表达式。进一步的直观理解来自于用于计算估计值的期望最大化(EM)方法。我们讨论了我们的结果和相关工作对有超过3项检测可用的情况的影响。我们得出结论,为了确保患病率和检测操作特征的无偏估计,并为这些估计提供临床解释,必须仔细论证关于患病和未患病个体中检测之间依赖性的假设。这种论证必须部分基于明确的疾病临床定义以及关于产生检测结果的机制的生物学知识。