Department of Reproduction, Development, and Regeneration, KU Leuven - University of Leuven, Leuven, Belgium.
Stat Med. 2012 Oct 15;31(23):2610-26. doi: 10.1002/sim.5321. Epub 2012 Jun 26.
Diagnostic problems in medicine are sometimes polytomous, meaning that the outcome has more than two distinct categories. For example, ovarian tumors can be benign, borderline, primary invasive, or metastatic. Extending the main measure of binary discrimination, the c-statistic or area under the ROC curve, to nominal polytomous settings is not straightforward. This paper reviews existing measures and presents the polytomous discrimination index (PDI) as an alternative. The PDI assesses all sets of k cases consisting of one case from each outcome category. For each category i (i = 1, … ,k), it is assessed whether the risk of category i is highest for the case from category i. A score of 1∕k is given per category for which this holds, yielding a set score between 0 and 1 to indicate the level of discrimination. The PDI is the average set score and is interpreted as the probability to correctly identify a case from a randomly selected category within a set of k cases. This probability can be split up by outcome category, yielding k category-specific values that result in the PDI when averaged. We demonstrate the measures on two diagnostic problems (residual mass histology after chemotherapy for testicular cancer; diagnosis of ovarian tumors). We compare the behavior of the measures on theoretical data, showing that PDI is more strongly influenced by simultaneous discrimination between all categories than by partial discrimination between pairs of categories. In conclusion, the PDI is attractive because it better matches the requirements of a measure to summarize polytomous discrimination.
医学中的诊断问题有时是多分类的,这意味着结果有超过两个明显的类别。例如,卵巢肿瘤可以是良性的、交界性的、原发性侵袭性的或转移性的。将二分类判别分析的主要度量标准,即 c 统计量或 ROC 曲线下面积,扩展到名义多分类设置并不简单。本文回顾了现有的度量标准,并提出了多分类判别指数(PDI)作为替代方法。PDI 评估由每个类别中的一个病例组成的 k 个病例的所有集合。对于每个类别 i(i=1,…,k),评估 i 类的风险是否对 i 类的病例最高。对于这种情况,每个类别都会获得 1∕k 的分数,从而得出 0 到 1 的集合分数,以表示判别水平。PDI 是平均集合分数,解释为在 k 个病例的集合中正确识别随机选择类别的病例的概率。该概率可以按结局类别进行划分,从而得出平均时产生 PDI 的 k 个类别特异性值。我们在两个诊断问题(睾丸癌化疗后残留肿块组织学;卵巢肿瘤的诊断)上演示了这些度量标准。我们比较了这些度量标准在理论数据上的行为,表明 PDI 受到所有类别的同时判别比类别对之间的部分判别更强烈的影响。总之,PDI 很有吸引力,因为它更好地满足了总结多分类判别所需的要求。