超越 2×2 contingency 表：涉及 m 个诊断类别和 n 个诊断测试类别的各种场景中熵和互信息的入门介绍。

Beyond the 2×2 -contingency table: a primer on entropies and mutual information in various scenarios involving m diagnostic categories and n categories of diagnostic tests.

机构信息

Institute of Physiological Chemistry, Center of Physiological Medicine, Medical University of Graz, A-8010 Graz, Austria.

出版信息

Clin Chim Acta. 2013 Oct 21;425:97-103. doi: 10.1016/j.cca.2013.07.011. Epub 2013 Jul 22.

DOI:10.1016/j.cca.2013.07.011

PMID:23886553

Abstract

BACKGROUND

Usual evaluation tools for diagnostic tests such as, sensitivity/specificity and ROC analyses, are designed for the discrimination between two diagnostic categories, using dichotomous test results. Information theoretical quantities such as mutual information allow in depth-analysis of more complex discrimination problems, including continuous test results, but are rarely used in clinical chemistry. This paper provides a primer on useful information theoretical concepts with a strong focus on typical diagnostic scenarios.

METHODS AND RESULTS

Information theoretical concepts are shortly explained. Mathematica CDF documents are provided which compute entropies and mutual information as function of pretest probabilities and the distribution of test results among the categories, and allow interactive exploration of the behavior of these quantities in comparison with more conventional diagnostic measures. Using data from a previously published study, the application of information theory to practical diagnostic problems involving up to 4×4 -contingency tables is demonstrated.

CONCLUSIONS

Information theoretical concepts are particularly useful for diagnostic problems requiring more than the usual binary classification. Quantitative test results can be properly analyzed, and in contrast to popular concepts such as ROC analysis, the effects of variations of pre-test probabilities of the diagnostic categories can be explicitly taken into account.

摘要