Department of Biostatistics, Epidemiology, and Informatics, The University of Pennsylvania, Philadelphia, Pennsylvania.
Department of Data and Analytics, Klynveld Peat Marwick Goerdeler US, New York, New York.
Biometrics. 2020 Mar;76(1):98-108. doi: 10.1111/biom.13133. Epub 2019 Nov 6.
Identifiability of statistical models is a fundamental regularity condition that is required for valid statistical inference. Investigation of model identifiability is mathematically challenging for complex models such as latent class models. Jones et al. used Goodman's technique to investigate the identifiability of latent class models with applications to diagnostic tests in the absence of a gold standard test. The tool they used was based on examining the singularity of the Jacobian or the Fisher information matrix, in order to obtain insights into local identifiability (ie, there exists a neighborhood of a parameter such that no other parameter in the neighborhood leads to the same probability distribution as the parameter). In this paper, we investigate a stronger condition: global identifiability (ie, no two parameters in the parameter space give rise to the same probability distribution), by introducing a powerful mathematical tool from computational algebra: the Gröbner basis. With several existing well-known examples, we argue that the Gröbner basis method is easy to implement and powerful to study global identifiability of latent class models, and is an attractive alternative to the information matrix analysis by Rothenberg and the Jacobian analysis by Goodman and Jones et al.
统计模型的可识别性是进行有效统计推断的基本正则条件。对于潜在类别模型等复杂模型,调查模型的可识别性在数学上具有挑战性。Jones 等人使用 Goodman 的技术来研究潜在类别模型的可识别性,并将其应用于缺乏金标准测试的诊断测试。他们使用的工具基于检查雅可比行列式或 Fisher 信息矩阵的奇异值,以深入了解局部可识别性(即,存在一个参数的邻域,使得邻域中的任何其他参数都不会导致与参数相同的概率分布)。在本文中,我们通过引入计算代数学中的强大数学工具:Gröbner 基,研究了一个更强的条件:全局可识别性(即,参数空间中的没有两个参数会产生相同的概率分布)。通过几个现有的著名示例,我们认为 Gröbner 基方法易于实现,并且非常强大,可以研究潜在类别模型的全局可识别性,是替代 Rothenberg 的信息矩阵分析以及 Goodman 和 Jones 等人的雅可比分析的有吸引力的选择。