Service de Biostatistique-Bioinformatique, Pôle Santé Publique, Hospices Civils de Lyon, Lyon, France.
Université de Lyon, Université Lyon 1, CNRS, Laboratoire de Biométrie et Biologie Evolutive UMR 5558, Villeurbanne, France.
Stat Med. 2018 Dec 10;37(28):4155-4166. doi: 10.1002/sim.7921. Epub 2018 Aug 2.
Classifying patients into groups according to longitudinal series of measurements (ie, trajectory classification) has become frequent in clinical research. Most classification models suppose an equal intra-group variance across groups. This assumption is sometimes inappropriate because measurements in diseased subjects are often more heterogeneous than in healthy ones. We developed a new classification model for trajectories that uses unequal intra-group variance across groups and evaluated its impact on classification using simulations and a clinical study. The classification and typical trajectories were estimated using the classification Expectation Maximization (EM) algorithm to maximize the classification likelihood, the log-likelihood being profiled during the Maximization (M) step of the algorithm. The simulations showed that assuming equal intra-group variance resulted in a high misclassification rate (up to 50%) when the real intra-group variances were different. This rate was greatly reduced by allowing intra-group variances to be different. Similar classification was obtained when the real intra-group variances were equal, except when the total sample size and the number of repeated measurements were small. In a randomized trial that compared the effect of low vs standard cyclosporine A dose on creatinine levels after cardiac transplantation, the classification model with unequal intra-group variance led to more meaningful groups than with equal intra-group variance and showed distinct benefits of low dose. In conclusion, we recommend the use of a classification model for trajectories that allows for unequal intra-group variance across groups except when the number of repeated measurements and total sample size are small.
根据纵向系列测量(即轨迹分类)将患者分为不同组别在临床研究中越来越常见。大多数分类模型假设组内方差在各组之间相等。这种假设有时并不合适,因为患病患者的测量结果通常比健康患者更具异质性。我们开发了一种新的轨迹分类模型,该模型允许组间存在不等的组内方差,并通过模拟和临床研究评估了其对分类的影响。使用分类期望最大化(EM)算法估计分类和典型轨迹,以最大化分类似然,对数似然在算法的最大化(M)步骤中进行分析。模拟结果表明,当实际组内方差不同时,假设组内方差相等会导致高错误分类率(高达 50%)。通过允许组内方差不同,可以大大降低这种错误分类率。当实际组内方差相等时,也可以获得类似的分类,但当总样本量和重复测量次数较小时除外。在一项比较心脏移植后低剂量与标准环孢素 A 剂量对肌酐水平影响的随机试验中,与组内方差相等的分类模型相比,允许组间存在不等的组内方差的分类模型导致了更有意义的分组,并且显示了低剂量的明显益处。总之,除非重复测量次数和总样本量较小,否则我们建议使用允许组间存在不等组内方差的轨迹分类模型。