Center for Research in Occupational Health (CiSAL), University Pompeu Fabra, Barcelona, Spain.
Research Group on Statistics, Econometrics and Health (GRECS) University of Girona, Girona, Spain.
PLoS One. 2022 Feb 11;17(2):e0263810. doi: 10.1371/journal.pone.0263810. eCollection 2022.
Trajectory analyses are being increasingly used in efforts to increase understanding about the heterogeneity in the development of different longitudinal outcomes such as sickness absence, use of medication, income, or other time varying outcomes. However, several methodological and interpretational challenges are related to using trajectory analyses. This methodological study aimed to compare results using two different types of software to identify trajectories and to discuss methodological aspects related to them and the interpretation of the results.
Group-based trajectory models (GBTM) and latent class growth models (LCGM) were fitted, using SAS and Mplus, respectively. The data for the examples were derived from a representative sample of Spanish workers in Catalonia, covered by the social security system (n = 166,192). Repeatedly measured sickness absence spells per trimester (n = 96,453) were from the Catalan Institute of Medical Evaluations. The analyses were stratified by sex and two birth cohorts (1949-1969 and 1970-1990).
Neither of the software were superior to the other. Four groups were the optimal number of groups in both software, however, we detected differences in the starting values and shapes of the trajectories between the two software used, which allow for different conclusions when they are applied. We cover questions related to model fit, selecting the optimal number of trajectory groups, investigating covariates, how to interpret the results, and what are the key pitfalls and strengths of using these person-oriented methods.
Future studies could address further methodological aspects around these statistical techniques, to facilitate epidemiological and other research dealing with longitudinal study designs.
轨迹分析越来越多地被用于增加对不同纵向结果(如病假、药物使用、收入或其他随时间变化的结果)发展的异质性的理解。然而,使用轨迹分析存在一些方法学和解释方面的挑战。本方法学研究旨在比较使用两种不同类型的软件来识别轨迹的结果,并讨论与之相关的方法学方面以及对结果的解释。
使用 SAS 和 Mplus 分别拟合基于群组的轨迹模型(GBTM)和潜在类别增长模型(LCGM)。示例数据来自加泰罗尼亚代表性的西班牙工人样本,涵盖社会保障系统(n=166192)。每个季度重复测量的病假缺勤期(n=96453)来自加泰罗尼亚医学评估研究所。分析按性别和两个出生队列(1949-1969 年和 1970-1990 年)分层。
两种软件都没有优势。在两种软件中,四个组都是最佳的组数量,但我们检测到两种软件之间轨迹的起始值和形状存在差异,这允许在应用时得出不同的结论。我们涵盖了与模型拟合、选择最佳轨迹组数量、调查协变量、如何解释结果以及使用这些面向个体的方法的关键陷阱和优势相关的问题。
未来的研究可以围绕这些统计技术进一步解决方法学方面的问题,以促进处理纵向研究设计的流行病学和其他研究。