Department of Educational Studies, Ohio State University, Columbus, OH, USA.
Department of Individual, Family, and Community Education, University of New Mexico, Albuquerque, NM, USA.
Behav Res Methods. 2020 Apr;52(2):591-606. doi: 10.3758/s13428-019-01257-7.
Regression mixture models are one increasingly utilized approach for developing theories about and exploring the heterogeneity of effects. In this study we aimed to extend the current use of regression mixtures to a repeated regression mixture method when repeated measures, such as diary-type and experience-sampling method, data are available. We hypothesized that additional information borrowed from the repeated measures would improve the model performance, in terms of class enumeration and accuracy of the parameter estimates. We specifically compared three types of model specifications in regression mixtures: (a) traditional single-outcome model; (b) repeated measures models with three, five, and seven measures; and (c) a single-outcome model with the average of seven repeated measures. The results showed that the repeated measures regression mixture models substantially outperformed the traditional and average single-outcome models in class enumeration, with less bias in the parameter estimates. For sample size, whereas prior recommendations have suggested that regression mixtures require samples of well over 1,000 participants, even for classes at a large distance from each other (classes with regression weights of .20 vs. .70), the present repeated measures regression mixture models allow for samples as low as 200 participants with an increased number (i.e., seven) of repeated measures. We also demonstrate an application of the proposed repeated measures approach using data from the Sleep Research Project. Implications and limitations of the study are discussed.
回归混合模型是一种越来越被广泛应用于发展理论和探索效应异质性的方法。本研究旨在将回归混合模型的现有应用扩展到重复回归混合方法,当可获得重复测量(如日记式和经验采样法数据)时。我们假设从重复测量中额外借用的信息将提高模型的性能,表现在分类枚举和参数估计的准确性方面。我们特别比较了回归混合物中的三种模型规格:(a)传统的单因变量模型;(b)具有三、五和七项测量的重复测量模型;以及(c)具有七项重复测量平均值的单因变量模型。结果表明,重复测量回归混合模型在分类枚举方面明显优于传统的和平均单因变量模型,参数估计的偏差更小。关于样本量,虽然先前的建议表明回归混合物需要超过 1000 名参与者的样本量,即使对于彼此相距较大的类别(回归权重为.20 与.70 的类别)也是如此,但本研究中的重复测量回归混合模型允许使用样本量低至 200 名参与者,并增加了重复测量的数量(即七项)。我们还使用睡眠研究项目的数据演示了所提出的重复测量方法的应用。讨论了研究的意义和局限性。