Institute for the Study of Behavioral Health and Addiction, Boise State University, 1910 University Drive, Boise, ID, 83725-1940, USA.
School of Social Work, Boise State University, Boise, ID, USA.
Implement Sci. 2022 Oct 1;17(1):66. doi: 10.1186/s13012-022-01235-2.
Statistical tests of mediation are important for advancing implementation science; however, little research has examined the sample sizes needed to detect mediation in 3-level designs (e.g., organization, provider, patient) that are common in implementation research. Using a generalizable Monte Carlo simulation method, this paper examines the sample sizes required to detect mediation in 3-level designs under a range of conditions plausible for implementation studies.
Statistical power was estimated for 17,496 3-level mediation designs in which the independent variable (X) resided at the highest cluster level (e.g., organization), the mediator (M) resided at the intermediate nested level (e.g., provider), and the outcome (Y) resided at the lowest nested level (e.g., patient). Designs varied by sample size per level, intraclass correlation coefficients of M and Y, effect sizes of the two paths constituting the indirect (mediation) effect (i.e., X→M and M→Y), and size of the direct effect. Power estimates were generated for all designs using two statistical models-conventional linear multilevel modeling of manifest variables (MVM) and multilevel structural equation modeling (MSEM)-for both 1- and 2-sided hypothesis tests.
For 2-sided tests, statistical power to detect mediation was sufficient (≥0.8) in only 463 designs (2.6%) estimated using MVM and 228 designs (1.3%) estimated using MSEM; the minimum number of highest-level units needed to achieve adequate power was 40; the minimum total sample size was 900 observations. For 1-sided tests, 808 designs (4.6%) estimated using MVM and 369 designs (2.1%) estimated using MSEM had adequate power; the minimum number of highest-level units was 20; the minimum total sample was 600. At least one large effect size for either the X→M or M→Y path was necessary to achieve adequate power across all conditions.
While our analysis has important limitations, results suggest many of the 3-level mediation designs that can realistically be conducted in implementation research lack statistical power to detect mediation of highest-level independent variables unless effect sizes are large and 40 or more highest-level units are enrolled. We suggest strategies to increase statistical power for multilevel mediation designs and innovations to improve the feasibility of mediation tests in implementation research.
统计中介效应检验对推进实施科学很重要;然而,很少有研究探讨在实施研究中常见的 3 水平设计(例如,组织、提供者、患者)中检测中介效应所需的样本量。本文使用可推广的蒙特卡罗模拟方法,在对实施研究具有实际意义的一系列条件下,研究了 3 水平设计中检测中介效应所需的样本量。
对 17496 个 3 水平中介设计的统计功效进行了估计,其中自变量(X)位于最高聚类水平(例如,组织),中介变量(M)位于中间嵌套水平(例如,提供者),结果变量(Y)位于最低嵌套水平(例如,患者)。设计根据每个水平的样本量、M 和 Y 的类内相关系数、构成间接(中介)效应的两个路径(即 X→M 和 M→Y)的效应大小以及直接效应的大小而变化。使用两种统计模型-显变量的常规线性多层建模(MVM)和多层结构方程建模(MSEM)-为双边假设检验和单边假设检验生成了所有设计的功效估计值。
对于双边检验,使用 MVM 估计的 463 个设计(2.6%)和使用 MSEM 估计的 228 个设计(1.3%)中,检测中介效应的统计功效足够(≥0.8);达到足够功效所需的最高水平单位的最小数量为 40;最小总样本量为 900 个观测值。对于单边检验,使用 MVM 估计的 808 个设计(4.6%)和使用 MSEM 估计的 369 个设计(2.1%)具有足够的功效;所需的最高水平单位最小数量为 20;最小总样本量为 600。在所有条件下,对于 X→M 或 M→Y 路径中的至少一个大效应大小,都需要足够的功效来实现。
尽管我们的分析存在重要限制,但结果表明,在实施研究中可以实际进行的许多 3 水平中介设计缺乏检测最高水平自变量中介效应的统计功效,除非效应大小较大且纳入 40 个或更多最高水平单位。我们建议了增加多层次中介设计统计功效的策略,并提出了在实施研究中改进中介检验可行性的创新方法。