Evidence Synthesis, Modeling & Communication, Evidera, Bethesda, Maryland, USA.
Res Synth Methods. 2024 Nov;15(6):1094-1110. doi: 10.1002/jrsm.1759. Epub 2024 Sep 25.
In health technology assessment, matching-adjusted indirect comparison (MAIC) is the most common method for pairwise comparisons that control for imbalances in baseline characteristics across trials. One of the primary challenges in MAIC is the need to properly account for the additional uncertainty introduced by the matching process. Limited evidence and guidance are available on variance estimation in MAICs. Therefore, we conducted a comprehensive Monte Carlo simulation study to evaluate the performance of different statistical methods across 108 scenarios. Four general approaches for variance estimation were compared in both anchored and unanchored MAICs of binary and time-to-event outcomes: (1) conventional estimators (CE) using raw weights; (2) CE using weights rescaled to the effective sample size (ESS); (3) robust sandwich estimators; and (4) bootstrapping. Several variants of sandwich estimators and bootstrap methods were tested. Performance was quantified on the basis of empirical coverage probabilities for 95% confidence intervals and variability ratios. Variability was underestimated by CE + raw weights when population overlap was poor or moderate. Despite several theoretical limitations, CE + ESS weights accurately estimated uncertainty across most scenarios. Original implementations of sandwich estimators had a downward bias in MAICs with a small ESS, and finite sample adjustments led to marked improvements. Bootstrapping was unstable if population overlap was poor and the sample size was limited. All methods produced valid coverage probabilities and standard errors in cases of strong population overlap. Our findings indicate that the sample size, population overlap, and outcome type are important considerations for variance estimation in MAICs.
在健康技术评估中,匹配调整间接比较(MAIC)是最常用于控制试验间基线特征不平衡的成对比较的方法。MAIC 中的主要挑战之一是需要正确考虑匹配过程引入的额外不确定性。在 MAIC 中,方差估计的证据和指南有限。因此,我们进行了一项全面的蒙特卡罗模拟研究,以评估 108 种情况下不同统计方法的性能。在针对二分类和生存数据的锚定和非锚定 MAIC 中,比较了 4 种用于方差估计的一般方法:(1)使用原始权重的常规估计量(CE);(2)使用有效样本量(ESS)重新缩放权重的 CE;(3)稳健的 sandwich 估计量;(4)自举法。测试了几种 sandwich 估计量和自举方法的变体。根据 95%置信区间的经验覆盖率和变异性比来衡量性能。当群体重叠较差或中等时,CE+原始权重会低估方差。尽管存在一些理论限制,但 CE+ESS 权重在大多数情况下准确估计了不确定性。原始的 sandwich 估计量在 ESS 较小的 MAIC 中存在向下偏差,有限样本调整会显著改善。如果群体重叠较差且样本量有限,则自举法不稳定。在群体重叠较强的情况下,所有方法都产生了有效的覆盖率概率和标准误差。我们的研究结果表明,样本量、群体重叠和结果类型是 MAIC 中方差估计的重要考虑因素。