Xu Jiayue, Wang Yuning, He Qiao, Xie Shuangyi, Feng Sheng, Wang Xiaofei, Wang Wen, Sun Xin
Intensive Care Unit, Chinese Evidence-Based Medicine Center and Cochrane China Center, West China Hospital, Sichuan University, Chengdu, Sichuan, China.
NMPA Key Laboratory for Real World Data Research and Evaluation in Hainan, Chengdu, China.
BMC Med. 2025 Jul 1;23(1):393. doi: 10.1186/s12916-025-04199-4.
Sensitivity analysis is a crucial approach to assessing the "robustness" of research findings. Previous reviews have revealed significant concerns regarding the misuse and misinterpretation of sensitivity analyses in observational studies using routinely collected healthcare data (RCD). However, little is known regarding how sensitivity analyses are conducted in real-world observational studies, and to what extent their results and interpretations differ from primary analyses.
We searched PubMed for observational studies assessing drug treatment effects published between January 2018 and December 2020 in core clinical journals defined by the National Library of Medicine. Information on sensitivity analyses was extracted using standardized, pilot-tested collection forms. We characterized the sensitivity analyses conducted and compared the treatment effects estimated by primary and sensitivity analyses. The association between study characteristics and the agreement of primary and sensitivity analysis results were explored using multivariable logistic regression.
Of the 256 included studies, 152 (59.4%) conducted sensitivity analyses, with a median number of three (IQR: two to six), and 131 (51.2%) reported the results clearly. Of these 131 studies, 71 (54.2%) showed significant differences between the primary and sensitivity analyses, with an average difference in effect size of 24% (95% CI 12% to 35%). Across the 71 studies, 145 sensitivity analyses showed inconsistent results with the primary analyses, including 59 using alternative study definitions, 39 using alternative study designs, and 38 using alternative statistical models. Only nine of the 71 studies discussed the potential impact of these inconsistencies. The remaining 62 either suggested no impact or did not note any differences. Conducting three or more sensitivity analyses, not having a large effect size (0.5-2 for ratio measures, ≤ 3 for standardized difference measures), using blank controls, and publishing in a non-Q1 journal were more likely to exhibit inconsistent results.
Over 40% of observational studies using RCD conduct no sensitivity analyses. Among those that did, the results often differed between the sensitivity and primary analyses; however, these differences are rarely taken into account. The practice of conducting sensitivity analyses and addressing inconsistent results between sensitivity and primary analyses is in urgent need of improvement.
敏感性分析是评估研究结果“稳健性”的关键方法。以往的综述显示,人们对在使用常规收集的医疗保健数据(RCD)的观察性研究中滥用和错误解读敏感性分析存在重大担忧。然而,对于在实际观察性研究中如何进行敏感性分析,以及其结果和解释与初步分析在多大程度上存在差异,我们知之甚少。
我们在PubMed中检索了2018年1月至2020年12月期间在国立医学图书馆定义的核心临床期刊上发表的评估药物治疗效果的观察性研究。使用标准化的、经过预试验的收集表格提取有关敏感性分析的信息。我们对所进行的敏感性分析进行了特征描述,并比较了初步分析和敏感性分析估计的治疗效果。使用多变量逻辑回归探讨研究特征与初步分析和敏感性分析结果一致性之间的关联。
在纳入的256项研究中,152项(59.4%)进行了敏感性分析,中位数为三项(四分位间距:两项至六项),131项(51.2%)清晰报告了结果。在这131项研究中,71项(54.2%)显示初步分析和敏感性分析之间存在显著差异,效应大小的平均差异为24%(95%置信区间12%至35%)。在这71项研究中,145次敏感性分析显示结果与初步分析不一致,其中59次使用了替代研究定义,39次使用了替代研究设计,38次使用了替代统计模型。71项研究中只有9项讨论了这些不一致的潜在影响。其余62项要么表明没有影响,要么未指出任何差异。进行三次或更多次敏感性分析、效应大小不大(比值测量为0.5至2,标准化差异测量≤3)、使用空白对照以及在非Q1期刊上发表的研究更有可能呈现不一致的结果。
超过40%使用RCD的观察性研究未进行敏感性分析。在进行了敏感性分析的研究中,敏感性分析结果和初步分析结果往往存在差异;然而,这些差异很少被考虑在内。进行敏感性分析以及解决敏感性分析和初步分析之间不一致结果的做法亟待改进。