Mickenautsch Steffen, Yengopal Veerasamy
Dentistry, University of the Western Cape, Cape Town, ZAF.
Community Dentistry, University of the Witwatersrand, Johannesburg, ZAF.
Cureus. 2024 Dec 30;16(12):e76607. doi: 10.7759/cureus.76607. eCollection 2024 Dec.
The aim of the study is to test the null hypothesis that the specificities and sensitivities of the p-value-based significance test for differences between baseline variables and the I test for single trials do not significantly differ in detecting selection bias in randomised controlled trials (RCTs).
In MS Excel (Microsoft Corp., Redmond, WA, US), 100 trials were simulated, each consisting of two treatment groups (A and B), with 100 subjects in each group. Fifty trials were biased, while 50 remained non-biased. Both tests were applied to all trials, yielding true positive, false positive, false negative, and true negative per test. Subsequently, sensitivities and specificities with a 95% confidence interval (CI) were calculated and statistically compared using the z-test.
No false positive results were observed, and subsequently, the specificities of both tests were identical (100.00%; 95% CI: 92.89%-100.00%). The sensitivity for the significance test and I test was 24.00% (95% CI: 13.06%-38.17%) and 76.00% (95% CI: 61.83%-86.94%), respectively. A statistical comparison of the test sensitivities yielded a significant result in favour of the I test (z = 5.2; p < 0.0001). Consequently, the null hypothesis for the tests' sensitivities was rejected.
The I test appears to be a more effective method than the p-value-based significance test for detecting selection bias in RCTs.
本研究旨在检验原假设,即基于p值的显著性检验用于基线变量差异与单项试验的I检验在检测随机对照试验(RCT)中的选择偏倚时,其特异性和敏感性无显著差异。
在MS Excel(微软公司,美国华盛顿州雷德蒙德)中模拟100项试验,每项试验包含两个治疗组(A组和B组),每组100名受试者。50项试验存在偏倚,50项试验无偏倚。将两种检验方法应用于所有试验,得出每次检验的真阳性、假阳性、假阴性和真阴性结果。随后,计算敏感性和特异性及其95%置信区间(CI),并使用z检验进行统计学比较。
未观察到假阳性结果,因此,两种检验的特异性相同(100.00%;95%CI:92.89%-100.00%)。显著性检验和I检验的敏感性分别为24.00%(95%CI:13.06%-38.17%)和76.00%(95%CI:61.83%-86.94%)。对检验敏感性的统计学比较得出显著结果,支持I检验(z = 5.2;p < 0.0001)。因此,拒绝了关于检验敏感性的原假设。
在检测RCT中的选择偏倚方面,I检验似乎比基于p值的显著性检验更有效。