Department of Medicine and Therapeutics, Gardiner Institute, Western Infirmary, Glasgow, UK.
Stroke. 2012 May;43(5):1401-3. doi: 10.1161/STROKEAHA.111.644484. Epub 2012 Feb 2.
Clinical trials in stroke typically measure outcome after 90 days. Earlier outcome assessment would reduce costs and may increase power. We aimed to compare the sensitivity of 4 end points (modified Rankin Scale [mRS] at 30 and 90 days, and National Institutes of Health Stroke Scale (NIHSS) at 7 and 90 days, analyzed as ordinal measures) to detect the established treatment effect of recombinant tissue-type plasminogen activator (rtPA).
Within the Virtual International Stroke Trials Archive, we compared rtPA-treated patients with untreated control subjects using a multiple resampling approach. From our total sample we drew 10 000 random samples of unique patients, constraining the sample sizes in treated and untreated groups to be equal. In each of these samples we tested for the treatment effect of rtPA by each of the 4 studied end points. The percentage of samples yielding significant results approximates the power of each end point at a given sample size. This process was repeated across a range of sample sizes, to determine the relationship between sample size and power for each of the 4 end points.
For our 4 end points of mRS at 30 and 90 days, and NIHSS at 7 and 90 days the smallest sample sizes required to generate statistical power >80% were 620, 480, 370, and 420, respectively, making 7-day NIHSS the most sensitive end point. These results were supported by dichotomized analyses.
Seven-day NIHSS score appears a sensitive end point that should be validated in randomized trial datasets for use in exploratory stroke trials.
中风临床试验通常在 90 天后评估结果。更早的结果评估可以降低成本,并且可能增加功效。我们旨在比较 4 个终点(30 天和 90 天的改良 Rankin 量表[mRS],以及 7 天和 90 天的国立卫生研究院卒中量表[NIHSS],作为有序测量进行分析)对检测重组组织型纤溶酶原激活剂(rtPA)既定治疗效果的敏感性。
在虚拟国际中风试验档案中,我们使用多次重采样方法比较了 rtPA 治疗的患者与未治疗的对照组。从我们的总样本中,我们抽取了 10000 个独特患者的随机样本,限制治疗组和未治疗组的样本量相等。在这些样本中的每一个中,我们通过 4 个研究终点来测试 rtPA 的治疗效果。产生显著结果的样本百分比近似于每个终点在给定样本量下的功效。此过程在一系列样本量中重复进行,以确定每个 4 个终点的样本量与功效之间的关系。
对于我们的 mRS 在 30 天和 90 天的 4 个终点,以及 NIHSS 在 7 天和 90 天的 4 个终点,生成>80%统计功效所需的最小样本量分别为 620、480、370 和 420,使得 7 天 NIHSS 成为最敏感的终点。这些结果得到了二分类分析的支持。
7 天 NIHSS 评分似乎是一个敏感的终点,应该在随机试验数据集进行验证,以便在探索性中风试验中使用。