Department of Community Health Sciences, Faculty of Medicine, University of Calgary, Calgary, Alberta, Canada.
Department of Clinical Neurosciences, Foothills Hospital, Calgary, Alberta, Canada.
PLoS One. 2013 Dec 19;8(12):e84702. doi: 10.1371/journal.pone.0084702. eCollection 2013.
The National Institutes of Health Stroke Scale (NIHSS) is commonly used to measure neurologic function and guide treatment after spontaneous intracerebral hemorrhage (ICH) in routine stroke clinics. We evaluated its reliability and sensitivity to detect change with consecutive and unique rater combinations in a real-world setting.
Conservative measures of interrater reliability (unweighted Kappa (κ), Intraclass Correlation Coefficient (ICC1,1) and sensitivity to detect change (Minimal Detectable Difference (MDD)) were estimated. Sixty-one repeated ratings were completed within 1 week after ICH by physicians and nurses with no investigator intervention.
Reliability (consistency) of the NIHSS total score was good for both physicians vs. nurses and nurses vs. nurses (ICC=0.78, 95%CI: 0.58-0.89 and ICC=0.75, 95%CI: 0.55-0.87 respectively) in this scenario. Reliability (agreement) of items 1C and 9 were excellent (κ>=0.61) for both rater comparisons, however, reliability was poor to fair on most remaining items (κ:0.01-0.60), with item 11 being completely unreliable in this scenario (κ<0.01). The MDD95 of the total NIHSS score was ±10 and ±11 points for physician vs. nurse and nurse vs. nurse comparisons.
The reliability of the NIHSS is good overall for ICH even in an uncontrolled setting. However, on repeated measurements changes in total NIHSS score of at least >=10 points need to be observed for clinicians to be confident that real changes had occurred within 1 week after ICH.
国立卫生研究院卒中量表(NIHSS)常用于测量自发性脑出血(ICH)后常规卒中临床中的神经功能,并指导治疗。我们评估了其在真实环境中连续、独特的评估者组合下的可靠性和检测变化的敏感性。
评估了组内一致性的保守测量指标(未加权 Kappa(κ)、组内相关系数(ICC1,1)和检测变化的敏感性(最小可检测差异(MDD))。在 ICH 后 1 周内,医生和护士在没有研究者干预的情况下,重复进行了 61 次评分。
在这种情况下,NIHSS 总分的医生与护士之间(ICC=0.78,95%CI:0.58-0.89)和护士与护士之间(ICC=0.75,95%CI:0.55-0.87)的可靠性(一致性)均较好。对于这两种评估者比较,项目 1C 和 9 的可靠性(一致性)均为极好(κ≥0.61),但大多数其他项目的可靠性为差至中等(κ:0.01-0.60),项目 11 在这种情况下完全不可靠(κ<0.01)。NIHSS 总分的 MDD95 值为医师与护士比较和护士与护士比较时的±10 点和±11 点。
即使在不受控制的环境中,NIHSS 对 ICH 的可靠性总体上也较好。然而,对于临床医生来说,需要观察 NIHSS 总分至少≥10 点的重复测量变化,才能有信心在 ICH 后 1 周内发生了真正的变化。