Research Department, Spine Centre of Southern Denmark, Hospital Lillebaelt, Oestre Hougvej 55, 5500 Middelfart, Denmark.
Eur Spine J. 2011 Dec;20(12):2166-73. doi: 10.1007/s00586-011-1911-6. Epub 2011 Jul 19.
The STarT Back Screening Tool (STarT) is a nine-item patient self-report questionnaire that classifies low back pain patients into low, medium or high risk of poor prognosis. When assessed by GPs, these subgroups can be used to triage patients into different evidence-based treatment pathways. The objective of this study was to translate the English version of STarT into Danish (STarT-dk) and test its discriminative validity.
Translation was performed using methods recommended by best practice translation guidelines. Psychometric validation of the discriminative ability was performed using the Area Under the Curve statistic. The Area Under the Curve was calculated for seven of the nine items where reference standards were available and compared with the original English version.
The linguistic translation required minor semantic and layout alterations. The response options were changed from "agree/disagree" to "yes/no" for four items. No patients reported item ambiguity using the final version. The Area Under the Curve ranged from 0.735 to 0.855 (CI95% 0.678-0.897) in a Danish cohort (n = 311) and 0.840 to 0.925 (CI95% 0.772-0.948) in the original English cohort (n = 500). On four items, the Area Under the Curve was statistically similar between the two cohorts but lower on three psychosocial sub-score items.
The translation was linguistically accurate and the discriminative validity broadly similar, with some differences probably due to differences in severity between the cohorts and the Danish reference standard questionnaires not having been validated. Despite those differences, we believe the results show that the STarT-dk has sufficient patient acceptability and discriminative validity to be used in Denmark.
STarT 背部筛查工具(STarT)是一个由九项患者自我报告问卷组成的工具,用于将腰痛患者分为预后不良的低、中、高风险人群。当由全科医生评估时,这些亚组可用于将患者分诊到不同的基于证据的治疗途径中。本研究的目的是将 STarT 的英文版本翻译成丹麦语(STarT-dk)并测试其判别效度。
使用最佳实践翻译指南推荐的方法进行翻译。使用曲线下面积(AUC)统计量来评估判别能力的心理测量学验证。对于有参考标准的九个项目中的七个项目进行了 AUC 计算,并与原始英语版本进行了比较。
语言翻译需要进行细微的语义和布局调整。对于四个项目,将回答选项从“同意/不同意”更改为“是/否”。没有患者使用最终版本报告项目模糊性。在丹麦队列(n=311)中,AUC 范围为 0.735 至 0.855(95%CI 0.678-0.897),在原始英语队列(n=500)中,AUC 范围为 0.840 至 0.925(95%CI 0.772-0.948)。在四个项目中,两个队列之间的 AUC 具有统计学相似性,但在三个心理社会子评分项目上较低。
翻译在语言上是准确的,判别效度大致相似,有些差异可能是由于队列之间的严重程度差异以及丹麦参考标准问卷未经过验证所致。尽管存在这些差异,但我们认为结果表明 STarT-dk 具有足够的患者可接受性和判别效度,可在丹麦使用。