Terslev Lene, Gutierrez Marwin, Schmidt Wolfgang A, Keen Helen I, Filippucci Emilio, Kane David, Thiele Ralf, Kaeley Gurjit, Balint Peter, Mandl Peter, Delle Sedie Andrea, Hammer Hilde Berner, Christensen Robin, Möller Ingrid, Pineda Carlos, Kissin Eugene, Bruyn George A, Iagnocco Annamaria, Naredo Esperanza, D'Agostino Maria Antonietta
J Rheumatol. 2015 Nov;42(11):2177-81. doi: 10.3899/jrheum.141294. Epub 2015 Sep 1.
To summarize the work performed by the Outcome Measures in Rheumatology (OMERACT) Ultrasound (US) Working Group on the validation of US as a potential outcome measure in gout.
Based on the lack of definitions, highlighted in a recent literature review on US as an outcome tool in gout, a series of iterative exercises were carried out to obtain consensus-based definitions on US elementary components in gout using a Delphi exercise and subsequently testing these definitions in static images and in patients with proven gout. Cohen's κ was used to test agreement, and values of 0-0.20 were considered poor, 0.20-0.40 fair, 0.40-0.60 moderate, 0.60-0.80 good, and 0.80-1 excellent.
With an agreement of > 80%, consensus-based definitions were obtained for the 4 elementary lesions highlighted in the literature review: tophi, aggregates, erosions, and double contour (DC). In static images interobserver reliability ranged from moderate to almost perfect, and similar results were found for the intrareader reliability. In patients the intraobserver agreement was good for all lesions except DC (moderate). The interobserver agreement was poor for aggregates and DC but moderate for the other components.
These first steps in evaluating the validity of US as an outcome measure for gout show that the reliability of the definitions ranged from moderate to excellent in static images and somewhat lower in patients, indicating that a standardized scanning technique may be needed, before testing the responsiveness of those definitions in a composite US score.
总结风湿病结局评估(OMERACT)超声(US)工作组在验证超声作为痛风潜在结局评估指标方面所做的工作。
鉴于近期一篇关于超声作为痛风结局评估工具的文献综述中强调缺乏定义,开展了一系列迭代练习,通过德尔菲法就痛风中超声基本组成部分达成基于共识的定义,随后在静态图像和确诊痛风患者中对这些定义进行测试。采用科恩κ系数检验一致性,0至0.20为差,0.20至0.40为一般,0.40至0.60为中等,0.60至0.80为良好,0.80至1为优秀。
对于文献综述中突出的4种基本病变:痛风石、聚集物、侵蚀和双轮廓(DC),达成了>80%的一致性,获得了基于共识的定义。在静态图像中,观察者间可靠性从中等到几乎完美,观察者内可靠性也得到类似结果。在患者中,除DC外(中等),所有病变的观察者内一致性均良好。聚集物和DC的观察者间一致性较差,其他组成部分为中等。
这些评估超声作为痛风结局评估指标有效性的初步步骤表明,定义的可靠性在静态图像中从中等到优秀,在患者中略低,这表明在将这些定义用于综合超声评分测试其反应性之前,可能需要标准化的扫描技术。