Näther Philipp, Kersten Jan Felix, Schablon Anja, Nienhaus Albert
BG Klinikum Bergmannstrost Halle (Saale) gGmbH, Halle (Saale), Germany.
Asklepios Klinik Nord Heidberg, Hamburg, Germany.
J Occup Med Toxicol. 2025 Sep 16;20(1):30. doi: 10.1186/s12995-025-00479-0.
Currently, the most widely used method to determine lumbar intervertebral disc degeneration is MRI. However, the evaluation of imaging signs of disc degeneration involves several subjective assessments. The aim of this study was to investigate differences in radiological assessments between two independent reports of the same MRI, emphasizing that the identical images were assessed twice by different raters.
MRI of the lumbar and cervical spine of a population-based random sample of women and a sample of female nurses, geriatric nurses and care workers as a subgroup with a relatively high level of work-related stress on the lumbar spine was performed. Each MRI was then assessed by two radiologists from the corresponding clinic that had examined the participant. Ten criteria were assessed: three continuous and seven categorical parameters. Agreement was assessed with bias and dispersion figures or agreement and Cohen's kappa for categorical parameters.
Double diagnosis of 318 participants with available MR images of the cervical and lumbar spine were performed. The results show that there is remarkable consensus on some parameters as well as substantial disagreement on others-the agreement of the two reports for categorical parameters, as measured by Cohen's kappa, ranges from 0.04 to 0.57. For continuous measurements, the percentage difference ranges from 8 to 24%; it depends on the extent of the subjectivity of the parameter to be rated.
The interrater reliability of MRI readings of the lumbar spine is greater when clearly defined parameters and measurement methods are used. Therefore, it should be investigated which easy to use rating scales can be implemented in daily clinical practice to make reports more reliable and useful for clinicians. One way to reduce subjectivity might be the use of reference images.
目前,确定腰椎间盘退变最广泛使用的方法是磁共振成像(MRI)。然而,椎间盘退变影像学征象的评估涉及多项主观评价。本研究的目的是调查同一MRI的两份独立报告之间的放射学评估差异,强调相同图像由不同评估者进行了两次评估。
对基于人群的女性随机样本以及作为腰椎工作相关压力水平相对较高的亚组的女性护士、老年护理护士和护理人员样本进行了腰椎和颈椎的MRI检查。然后,由检查参与者的相应诊所的两名放射科医生对每个MRI进行评估。评估了十个标准:三个连续参数和七个分类参数。使用偏差和离散度数值或分类参数的一致性和科恩kappa系数评估一致性。
对318名有颈椎和腰椎可用MRI图像的参与者进行了双重诊断。结果表明,在一些参数上存在显著共识,而在其他参数上则存在重大分歧——用科恩kappa系数衡量,两份报告对分类参数的一致性范围为0.04至0.57。对于连续测量,百分比差异范围为8%至24%;这取决于要评估参数的主观程度。
当使用明确界定的参数和测量方法时,腰椎MRI读数的评估者间可靠性更高。因此,应该研究在日常临床实践中可以实施哪些易于使用的评分量表,以使报告对临床医生更可靠且有用。减少主观性的一种方法可能是使用参考图像。