Salomon L J, Nassar M, Bernard J P, Ville Y, Fauconnier A
Service de Gynécologie - Obstétrique, Hôpital Necker-Enfants Malades, AP-HP, Faculté de Médecine, Université Paris Descartes, Paris. France.
Eur J Obstet Gynecol Reprod Biol. 2009 Apr;143(2):116-20. doi: 10.1016/j.ejogrb.2008.12.003. Epub 2009 Feb 8.
The quality of ultrasound (US) examination in the setting of a gynaecological emergency consultation is variable. Our aims were to develop and evaluate the feasibility of an image-based scoring method for the evaluation of standardised gynaecological images in this context.
Ultrasound images of the uterus (2 planes), ovaries (1 plane of each side) and Morrison's pouch were obtained by specialist registrars as part of their assessment of gynaecological emergencies during a three-month period. Twenty images of each of these 5 planes were randomly selected and anonymised. These 100 images were analysed by two reviewers. Each image was scored according to 23 criteria yielding a maximum score of 23 points for the entire ultrasound examination. Training was then offered with special emphasis on quality criteria. Following training, another set of 100 images obtained by the same specialist registrars was scored. Scores before and after training were compared. Inter- and intra-reviewer reproducibility were analysed using intra-class correlation, adjusted Kappa and Bland-Altman plot.
The mean (+/-SD) scores were 10.22(+/-2.75) and 16.1(+/-3.35) before and after theoretical training respectively (p<10(-4)). The intra-class correlation coefficient and the mean difference in score were 0.973 [0.957;0.990] and 0.02 [-1.98;1.94], and 0.952 [0.894;1] and -0.1 [-2.44;2.24] for inter- and intra-reviewer reproducibility respectively. Kappa values were above 0.8 for all but 3 criteria.
A quality control policy for gynaecological emergencies based on image scoring is feasible and allows for good inter- and intra-reviewer reproducibility. A policy of this nature is likely to improve the quality of emergency gynaecological ultrasound examination.
在妇科急诊会诊中,超声(US)检查的质量参差不齐。我们的目的是开发并评估一种基于图像的评分方法在这种情况下评估标准化妇科图像的可行性。
子宫(2个平面)、卵巢(每侧1个平面)和肝肾隐窝的超声图像由专科住院医师获取,作为他们在三个月期间对妇科急诊评估的一部分。从这5个平面中每个平面随机选取20张图像并进行匿名处理。这100张图像由两名审阅者进行分析。每张图像根据23项标准进行评分,整个超声检查的最高分为23分。然后提供培训,特别强调质量标准。培训后,对同一专科住院医师获取的另一组100张图像进行评分。比较培训前后的分数。使用组内相关系数、调整后的Kappa值和Bland-Altman图分析审阅者间和审阅者内的可重复性。
理论培训前后的平均(±标准差)分数分别为10.22(±2.75)和16.1(±3.35)(p<10⁻⁴)。审阅者间和审阅者内可重复性的组内相关系数和分数平均差异分别为0.973 [0.957;0.990] 和0.02 [-1.98;1.94],以及0.952 [0.894;1] 和-0.1 [-2.44;2.24]。除3项标准外,所有标准的Kappa值均高于0.8。
基于图像评分的妇科急诊质量控制政策是可行的,并且在审阅者间和审阅者内具有良好的可重复性。这种性质的政策可能会提高妇科急诊超声检查的质量。