Midrio Paola, van Rooij Iris A L M, Brisighelli Giulia, Garcia Aracelli, Fanjul Maria, Broens Paul, Iacobelli Barbara D, Giné Carlos, Lisi Gabriele, Sloots Cornelius E J, Fascetti Leon Francesco, Morandi Anna, van der Steeg Herjan, Giuliani Stefan, Grasshoff-Derr Sabine, Lacher Martin, de Blaauw Ivo, Jenetzky Ekkehart
Pediatric Surgery Unit, Cà Foncello Hospital, Treviso, Italy.
Department of Health Evidence, Radboud Institute for Health Sciences, Nijmegen, Netherlands.
Front Pediatr. 2020 Sep 18;8:571. doi: 10.3389/fped.2020.00571. eCollection 2020.
Male patients with anorectal malformations (ARM) are classified according to presence and level of the recto-urinary fistula. This is traditionally established by a preoperative high-pressure distal colostogram that may be variably interpreted by different surgeons. The aim of this study was to evaluate the inter- and intraobserver variation in the assessment by pediatric surgeons of preoperative colostograms with respect to the level of the recto-urinary fistula. Sixteen pediatric surgeons from 14 European centers belonging to the ARM-Net Consortium twice scored 130 images of distal colostograms taken in sagittal projection at a median age of 66 days of life (range: 4-1,106 days). Surgeons were asked to classify the fistula in bulbar, prostatic, bladder-neck, no fistula, and "unclear anatomy" example. Their assessments were compared with the intraoperative findings (kappa) for two scoring rounds with an interval of 6 months (intraobserver variation). Agreement among the surgeons' scores (interobserver variation) was also calculated using Krippendorff's alpha. A kappa over 0.75 is considered excellent, between 0.40 and 0.75 fair to good, and below 0.40 poor. Surgeons were asked to score the images in "poor" and "good" quality and to provide their years of experience in ARM treatment. Agreement between the image-based rating of surgeons and the intraoperative findings ranges from 0.06 to 0.45 (mean 0.31). Interobserver variation is higher (Krippendorff's alpha between 0.40 and 0.45). Years of experience in ARM treatment does not seem to influence the scoring. The mean intraobserver variation between the two rounds is 0.64. Overall, the quality of the images is considered poor. Images categorized as having a good quality result in a statistically significant higher kappa (mean: 0.36 and 0.37 in the first and second round, respectively) than in the group of bad-quality images (mean: 0.25 and 0.23, respectively). There is poor agreement among experienced pediatric colorectal surgeons on preoperative colostograms. Techniques and analyses of images need to be improved in order to generate a homogeneous series of patients and make comparison of outcomes reliable.
患有肛门直肠畸形(ARM)的男性患者根据直肠尿道瘘的有无及位置进行分类。传统上这是通过术前高压远端结肠造影来确定的,不同外科医生对此的解读可能存在差异。本研究的目的是评估小儿外科医生对术前结肠造影中直肠尿道瘘位置评估的观察者间及观察者内差异。来自ARM-Net联盟14个欧洲中心的16名小儿外科医生对130张矢状位远端结肠造影图像进行了两次评分,这些图像拍摄时患者的中位年龄为66天(范围:4 - 1106天)。要求外科医生将瘘管分为球部、前列腺部、膀胱颈部、无瘘管以及“解剖结构不清晰”等类型。将他们的评估结果与术中发现进行比较(kappa值),分两轮评分,间隔6个月(观察者内差异)。还使用Krippendorff's alpha计算外科医生评分之间的一致性(观察者间差异)。kappa值大于0.75被认为是优秀,在0.40至0.75之间为中等至良好,低于0.40为较差。要求外科医生对图像质量进行“差”和“好”的评分,并提供他们在ARM治疗方面的经验年限。外科医生基于图像的评级与术中发现之间的一致性范围为0.06至0.45(平均0.31)。观察者间差异更高(Krippendorff's alpha在0.40至0.45之间)。ARM治疗的经验年限似乎并不影响评分。两轮之间的平均观察者内差异为0.64。总体而言,图像质量被认为较差。分类为高质量的图像在统计学上kappa值显著更高(第一轮和第二轮分别平均为0.36和0.37),高于低质量图像组(分别平均为0.25和0.23)。经验丰富的小儿结直肠外科医生对术前结肠造影的一致性较差。图像的技术和分析需要改进,以便生成同质化的患者系列并使结果比较可靠。