Mohammed Vashist Nadia, Samaan Mark, Mosli Mahmoud H, Parker Claire E, MacDonald John K, Nelson Sigrid A, Zou G Y, Feagan Brian G, Khanna Reena, Jairath Vipul
PRA Health Sciences, 300-655 Tyee Road, Victoria, BC, Canada, V9A 6X5.
Cochrane Database Syst Rev. 2018 Jan 16;1(1):CD011450. doi: 10.1002/14651858.CD011450.pub2.
Endoscopic assessment of mucosal disease activity is routinely used to determine eligibility and response to therapy in clinical trials of ulcerative colitis. The operating properties of the existing endoscopic scoring indices are unclear.
A systematic review was undertaken to evaluate the development and operating characteristics of endoscopic scoring indices for the evaluation of ulcerative colitis.
We searched MEDLINE, Embase and CENTRAL from inception to 5 July 2016. We also searched references and conference proceedings (Digestive Disease Week, United European Gastroenterology Week, European Crohn's and Colitis Organization).
Any study design (e.g. randomized controlled trials, cohort studies, case series) that evaluated endoscopic indices for evaluation of ulcerative colitis disease activity were considered for inclusion. Eligible participants were adult patients (> 16 years), diagnosed with ulcerative colitis using conventional clinical, radiologic and endoscopic criteria.
Two authors independently reviewed the studies identified from the literature search. These authors also independently extracted and recorded data on the number of patients enrolled; number of patients per treatment arm; patient characteristics including age and gender distribution; endoscopic index; and outcomes such as reliability (intra-rater and inter-rater), validity (content, construct, criterion), responsiveness and feasibility. Any disagreements regarding study inclusion or data extraction were resolved by discussion and consensus with a third author. Risk of bias was assessed by determining whether assessors were blinded to clinical information and whether assessors scored the endoscopic index independently. We also assessed the methodological quality of the validation studies using the COSMIN checklist MAIN RESULTS: A total of 23 reports of 20 studies met the pre-defined inclusion criteria and were included in the review. Of the 20 included validation studies, 19 endoscopic scoring indices were assessed, including the Azzolini Classification, Baron Score, Blackstone Endoscopic Interpretation, Chinese Grading System of Ulcerative Colitis, Endoscopic Activty Index, Jeroen Score, Magnifying Colonoscopy Grade, Matts Score, Mayo Clinic Endoscopic Subscore, Modified Baron Score, Modified Mayo Clinic Endoscopic Subscore, Osada Score, Rachmilewtiz Endoscopic Score, St. Mark's Index, Ulcerative Colitis Colonoscopic Index of Serverity (UCCIS), endoscopic component of the Ulcerative Colitis Disease Activity Index (UCDAI), Ulcerative Colitis Endoscopic Index of Severity (UCEIS), Witts Sigmoidoscopic Score and Watson Grade. The individuals who performed the endoscopic scoring were blinded to clinical and/or histologic information in ten of the included studies, not blinded to clinical and/or histologic information in one of the included studies, and it was unclear whether blinding occurred in the remaining nine included studies. Independent observation was confirmed in four of the included studies, unclear in five of the included studies, and non-applicable (since inter-rater reliability was not assessed) in the remaining eleven included studies. The methodological quality (COSMIN checklist) of most of the included studies was rated as 'good' or 'excellent'. One study that assessed responsiveness was rated as 'fair'. The inter-rater reliability of nine endoscopic scoring indices including the Baron Score, Blackstone Endoscopic Interpretation, Endoscopic Activity Index, Matts Score, Mayo Clinic Endoscopic Subscore, Osada Score, UCCIS, UCEIS, Watson Grade was assessed in seven studies, with estimates of correlation, ƙ, ranging from 0.44 to 0.97. The iIntra-rater reliability of seven endoscopic scoring indices including the Baron Score, Blackstone Endoscopic Interpretation, Matts Score, Mayo Clinic Endoscopic Subscore, Osada Score, UCCIS and UCEIS was assessed in three studies, with estimates of correlation, ƙ, ranging from 0.41 to 0.86. No studies assessed content validity. Three studies evaluated the criterion validity of three endoscopic scoring indices including the Rachmilewitz Endoscopic Score, Magnifying Colonoscopy Grade and the UCCIS. These indices were correlated with objective markers of disease activity including albumin, blood leukocytes, C-reactive protein, fecal calprotectin, hemoglobin, mucosal interleukin-8 concentration and platelet count. Correlation estimates ranged from r = -0.19 to 0.83. Thirteen endoscopic scoring indices were tested for construct validity in 13 studies. Estimates of correlation between the endoscopic scoring indices and other measures of disease activity ranged from r = 0.27 to 0.93. Two studies explored the responsiveness of four endoscopic scoring indices including the Mayo Endoscopic Subscore, Modified Baron Score, Modified Mayo Endoscopic Subscore and UCEIS. One study concluded that the Modified Baron Score, Modified Mayo Endoscopic Subscore and UCEIS had similar responsiveness for detecting disease change in ulcerative colitis. The other included study concluded that the UCEIS may be the most accurate endoscopic scoring tool. None of the included studies formally assessed feasibility.
AUTHORS' CONCLUSIONS: While the UCEIS, UCCIS and Mayo Clinic Endoscopic Subscore have undergone extensive validation, none of these instruments have been fully validated and only two studies assessed responsiveness. Further research on the operating properties of these indices is needed given the lack of a fully-validated endoscopic scoring instrument for the evaluation of disease activity in ulcerative colitis.
在溃疡性结肠炎的临床试验中,内镜评估黏膜疾病活动度常用于确定治疗的适用性和反应情况。现有内镜评分指数的操作特性尚不清楚。
进行一项系统评价,以评估用于评估溃疡性结肠炎的内镜评分指数的发展情况和操作特征。
我们检索了从创刊至2016年7月5日的MEDLINE、Embase和CENTRAL数据库。我们还检索了参考文献和会议论文集(消化系统疾病周、欧洲胃肠病学联合周、欧洲克罗恩病和结肠炎组织)。
任何评估用于评估溃疡性结肠炎疾病活动度的内镜指数的研究设计(如随机对照试验、队列研究、病例系列)均考虑纳入。符合条件的参与者为成年患者(>16岁),根据传统临床、放射学和内镜标准诊断为溃疡性结肠炎。
两位作者独立审查从文献检索中识别出的研究。这些作者还独立提取并记录了以下数据:纳入患者数量;每个治疗组的患者数量;患者特征,包括年龄和性别分布;内镜指数;以及诸如可靠性(评分者内和评分者间)、有效性(内容、结构、标准)、反应性和可行性等结果。关于研究纳入或数据提取的任何分歧通过与第三位作者讨论并达成共识来解决。通过确定评估者是否对临床信息不知情以及评估者是否独立对内镜指数进行评分来评估偏倚风险。我们还使用COSMIN清单评估了验证研究的方法学质量。主要结果:共有20项研究的23份报告符合预先定义的纳入标准并纳入本评价。在纳入的20项验证研究中,评估了19种内镜评分指数,包括阿佐利尼分类、巴伦评分、布莱克斯通内镜解读、中国溃疡性结肠炎分级系统、内镜活动指数、杰罗恩评分、放大结肠镜分级、马茨评分、梅奥诊所内镜子评分、改良巴伦评分、改良梅奥诊所内镜子评分、小田评分、拉赫米列维茨内镜评分、圣马克指数、溃疡性结肠炎结肠镜严重程度指数(UCCIS)、溃疡性结肠炎疾病活动指数(UCDAI)的内镜部分、溃疡性结肠炎内镜严重程度指数(UCEIS)、维茨乙状结肠镜评分和沃森分级。在纳入的10项研究中,进行内镜评分的人员对临床和/或组织学信息不知情,在1项纳入研究中对临床和/或组织学信息知情,在其余9项纳入研究中是否知情尚不清楚。在4项纳入研究中确认了独立观察,在5项纳入研究中情况不明,在其余11项纳入研究中不适用(因为未评估评分者间可靠性)。大多数纳入研究的方法学质量(COSMIN清单)被评为“良好”或“优秀”。一项评估反应性的研究被评为“中等”。在7项研究中评估了9种内镜评分指数的评分者间可靠性,包括巴伦评分、布莱克斯通内镜解读、内镜活动指数、马茨评分、梅奥诊所内镜子评分、小田评分、UCCIS、UCEIS、沃森分级,相关性估计值κ范围为0.44至0.97。在3项研究中评估了7种内镜评分指数的评分者内可靠性,包括巴伦评分、布莱克斯通内镜解读、马茨评分、梅奥诊所内镜子评分、小田评分、UCCIS和UCEIS,相关性估计值κ范围为0.41至0.86。没有研究评估内容效度。3项研究评估了3种内镜评分指数的标准效度,包括拉赫米列维茨内镜评分、放大结肠镜分级和UCCIS。这些指数与疾病活动的客观标志物相关,包括白蛋白、血白细胞、C反应蛋白、粪便钙卫蛋白、血红蛋白、黏膜白细胞介素-8浓度和血小板计数。相关性估计值范围为r = -0.19至0.83。在13项研究中对13种内镜评分指数进行了结构效度测试。内镜评分指数与其他疾病活动度测量指标之间的相关性估计值范围为r = 0.27至0.93。2项研究探讨了4种内镜评分指数的反应性,包括梅奥内镜子评分、改良巴伦评分、改良梅奥内镜子评分和UCEIS。一项研究得出结论,改良巴伦评分、改良梅奥内镜子评分和UCEIS在检测溃疡性结肠炎疾病变化方面具有相似的反应性。另一项纳入研究得出结论,UCEIS可能是最准确的内镜评分工具。没有纳入研究正式评估可行性。
虽然UCEIS、UCCIS和梅奥诊所内镜子评分已经过广泛验证,但这些工具均未得到充分验证,只有两项研究评估了反应性。鉴于缺乏用于评估溃疡性结肠炎疾病活动度的经过充分验证的内镜评分工具,需要对这些指数的操作特性进行进一步研究。