Accurate, Robust, and Scalable Machine Abstraction of Mayo Endoscopic Subscores From Colonoscopy Reports.

Author information

Silverman Anna L, Bhasuran Balu, Mosenia Arman, Yasini Fatema, Ramasamy Gokul, Banerjee Imon, Gupta Saransh, Mardirossian Taline, Narain Rohan, Sewell Justin, Butte Atul J, Rudrapatna Vivek A

Affiliations

Division of Gastroenterology and Hepatology, Department of Medicine, Mayo Clinic, Phoenix, AZ, USA.

Department of Medicine, University of California, San Diego, La Jolla, CA, USA.

Publication information

Inflamm Bowel Dis. 2025 Mar 3;31(3):665-670. doi: 10.1093/ibd/izae068.

DOI:10.1093/ibd/izae068

PMID:38533919

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11879245/

Abstract

BACKGROUND

The Mayo endoscopic subscore (MES) is an important quantitative measure of disease activity in ulcerative colitis. Colonoscopy reports in routine clinical care usually characterize ulcerative colitis disease activity using free text description, limiting their utility for clinical research and quality improvement. We sought to develop algorithms to classify colonoscopy reports according to their MES.

METHODS

We annotated 500 colonoscopy reports from 2 health systems. We trained and evaluated 4 classes of algorithms. Our primary outcome was accuracy in identifying scorable reports (binary) and assigning an MES (ordinal). Secondary outcomes included learning efficiency, generalizability, and fairness.

RESULTS

Automated machine learning models achieved 98% and 97% accuracy on the binary and ordinal prediction tasks, outperforming other models. Binary models trained on the University of California, San Francisco data alone maintained accuracy (96%) on validation data from Zuckerberg San Francisco General. When using 80% of the training data, models remained accurate for the binary task (97% [n = 320]) but lost accuracy on the ordinal task (67% [n = 194]). We found no evidence of bias by gender (P = .65) or area deprivation index (P = .80).

CONCLUSIONS

We derived a highly accurate pair of models capable of classifying reports by their MES and recognizing when to abstain from prediction. Our models were generalizable on outside institution validation. There was no evidence of algorithmic bias. Our methods have the potential to enable retrospective studies of treatment effectiveness, prospective identification of patients meeting study criteria, and quality improvement efforts in inflammatory bowel diseases.
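The "pair of models" described above can be read as a two-stage pipeline: a binary model decides whether a report is scorable, and an ordinal model assigns an MES (0 to 3) only when it is. The following is a minimal sketch of that control flow, not the authors' implementation. The names `Abstraction`, `abstract_report`, `toy_is_scorable`, and `toy_assign_mes` are hypothetical, and the keyword rules are illustrative placeholders for the trained models, which the abstract does not specify.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Abstraction:
    scorable: bool
    mes: Optional[int]  # None means the pipeline abstained


def abstract_report(
    text: str,
    is_scorable: Callable[[str], bool],
    assign_mes: Callable[[str], int],
) -> Abstraction:
    """Run the binary stage first; call the ordinal stage only if scorable."""
    if not is_scorable(text):
        return Abstraction(scorable=False, mes=None)
    score = assign_mes(text)
    if score not in (0, 1, 2, 3):
        raise ValueError(f"MES must be 0-3, got {score!r}")
    return Abstraction(scorable=True, mes=score)


# Hypothetical stand-ins for the two trained models (assumption: simple
# keyword rules, used here only so the sketch runs end to end).
_DESCRIPTORS = (
    "normal mucosa", "erythema", "vascular pattern",
    "friab", "erosion", "ulceration", "spontaneous bleeding",
)


def toy_is_scorable(text: str) -> bool:
    t = text.lower()
    return any(k in t for k in _DESCRIPTORS)


def toy_assign_mes(text: str) -> int:
    t = text.lower()
    if "ulceration" in t or "spontaneous bleeding" in t:
        return 3
    if "erosion" in t or "marked erythema" in t:
        return 2
    if "erythema" in t or "decreased vascular" in t:
        return 1
    return 0


if __name__ == "__main__":
    for report in (
        "Ulcerative colitis. Marked erythema and erosions in the sigmoid colon.",
        "Screening colonoscopy. Two polyps removed from the ascending colon.",
    ):
        print(abstract_report(report, toy_is_scorable, toy_assign_mes))
```

Keeping abstention as a separate first stage means the ordinal model never has to score a report that contains no disease activity description. In practice, the two callables would be replaced by whatever trained classifiers are available.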
