Suppr超能文献

自动识别和分配结直肠息肉患者的结肠镜监测建议。

Automated identification and assignment of colonoscopy surveillance recommendations for individuals with colorectal polyps.

机构信息

Department of Radiological Sciences, Data Integration, Architecture, and Analytics Group, University of California Los Angeles, Los Angeles, California, USA.

Department of Medicine, Vatche and Tamar Manoukian Division of Digestive Diseases, Department of Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, California, USA; UCLA Center for Cancer Prevention and Control Research, UCLA Kaiser Permanente Center for Health Equity and Department of Health Policy and Management, Fielding School of Public Health and Jonsson Comprehensive Cancer Center, Los Angeles, California, USA; Division of Gastroenterology, Department of Medicine, Veterans Affairs Greater Los Angeles Healthcare System, Los Angeles, California, USA.

出版信息

Gastrointest Endosc. 2021 Nov;94(5):978-987. doi: 10.1016/j.gie.2021.05.036. Epub 2021 Jun 1.

Abstract

BACKGROUND AND AIMS

Determining surveillance intervals for patients with colorectal polyps is critical but time-consuming and challenging to do reliably. We present the development and assessment of a pipeline that leverages natural language processing techniques to automatically extract and analyze relevant polyp findings from free-text colonoscopy and pathology reports. Using this information, we categorized individual patients into 6 postcolonoscopy surveillance intervals defined by the U.S. Multi-Society Task Force on Colorectal Cancer.

METHODS

Using a set of 546 randomly selected colonoscopy and pathology reports from 324 patients in a single health system, we used a combination of statistical classifiers and rule-based methods to extract polyp properties from each report type, associate properties with unique polyps, and classify a patient into 1 of 6 risk categories by integrating information from both report types. We then assessed the pipeline's performance by determining the positive predictive value (PPV), sensitivity, and F-score of the algorithm, compared with the determination of surveillance intervals by a gastroenterologist.

RESULTS

The pipeline was developed using 346 reports (224 colonoscopy and 122 pathology) from 224 patients and evaluated on an independent test set of 200 reports (100 colonoscopy and 100 pathology) from 100 patients. We achieved an average PPV, sensitivity, and F-score of .92, .95, and .93, respectively, across targeted entities for colonoscopy. Pathology extraction achieved a PPV, sensitivity, and F-score of .95, .97, and .96. The system achieved an overall accuracy of 92% in assigning the recommended interval for surveillance colonoscopy.

CONCLUSIONS

This study demonstrates the feasibility of using machine learning to automatically extract findings and classify patients to appropriate risk categories and corresponding surveillance intervals. Incorporating this system can facilitate proactive and timely follow-up after screening colonoscopy and enable real-time quality assessment of prevention programs and providers.

摘要

背景与目的

确定结直肠息肉患者的监测间隔至关重要,但要可靠地做到这一点既费时又具有挑战性。我们提出了一种利用自然语言处理技术从结肠镜检查和病理报告的自由文本中自动提取和分析相关息肉发现的管道的开发和评估。使用这些信息,我们将个体患者分为美国多学会大肠癌工作组定义的 6 个结肠镜检查后监测间隔。

方法

使用来自单一医疗系统的 324 名患者的 546 份随机选择的结肠镜检查和病理报告,我们使用统计分类器和基于规则的方法的组合,从每种报告类型中提取息肉属性,将属性与独特的息肉相关联,并通过整合两种报告类型的信息,将患者分类为 6 个风险类别之一。然后,我们通过确定算法的阳性预测值(PPV)、敏感性和 F 分数,与胃肠病学家确定监测间隔的结果进行比较,评估该管道的性能。

结果

该管道使用来自 224 名患者的 346 份报告(224 份结肠镜检查和 122 份病理报告)进行开发,并在来自 100 名患者的 200 份独立测试报告(100 份结肠镜检查和 100 份病理报告)上进行了评估。我们在针对结肠镜检查的目标实体上实现了平均 PPV、敏感性和 F 分数分别为.92、.95 和.93。病理学提取的 PPV、敏感性和 F 分数分别为.95、.97 和.96。该系统在分配推荐的监测结肠镜检查间隔方面的总体准确率为 92%。

结论

这项研究表明,使用机器学习自动提取发现并将患者分类为适当的风险类别和相应的监测间隔是可行的。纳入该系统可以促进筛查结肠镜检查后的主动和及时随访,并实现预防计划和提供者的实时质量评估。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验