• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种使用自然语言处理提取结肠镜检查和病理学数据的透明且可适应的方法。

A Transparent and Adaptable Method to Extract Colonoscopy and Pathology Data Using Natural Language Processing.

机构信息

Division of Research, Kaiser Permanente Northern California, Oakland, CA, USA.

Department of Gastroenterology, Kaiser Permanente Northern California, Santa Clara, CA, USA.

出版信息

J Med Syst. 2020 Jul 31;44(9):151. doi: 10.1007/s10916-020-01604-8.

DOI:10.1007/s10916-020-01604-8
PMID:32737597
Abstract

Key variables recorded as text in colonoscopy and pathology reports have been extracted using natural language processing (NLP) tools that were not easily adaptable to new settings. We aimed to develop a reliable NLP tool with broad adaptability. During 1996-2016, Kaiser Permanente Northern California performed 401,566 colonoscopies with linked pathology. We randomly sampled 1000 linked reports into a Training Set and developed an NLP tool using SAS® PERL regular expressions. The NLP tool captured five colonoscopy and pathology variables: type, size, and location of polyps; extent of procedure; and quality of bowel preparation. We used a Validation Set (N = 3000) to confirm the variables' classifications using manual chart review as the reference. Performance of the NLP tool was assessed using the sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and Cohen's κ. Cohen's κ ranged from 93 to 99%. The sensitivity and specificity ranged from 95 to 100% across all categories. For categories with prevalence exceeding 10%, the PPV ranged from 97% to 100% except for adequate quality of preparation (prevalence 92%), for which the PPV was 65%. For categories with prevalence below 10%, the PPVs ranged from 62% to 100%. NPVs ranged from 94% to 100% except for the "complete" extent of procedure, for which the NPV was 73%. Using information from a large community-based population, we developed a transparent and adaptable NLP tool for extracting five colonoscopy and pathology variables. The tool can be readily tested in other healthcare settings.

摘要

结肠镜检查和病理报告中以文本形式记录的关键变量已使用自然语言处理 (NLP) 工具提取,这些工具不易适应新环境。我们旨在开发一种具有广泛适应性的可靠 NLP 工具。1996 年至 2016 年间,凯撒永久医疗集团北加州分部进行了 401566 例结肠镜检查,并与病理结果相关联。我们随机抽取 1000 份相关报告作为训练集,并使用 SAS®PERL 正则表达式开发了一种 NLP 工具。该 NLP 工具捕获了五个结肠镜检查和病理变量:息肉的类型、大小和位置;手术范围;以及肠道准备的质量。我们使用 3000 份验证集(N=3000)通过手动图表审查作为参考,确认变量的分类。使用灵敏度、特异性、阳性预测值 (PPV)、阴性预测值 (NPV) 和 Cohen's κ 评估 NLP 工具的性能。Cohen's κ 值范围为 93 至 99%。所有类别中,灵敏度和特异性范围为 95%至 100%。对于流行率超过 10%的类别,PPV 范围为 97%至 100%,除了肠道准备充分(流行率 92%)的情况,PPV 为 65%。对于流行率低于 10%的类别,PPV 范围为 62%至 100%。NPV 范围为 94%至 100%,除了“完全”手术范围,NPV 为 73%。使用来自大型基于社区的人群的信息,我们开发了一种透明且适应性强的 NLP 工具,用于提取五个结肠镜检查和病理变量。该工具可以在其他医疗保健环境中轻松进行测试。

相似文献

1
A Transparent and Adaptable Method to Extract Colonoscopy and Pathology Data Using Natural Language Processing.一种使用自然语言处理提取结肠镜检查和病理学数据的透明且可适应的方法。
J Med Syst. 2020 Jul 31;44(9):151. doi: 10.1007/s10916-020-01604-8.
2
Accurate Identification of Colonoscopy Quality and Polyp Findings Using Natural Language Processing.使用自然语言处理技术准确识别结肠镜检查质量和息肉发现。
J Clin Gastroenterol. 2019 Jan;53(1):e25-e30. doi: 10.1097/MCG.0000000000000929.
3
Developing a natural language processing application for measuring the quality of colonoscopy procedures.开发用于测量结肠镜检查程序质量的自然语言处理应用程序。
J Am Med Inform Assoc. 2011 Dec;18 Suppl 1(Suppl 1):i150-6. doi: 10.1136/amiajnl-2011-000431. Epub 2011 Sep 21.
4
Natural Language Processing for Assessing Quality Indicators in Free-Text Colonoscopy and Pathology Reports: Development and Usability Study.用于评估自由文本结肠镜检查和病理报告质量指标的自然语言处理:开发与可用性研究
JMIR Med Inform. 2022 Apr 15;10(4):e35257. doi: 10.2196/35257.
5
Application of optical character recognition with natural language processing for large-scale quality metric data extraction in colonoscopy reports.光学字符识别与自然语言处理在结肠镜报告中大规模质量度量数据提取的应用。
Gastrointest Endosc. 2021 Mar;93(3):750-757. doi: 10.1016/j.gie.2020.08.038. Epub 2020 Sep 3.
6
Natural Language Processing Accurately Calculates Adenoma and Sessile Serrated Polyp Detection Rates.自然语言处理准确计算腺瘤和无蒂锯齿状息肉的检出率。
Dig Dis Sci. 2018 Jul;63(7):1794-1800. doi: 10.1007/s10620-018-5078-4. Epub 2018 Apr 26.
7
The use of natural language processing to identify vaccine-related anaphylaxis at five health care systems in the Vaccine Safety Datalink.利用自然语言处理技术在疫苗安全数据链中的五个医疗系统中识别与疫苗相关的过敏反应。
Pharmacoepidemiol Drug Saf. 2020 Feb;29(2):182-188. doi: 10.1002/pds.4919. Epub 2019 Dec 3.
8
Multi-center colonoscopy quality measurement utilizing natural language processing.利用自然语言处理进行多中心结肠镜检查质量评估
Am J Gastroenterol. 2015 Apr;110(4):543-52. doi: 10.1038/ajg.2015.51. Epub 2015 Mar 10.
9
Natural language processing accurately categorizes findings from colonoscopy and pathology reports.自然语言处理能准确地对结肠镜检查和病理报告的结果进行分类。
Clin Gastroenterol Hepatol. 2013 Jun;11(6):689-94. doi: 10.1016/j.cgh.2012.11.035. Epub 2013 Jan 11.
10
Extracting data from electronic medical records: validation of a natural language processing program to assess prostate biopsy results.从电子病历中提取数据:评估前列腺活检结果的自然语言处理程序的验证
World J Urol. 2014 Feb;32(1):99-103. doi: 10.1007/s00345-013-1040-4. Epub 2013 Feb 17.

引用本文的文献

1
A Novel Ensemble Framework for Comprehensive Early-Stage Colorectal Cancer Diagnosis, Prognosis, and Treatment: Integration of Gastroenterology-Specific Transformer Language Models and Multiple Decision Trees.一种用于早期结直肠癌综合诊断、预后和治疗的新型集成框架:胃肠病学特定变压器语言模型与多个决策树的整合
J Clin Med. 2025 Jun 23;14(13):4467. doi: 10.3390/jcm14134467.
2
Large language models for extracting histopathologic diagnoses of colorectal cancer and dysplasia from electronic health records.用于从电子健康记录中提取结直肠癌和发育异常组织病理学诊断的大语言模型
medRxiv. 2025 Apr 22:2024.11.27.24318083. doi: 10.1101/2024.11.27.24318083.
3
Emerging applications of NLP and large language models in gastroenterology and hepatology: a systematic review.
自然语言处理和大语言模型在胃肠病学和肝病学中的新兴应用:一项系统综述
Front Med (Lausanne). 2025 Jan 22;11:1512824. doi: 10.3389/fmed.2024.1512824. eCollection 2024.
4
A foundation systematic review of natural language processing applied to gastroenterology & hepatology.一项关于应用于胃肠病学和肝病学的自然语言处理的基础系统评价。
BMC Gastroenterol. 2025 Feb 6;25(1):58. doi: 10.1186/s12876-025-03608-5.
5
Applications of natural language processing tools in the surgical journey.自然语言处理工具在手术过程中的应用。
Front Surg. 2024 May 17;11:1403540. doi: 10.3389/fsurg.2024.1403540. eCollection 2024.
6
Identification of pancreatic cancer risk factors from clinical notes using natural language processing.利用自然语言处理从临床记录中识别胰腺癌风险因素。
Pancreatology. 2024 Jun;24(4):572-578. doi: 10.1016/j.pan.2024.03.016. Epub 2024 Mar 26.
7
Natural Language Processing Can Automate Extraction of Barrett's Esophagus Endoscopy Quality Metrics.自然语言处理可自动提取巴雷特食管内镜检查质量指标。
medRxiv. 2023 Jul 13:2023.07.11.23292529. doi: 10.1101/2023.07.11.23292529.
8
Leveraging Natural Language Processing to Extract Features of Colorectal Polyps From Pathology Reports for Epidemiologic Study.利用自然语言处理技术从病理学报告中提取结直肠息肉特征用于流行病学研究。
JCO Clin Cancer Inform. 2023 Jan;7:e2200131. doi: 10.1200/CCI.22.00131.
9
Deep learning approach to detection of colonoscopic information from unstructured reports.深度学习方法从非结构化报告中检测结肠镜信息。
BMC Med Inform Decis Mak. 2023 Feb 7;23(1):28. doi: 10.1186/s12911-023-02121-7.
10
Automated extraction of information of lung cancer staging from unstructured reports of PET-CT interpretation: natural language processing with deep-learning.从 PET-CT 解读的非结构化报告中自动提取肺癌分期信息:基于深度学习的自然语言处理。
BMC Med Inform Decis Mak. 2022 Sep 1;22(1):229. doi: 10.1186/s12911-022-01975-7.