增强而非取代临床专业知识：通过混合真实与合成训练源提高结肠镜检查报告中的命名实体识别

Enhancing and Not Replacing Clinical Expertise: Improving Named-Entity Recognition in Colonoscopy Reports Through Mixed Real-Synthetic Training Sources.

作者信息

Ioanovici Andrei-Constantin, Feier Andrei-Marian, Mărușteri Marius-Ștefan, Trâmbițaș-Miron Alina-Dia, Dobru Daniela-Ecaterina

机构信息

Department M2-Complementary Functional Sciences, Medical Informatics and Biostatistics, George Emil Palade University of Medicine, Pharmacy, Science, and Technology of Targu Mures, 540142 Targu Mures, Romania.

Department M4-Clinical Sciences, Orthopedics and Traumatology I, George Emil Palade University of Medicine, Pharmacy, Science, and Technology of Targu Mures, 540139 Targu Mures, Romania.

出版信息

J Pers Med. 2025 Jul 30;15(8):334. doi: 10.3390/jpm15080334.

DOI:10.3390/jpm15080334

PMID:40863396

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12387308/

Abstract

: In routine practice, colonoscopy findings are saved as unstructured free text, limiting secondary use. Accurate named-entity recognition (NER) is essential to unlock these descriptions for quality monitoring, personalized medicine and research. We compared named-entity recognition (NER) models trained on real, synthetic, and mixed data to determine whether privacy preserving synthetic reports can boost clinical information extraction. : Three Spark NLP biLSTM CRF models were trained on (i) 100 manually annotated Romanian colonoscopy reports (ModelR), (ii) 100 prompt-generated synthetic reports (ModelS), and (iii) a 1:1 mix (ModelM). Performance was tested on 40 unseen reports (20 real, 20 synthetic) for seven entities. Micro-averaged precision, recall, and F1-score values were computed; McNemar tests with Bonferroni correction assessed pairwise differences. : ModelM outperformed single-source models (precision 0.95, recall 0.93, F1 0.94) and was significantly superior to ModelR (F1 0.70) and ModelS (F1 0.64; < 0.001 for both). ModelR maintained high accuracy on real text (F1 = 0.90), but its accuracy fell when tested on synthetic data (0.47); the reverse was observed for ModelS (F1 = 0.99 synthetic, 0.33 real). McNemar χ statistics (64.6 for ModelM vs. ModelR; 147.0 for ModelM vs. ModelS) greatly exceeded the Bonferroni-adjusted significance threshold (α = 0.0167), confirming that the observed performance gains were unlikely to be due to chance. : Synthetic colonoscopy descriptions are a valuable complement, but not a substitute for real annotations, while AI is helping human experts, not replacing them. Training on a balanced mix of real and synthetic data can help to obtain robust, generalizable NER models able to structure free-text colonoscopy reports, supporting large-scale, privacy-preserving colorectal cancer surveillance and personalized follow-up.

摘要

在常规实践中，结肠镜检查结果以非结构化的自由文本形式保存，限制了二次使用。准确的命名实体识别（NER）对于解锁这些描述以进行质量监测、个性化医疗和研究至关重要。我们比较了在真实数据、合成数据和混合数据上训练的命名实体识别（NER）模型，以确定隐私保护合成报告是否可以促进临床信息提取。：三个Spark NLP双向长短期记忆条件随机场（biLSTM CRF）模型分别在（i）100份人工注释的罗马尼亚结肠镜检查报告（模型R）、（ii）100份提示生成的合成报告（模型S）以及（iii）1:1混合数据（模型M）上进行训练。针对七个实体在40份未见报告（20份真实报告、20份合成报告）上测试性能。计算微观平均精度、召回率和F1分数值；采用Bonferroni校正的McNemar检验评估成对差异。：模型M优于单源模型（精度0.95，召回率0.93，F1 0.94），并且显著优于模型R（F1 0.70）和模型S（F1 0.64；两者均P<0.001）。模型R在真实文本上保持了较高的准确率（F1 = 0.90），但在合成数据上测试时准确率下降（0.47）；模型S则相反（合成数据F1 = 0.99，真实数据F1 = 0.33）。McNemar卡方统计量（模型M与模型R比较为64.6；模型M与模型S比较为147.0）大大超过了Bonferroni调整后的显著性阈值（α = 0.0167），证实观察到的性能提升不太可能是偶然因素导致的。：合成结肠镜检查描述是一种有价值的补充，但不能替代真实注释，同时人工智能是在帮助人类专家，而不是取代他们。在真实数据和合成数据的平衡混合上进行训练有助于获得强大的、可推广的NER模型，能够构建自由文本结肠镜检查报告，支持大规模的、隐私保护的结直肠癌监测和个性化随访。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c34/12387308/d9048e66347d/jpm-15-00334-g001.jpg

相似文献

Enhancing and Not Replacing Clinical Expertise: Improving Named-Entity Recognition in Colonoscopy Reports Through Mixed Real-Synthetic Training Sources.增强而非取代临床专业知识：通过混合真实与合成训练源提高结肠镜检查报告中的命名实体识别

J Pers Med. 2025 Jul 30;15(8):334. doi: 10.3390/jpm15080334.

Prescription of Controlled Substances: Benefits and Risks管制药品的处方：益处与风险

From BERT to generative AI - Comparing encoder-only vs. large language models in a cohort of lung cancer patients for named entity recognition in unstructured medical reports.从BERT到生成式人工智能——在一组肺癌患者中比较仅编码器模型与大语言模型用于非结构化医疗报告中的命名实体识别

Comput Biol Med. 2025 Sep;195:110665. doi: 10.1016/j.compbiomed.2025.110665. Epub 2025 Jun 24.

Dynamic taxonomy generation for future skills identification using a named entity recognition and relation extraction pipeline.使用命名实体识别和关系提取管道生成动态分类法以识别未来技能。

Front Artif Intell. 2025 Jul 2;8:1579998. doi: 10.3389/frai.2025.1579998. eCollection 2025.

Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。

Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.

Systemic treatments for metastatic cutaneous melanoma.转移性皮肤黑色素瘤的全身治疗

Cochrane Database Syst Rev. 2018 Feb 6;2(2):CD011123. doi: 10.1002/14651858.CD011123.pub2.

Comparison of cellulose, modified cellulose and synthetic membranes in the haemodialysis of patients with end-stage renal disease.纤维素、改性纤维素和合成膜在终末期肾病患者血液透析中的比较。

Cochrane Database Syst Rev. 2001(3):CD003234. doi: 10.1002/14651858.CD003234.

Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中，如果患者出现以下症状和体征，可判断其是否患有 COVID-19。

Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.

Home treatment for mental health problems: a systematic review.心理健康问题的居家治疗：一项系统综述

Health Technol Assess. 2001;5(15):1-139. doi: 10.3310/hta5150.

Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗：一项网状荟萃分析。

Cochrane Database Syst Rev. 2017 Dec 22;12(12):CD011535. doi: 10.1002/14651858.CD011535.pub2.

本文引用的文献

Cross-institutional dental electronic health record entity extraction via generative artificial intelligence and synthetic notes.通过生成式人工智能和合成笔记进行跨机构牙科电子健康记录实体提取

JAMIA Open. 2025 Jun 28;8(3):ooaf061. doi: 10.1093/jamiaopen/ooaf061. eCollection 2025 Jun.

Performance Analysis of Data Augmentation Approaches for Improving Wrist-Based Fall Detection System.用于改进基于手腕的跌倒检测系统的数据增强方法的性能分析

Sensors (Basel). 2025 Mar 29;25(7):2168. doi: 10.3390/s25072168.

Using Synthetic Health Care Data to Leverage Large Language Models for Named Entity Recognition: Development and Validation Study.利用合成医疗保健数据借助大语言模型进行命名实体识别：开发与验证研究。

J Med Internet Res. 2025 Mar 18;27:e66279. doi: 10.2196/66279.

Leveraging natural language processing to aggregate field safety notices of medical devices across the EU.利用自然语言处理技术汇总欧盟范围内的医疗器械现场安全通知。

NPJ Digit Med. 2024 Dec 4;7(1):352. doi: 10.1038/s41746-024-01337-9.

Is Colonoscopy Alone Adequate for Surveillance in Stage I Colorectal Cancer?单纯结肠镜检查对I期结直肠癌进行监测是否足够？

Cancer Res Treat. 2025 Apr;57(2):507-518. doi: 10.4143/crt.2024.526. Epub 2024 Oct 4.

INSAFEDARE Project: Innovative Applications of Assessment and Assurance of Data and Synthetic Data for Regulatory Decision Support.INSAFEDARE项目：用于监管决策支持的数据及合成数据评估与保障的创新应用

Stud Health Technol Inform. 2024 Aug 22;316:1193-1197. doi: 10.3233/SHTI240624.

Big data approach in the field of gastric and colorectal cancer research.大数据方法在胃癌和结直肠癌研究领域的应用。

J Gastroenterol Hepatol. 2024 Jun;39(6):1027-1032. doi: 10.1111/jgh.16527. Epub 2024 Feb 27.

Cancer statistics, 2024.2024年癌症统计数据。

CA Cancer J Clin. 2024 Jan-Feb;74(1):12-49. doi: 10.3322/caac.21820. Epub 2024 Jan 17.

Colorectal polyp classification and management of complex polyps for surgeon endoscopists.结直肠息肉分类和外科内镜医师处理复杂息肉。

Can J Surg. 2023 Sep 21;66(5):E491-E498. doi: 10.1503/cjs.011422. Print 2023 Sep-Oct.

Overview of the 2022 n2c2 shared task on contextualized medication event extraction in clinical notes.2022n2c2 临床笔记中语境化用药事件提取共享任务概述。

J Biomed Inform. 2023 Aug;144:104432. doi: 10.1016/j.jbi.2023.104432. Epub 2023 Jun 24.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

增强而非取代临床专业知识：通过混合真实与合成训练源提高结肠镜检查报告中的命名实体识别

Enhancing and Not Replacing Clinical Expertise: Improving Named-Entity Recognition in Colonoscopy Reports Through Mixed Real-Synthetic Training Sources.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

本文引用的文献