Using Generative AI to Extract Structured Information from Free Text Pathology Reports.

Authors

Shahid Fahad, Hsu Min-Huei, Chang Yung-Chun, Jian Wen-Shan

Affiliations

Graduate Institute of Data Science, College of Management, Taipei Medical University, Taipei, Taiwan.

Department of Neurosurgery, Shuang-Ho Hospital-Taipei Medical University, Taipei, Taiwan.

Publication information

J Med Syst. 2025 Mar 13;49(1):36. doi: 10.1007/s10916-025-02167-2.

DOI: 10.1007/s10916-025-02167-2
PMID: 40080229
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11906504/

Abstract

Manually converting unstructured text pathology reports into structured pathology reports is very time-consuming and prone to errors. This study demonstrates the transformative potential of generative AI in automating the analysis of free-text pathology reports. Employing the ChatGPT Large Language Model within a Streamlit web application, we automated the extraction and structuring of information from 33 unstructured breast cancer pathology reports from Taipei Medical University Hospital. Achieving a 99.61% accuracy rate, the AI system notably reduced the processing time compared to traditional methods. This not only underscores the efficacy of AI in converting unstructured medical text into structured data but also highlights its potential to enhance the efficiency and reliability of medical text analysis. However, this study is limited to breast cancer pathology reports and was conducted using data obtained from hospitals associated with a single institution. In the future, we plan to expand the scope of this research to include pathology reports for other cancer types incrementally and conduct external validation to further substantiate the robustness and generalizability of the proposed system. Through this technological integration, we aimed to substantiate the capabilities of generative AI in improving both the speed and reliability of data processing. The outcomes of this study affirm that generative AI can significantly transform the handling of pathology reports, promising substantial advancements in biomedical research by facilitating the structured analysis of complex medical data.
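
The extraction-and-scoring loop described in the abstract can be sketched roughly as follows. This is only an illustrative sketch, not the authors' implementation: the field names in `FIELDS`, the prompt wording, and the helper names (`build_prompt`, `parse_reply`, `extract`, `field_accuracy`, `fake_llm`) are all assumptions, since the abstract does not state the schema, the prompt, or how accuracy was counted. The model call is replaced by a stand-in function so the example runs without network access. Scoring per field is likewise an assumption; a figure such as 99.61% over 33 reports cannot be a per-report score, so some finer unit was presumably counted.

```python
import json
from typing import Callable

# Hypothetical target fields; the abstract does not list the schema used in the study.
FIELDS = ["histologic_type", "tumor_size_cm", "er_status", "pr_status", "her2_status"]


def build_prompt(report_text: str) -> str:
    """Ask the model for one JSON object containing exactly the target fields."""
    return (
        "Extract the following fields from the breast cancer pathology report "
        "and answer with a single JSON object. Use null when a field is not stated.\n"
        f"Fields: {', '.join(FIELDS)}\n\nReport:\n{report_text}"
    )


def parse_reply(reply: str) -> dict:
    """Pull the JSON object out of the reply; unknown keys are dropped, missing ones become None."""
    empty = {name: None for name in FIELDS}
    start, end = reply.find("{"), reply.rfind("}")
    if start == -1 or end <= start:
        return empty
    try:
        data = json.loads(reply[start : end + 1])
    except json.JSONDecodeError:
        return empty
    return {name: data.get(name) for name in FIELDS}


def extract(report_text: str, llm: Callable[[str], str]) -> dict:
    """Run one report through the model and return a structured record."""
    return parse_reply(llm(build_prompt(report_text)))


def field_accuracy(predicted: list[dict], reference: list[dict]) -> float:
    """Share of (report, field) pairs where the extracted value equals the reference value."""
    total = correct = 0
    for pred, ref in zip(predicted, reference):
        for name in FIELDS:
            total += 1
            correct += pred.get(name) == ref.get(name)
    return correct / total if total else 0.0


def fake_llm(prompt: str) -> str:
    """Stand-in for a real chat-model call; returns a canned reply for demonstration."""
    return (
        'Here is the result: {"histologic_type": "invasive ductal carcinoma", '
        '"tumor_size_cm": 2.1, "er_status": "positive", '
        '"pr_status": "positive", "her2_status": "negative"}'
    )
```

In a real deployment, `fake_llm` would be swapped for a function that sends the prompt to the chosen model and returns its text reply, and the reference records would come from manually structured reports.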

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f7f2/11906504/06a0cc52150b/10916_2025_2167_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f7f2/11906504/e8fd1b946f3c/10916_2025_2167_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f7f2/11906504/bc4e3bbe6cb0/10916_2025_2167_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f7f2/11906504/84b480ba6198/10916_2025_2167_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f7f2/11906504/aa486d54c58e/10916_2025_2167_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f7f2/11906504/1f0fc6da2c27/10916_2025_2167_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f7f2/11906504/a1fdc2c938a8/10916_2025_2167_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f7f2/11906504/18a7db62aeff/10916_2025_2167_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f7f2/11906504/63342b0de356/10916_2025_2167_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f7f2/11906504/dddd61972d6d/10916_2025_2167_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f7f2/11906504/b75e891e2ec4/10916_2025_2167_Fig11_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f7f2/11906504/5deaedc6ea97/10916_2025_2167_Fig12_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f7f2/11906504/585e7d91c6ec/10916_2025_2167_Fig13_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f7f2/11906504/227a842afc0a/10916_2025_2167_Fig14_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f7f2/11906504/845dc9bf95a7/10916_2025_2167_Fig15_HTML.jpg
