• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

GPT-4在结直肠腺瘤组织病理学图像检测与分类中的准确性。

Accuracy of GPT-4 in histopathological image detection and classification of colorectal adenomas.

作者信息

Laohawetwanit Thiyaphat, Namboonlue Chutimon, Apornvirat Sompon

机构信息

Division of Pathology, Chulabhorn International College of Medicine, Thammasat University, Pathum Thani, Thailand

Division of Pathology, Thammasat University Hospital, Pathum Thani, Thailand.

出版信息

J Clin Pathol. 2025 Feb 18;78(3):202-207. doi: 10.1136/jcp-2023-209304.

DOI:10.1136/jcp-2023-209304
PMID:38199797
Abstract

AIMS

To evaluate the accuracy of Chat Generative Pre-trained Transformer (ChatGPT) powered by GPT-4 in histopathological image detection and classification of colorectal adenomas using the diagnostic consensus provided by pathologists as a reference standard.

METHODS

A study was conducted with 100 colorectal polyp photomicrographs, comprising an equal number of adenomas and non-adenomas, classified by two pathologists. These images were analysed by classic GPT-4 for 1 time in October 2023 and custom GPT-4 for 20 times in December 2023. GPT-4's responses were compared against the reference standard through statistical measures to evaluate its proficiency in histopathological diagnosis, with the pathologists further assessing the model's descriptive accuracy.

RESULTS

GPT-4 demonstrated a median sensitivity of 74% and specificity of 36% for adenoma detection. The median accuracy of polyp classification varied, ranging from 16% for non-specific changes to 36% for tubular adenomas. Its diagnostic consistency, indicated by low kappa values ranging from 0.06 to 0.11, suggested only poor to slight agreement. All of the microscopic descriptions corresponded with their diagnoses. GPT-4 also commented about the limitations in its diagnoses (eg, slide diagnosis best done by pathologists, the inadequacy of single-image diagnostic conclusions, the need for clinical data and a higher magnification view).

CONCLUSIONS

GPT-4 showed high sensitivity but low specificity in detecting adenomas and varied accuracy for polyp classification. However, its diagnostic consistency was low. This artificial intelligence tool acknowledged its diagnostic limitations, emphasising the need for a pathologist's expertise and additional clinical context.

摘要

目的

以病理学家提供的诊断共识为参考标准,评估由GPT-4驱动的聊天生成预训练变换器(ChatGPT)在结直肠腺瘤组织病理学图像检测和分类中的准确性。

方法

对100张结直肠息肉显微照片进行研究,其中腺瘤和非腺瘤数量相等,由两名病理学家进行分类。这些图像于2023年10月由经典GPT-4分析1次,于2023年12月由定制GPT-4分析20次。通过统计方法将GPT-4的回答与参考标准进行比较,以评估其在组织病理学诊断方面的熟练程度,病理学家进一步评估该模型的描述准确性。

结果

GPT-4在腺瘤检测中的中位敏感性为74%,特异性为36%。息肉分类的中位准确率各不相同,从非特异性改变的16%到管状腺瘤的36%不等。其诊断一致性较低,kappa值在0.06至0.11之间,表明一致性仅为差到一般。所有微观描述均与其诊断结果相符。GPT-4还对其诊断中的局限性发表了评论(例如,玻片诊断最好由病理学家完成,单图像诊断结论的不足,需要临床数据和更高放大倍数的视野)。

结论

GPT-4在腺瘤检测中显示出高敏感性,但特异性较低,息肉分类的准确率也各不相同。然而,其诊断一致性较低。这种人工智能工具认识到其诊断局限性,强调需要病理学家的专业知识和更多临床背景信息。

相似文献

1
Accuracy of GPT-4 in histopathological image detection and classification of colorectal adenomas.GPT-4在结直肠腺瘤组织病理学图像检测与分类中的准确性。
J Clin Pathol. 2025 Feb 18;78(3):202-207. doi: 10.1136/jcp-2023-209304.
2
Evaluating ChatGPT's diagnostic potential for pathology images.评估ChatGPT对病理图像的诊断潜力。
Front Med (Lausanne). 2025 Jan 23;11:1507203. doi: 10.3389/fmed.2024.1507203. eCollection 2024.
3
Evaluation of an Artificial Intelligence-Augmented Digital System for Histologic Classification of Colorectal Polyps.人工智能增强型数字系统用于结直肠息肉组织学分类的评估。
JAMA Netw Open. 2021 Nov 1;4(11):e2135271. doi: 10.1001/jamanetworkopen.2021.35271.
4
Accurate Classification of Diminutive Colorectal Polyps Using Computer-Aided Analysis.使用计算机辅助分析对微小结直肠息肉进行准确分类。
Gastroenterology. 2018 Feb;154(3):568-575. doi: 10.1053/j.gastro.2017.10.010. Epub 2017 Oct 16.
5
Narrow band imaging optical diagnosis of small colorectal polyps in routine clinical practice: the Detect Inspect Characterise Resect and Discard 2 (DISCARD 2) study.常规临床实践中小肠结肠息肉的窄带成像光学诊断:检测、检查、表征、切除与丢弃2(DISCARD 2)研究
Gut. 2017 May;66(5):887-895. doi: 10.1136/gutjnl-2015-310584. Epub 2016 Apr 19.
6
The "valley sign" in small and diminutive adenomas: prevalence, interobserver agreement, and validation as an adenoma marker.小而微小腺瘤中的“山谷征”:患病率、观察者间一致性以及作为腺瘤标志物的验证。
Gastrointest Endosc. 2017 Mar;85(3):614-621. doi: 10.1016/j.gie.2016.10.011. Epub 2016 Oct 15.
7
Application of an Automated Deep Learning Program to A Diagnostic Classification Model: Differentiating High-Risk Adenomas Among Colorectal Polyps 10 mm or Smaller.一种自动化深度学习程序在诊断分类模型中的应用:鉴别直径10毫米及以下结直肠息肉中的高危腺瘤
J Dig Dis. 2025 Jan-Feb;26(1-2):80-87. doi: 10.1111/1751-2980.13340. Epub 2025 Apr 2.
8
Reliability in the classification of advanced colorectal adenomas.晚期结直肠腺瘤分类的可靠性
Cancer Epidemiol Biomarkers Prev. 2002 Jul;11(7):660-3.
9
Development and validation of the SIMPLE endoscopic classification of diminutive and small colorectal polyps.发展和验证微小和小结直肠息肉的 SIMPLE 内镜分类。
Endoscopy. 2018 Aug;50(8):779-789. doi: 10.1055/s-0044-100791. Epub 2018 Mar 23.
10
Establishment and validation of an artificial intelligence-based model for real-time detection and classification of colorectal adenoma.基于人工智能的结直肠腺瘤实时检测与分类模型的建立与验证。
Sci Rep. 2024 May 10;14(1):10750. doi: 10.1038/s41598-024-61342-6.

引用本文的文献

1
ChatGPT Is Not Yet Ready to Replace Motility Experts.ChatGPT尚未准备好取代动力专家。
Clin Gastroenterol Hepatol. 2025 Jul 22. doi: 10.1016/j.cgh.2025.04.033.
2
Large Language Models in Medical Diagnostics: Scoping Review With Bibliometric Analysis.医学诊断中的大语言模型:基于文献计量分析的综述
J Med Internet Res. 2025 Jun 9;27:e72062. doi: 10.2196/72062.
3
Performance of Large Language Models (ChatGPT and Gemini Advanced) in Gastrointestinal Pathology and Clinical Review of Applications in Gastroenterology.大语言模型(ChatGPT和Gemini Advanced)在胃肠病理学及胃肠病学应用临床综述中的表现
Cureus. 2025 Apr 2;17(4):e81618. doi: 10.7759/cureus.81618. eCollection 2025 Apr.
4
Assessing the accuracy of the GPT-4 model in multidisciplinary tumor board decision prediction.评估GPT-4模型在多学科肿瘤病例讨论决策预测中的准确性。
Clin Transl Oncol. 2025 Mar 25. doi: 10.1007/s12094-025-03905-1.
5
Beyond the Surface: Assessing GPT-4's Accuracy in Detecting Melanoma and Suspicious Skin Lesions From Dermoscopic Images.透过表象:评估GPT-4从皮肤镜图像中检测黑色素瘤及可疑皮肤病变的准确性
Plast Surg (Oakv). 2025 Feb 18:22925503251315489. doi: 10.1177/22925503251315489.
6
Evaluating ChatGPT's diagnostic potential for pathology images.评估ChatGPT对病理图像的诊断潜力。
Front Med (Lausanne). 2025 Jan 23;11:1507203. doi: 10.3389/fmed.2024.1507203. eCollection 2024.
7
Applications of artificial intelligence in digital pathology for gastric cancer.人工智能在胃癌数字病理学中的应用。
Front Oncol. 2024 Oct 28;14:1437252. doi: 10.3389/fonc.2024.1437252. eCollection 2024.
8
Exploring the Potential of Code-Free Custom GPTs in Ophthalmology: An Early Analysis of GPT Store and User-Creator Guidance.探索免代码自定义生成式预训练变换器在眼科领域的潜力:对生成式预训练变换器商店及用户-创作者指南的早期分析
Ophthalmol Ther. 2024 Oct;13(10):2697-2713. doi: 10.1007/s40123-024-01014-w. Epub 2024 Aug 14.
9
Applications of Large Language Models in Pathology.大语言模型在病理学中的应用。
Bioengineering (Basel). 2024 Mar 31;11(4):342. doi: 10.3390/bioengineering11040342.