• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用GPT-4解读口腔黏膜疾病照片以生成结构化报告。

Utilizing GPT-4 to interpret oral mucosal disease photographs for structured report generation.

作者信息

Zhan Zheng-Zhe, Xiong Yu-Tao, Wang Chen-Yuan, Zhang Bao-Tian, Lian Wen-Jun, Zeng Yu-Min, Liu Wei, Tang Wei, Liu Chang

机构信息

State Key Laboratory of Oral Diseases, National Center for Stomatology, National Clinical Research Center for Oral Diseases, Department of Oral and Maxillofacial Surgery, West China Hospital of Stomatology, Sichuan University, Chengdu, 610041, China.

出版信息

Sci Rep. 2025 Feb 12;15(1):5187. doi: 10.1038/s41598-025-89328-y.

DOI:10.1038/s41598-025-89328-y
PMID:39939809
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11821891/
Abstract

The aim of this study is to evaluate GPT-4's reasoning ability to interpret oral mucosal disease photos and generate structured reports from free-text inputs, while exploring the role of prompt engineering in enhancing its performance. Prompt received by utilizing automatic prompt engineering and knowledge of oral physicians, was provided to GPT-4 for generating structured reports based on cases of oral mucosal disease. The structured reports included 7 fine-grained items: "location", "shape", "number", "size", "clinical manifestation", "the border of the lesion" and "diagnosis". 120 cases were used for testing, which were divided into two datasets, textbook dataset and internet dataset. Oral physicians evaluated GPT-4's responses by confusion matrices, receiving recall and accuracy. ANOVA and Wald χ2 tests with Bonferroni correction were used to statistical analysis. A total of 120 cases of oral mucosal diseases were included, encompassing the following two datasets: textbook dataset (n = 60), internet dataset (n = 60). GPT-4 had higher recall with the textbook dataset compared to the internet dataset (90.73% vs 89.12%; P = .462, χ = 0.54) and higher accuracy (87.05% vs 84.87%; P = .393, χ = 0.73). Performance varied by items of structured reports within each dataset, with "size" achieving the highest accuracy in the textbook dataset (98.90%) and "the border the lesion" in the internet dataset (95.00%). GPT-4 can transform incomplete descriptive text corresponding to oral mucosal disease photographs into structured reports with the assistance of carefully designed prompts. This study highlights GPT-4's potential in complex and multimodal medical tasks and underscores the importance of prompt engineering in optimizing its capabilities. Nevertheless, achieving further improvements in the model may require more comprehensive and focused efforts. This article demonstrated the capabilities of large multimodal models, represented by GPT-4, in medical photographs interpretation and medical report generation. GPT-4 was capable of recognizing photographs of oral mucosal diseases and generating structured reports, which facilitates telemedicine and peer-to-peer communication.

摘要

本研究的目的是评估GPT-4解读口腔黏膜疾病照片并从自由文本输入生成结构化报告的推理能力,同时探索提示工程在提高其性能方面的作用。利用自动提示工程和口腔医生的知识生成的提示被提供给GPT-4,以根据口腔黏膜疾病病例生成结构化报告。结构化报告包括7个细粒度项目:“位置”、“形状”、“数量”、“大小”、“临床表现”、“病变边界”和“诊断”。120个病例用于测试,分为两个数据集,教科书数据集和互联网数据集。口腔医生通过混淆矩阵、召回率和准确率评估GPT-4的回答。采用方差分析和经Bonferroni校正的Wald χ2检验进行统计分析。总共纳入了120例口腔黏膜疾病病例,包括以下两个数据集:教科书数据集(n = 60),互联网数据集(n = 60)。与互联网数据集相比,GPT-4在教科书数据集上具有更高的召回率(90.73%对89.12%;P = 0.462,χ = 0.54)和更高的准确率(87.05%对84.87%;P = 0.393,χ = 0.73)。每个数据集内结构化报告的项目性能各不相同,“大小”在教科书数据集中准确率最高(98.90%),“病变边界”在互联网数据集中准确率最高(95.00%)。GPT-4可以在精心设计的提示的帮助下,将与口腔黏膜疾病照片对应的不完整描述性文本转换为结构化报告。本研究突出了GPT-4在复杂多模态医学任务中的潜力,并强调了提示工程在优化其能力方面的重要性。然而,要在模型上实现进一步改进可能需要更全面和有针对性的努力。本文展示了以GPT-4为代表的大型多模态模型在医学照片解读和医学报告生成方面的能力。GPT-4能够识别口腔黏膜疾病照片并生成结构化报告,这有助于远程医疗和点对点交流。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a35/11821891/cf0e6f49ac8b/41598_2025_89328_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a35/11821891/842d21ecb7d5/41598_2025_89328_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a35/11821891/1933ca2f465e/41598_2025_89328_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a35/11821891/1b85d079e027/41598_2025_89328_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a35/11821891/f5d4f61fb872/41598_2025_89328_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a35/11821891/ea33a5bd6f77/41598_2025_89328_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a35/11821891/cf0e6f49ac8b/41598_2025_89328_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a35/11821891/842d21ecb7d5/41598_2025_89328_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a35/11821891/1933ca2f465e/41598_2025_89328_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a35/11821891/1b85d079e027/41598_2025_89328_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a35/11821891/f5d4f61fb872/41598_2025_89328_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a35/11821891/ea33a5bd6f77/41598_2025_89328_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a35/11821891/cf0e6f49ac8b/41598_2025_89328_Fig6_HTML.jpg

相似文献

1
Utilizing GPT-4 to interpret oral mucosal disease photographs for structured report generation.利用GPT-4解读口腔黏膜疾病照片以生成结构化报告。
Sci Rep. 2025 Feb 12;15(1):5187. doi: 10.1038/s41598-025-89328-y.
2
Multimodal Performance of GPT-4 in Complex Ophthalmology Cases.GPT-4在复杂眼科病例中的多模态表现。
J Pers Med. 2025 Apr 21;15(4):160. doi: 10.3390/jpm15040160.
3
Improving large language models for clinical named entity recognition via prompt engineering.通过提示工程改进临床命名实体识别的大型语言模型。
J Am Med Inform Assoc. 2024 Sep 1;31(9):1812-1820. doi: 10.1093/jamia/ocad259.
4
Assessing readability of explanations and reliability of answers by GPT-3.5 and GPT-4 in non-traumatic spinal cord injury education.评估GPT-3.5和GPT-4在非创伤性脊髓损伤教育中所提供解释的可读性及答案的可靠性。
Med Teach. 2025 Jan 20:1-8. doi: 10.1080/0142159X.2024.2430365.
5
Evaluating ChatGPT-4's Accuracy in Identifying Final Diagnoses Within Differential Diagnoses Compared With Those of Physicians: Experimental Study for Diagnostic Cases.评估ChatGPT-4在鉴别诊断中识别最终诊断的准确性与医生的准确性比较:诊断病例的实验研究
JMIR Form Res. 2024 Jun 26;8:e59267. doi: 10.2196/59267.
6
Assessing GPT-4's Performance in Delivering Medical Advice: Comparative Analysis With Human Experts.评估 GPT-4 提供医疗建议的表现:与人类专家的比较分析。
JMIR Med Educ. 2024 Jul 8;10:e51282. doi: 10.2196/51282.
7
Comparative analysis of GPT-4-based ChatGPT's diagnostic performance with radiologists using real-world radiology reports of brain tumors.基于GPT-4的ChatGPT与放射科医生在使用脑肿瘤真实世界放射学报告方面的诊断性能比较分析。
Eur Radiol. 2025 Apr;35(4):1938-1947. doi: 10.1007/s00330-024-11032-8. Epub 2024 Aug 28.
8
Diagnostic accuracy of large language models in psychiatry.精神科大语言模型的诊断准确性。
Asian J Psychiatr. 2024 Oct;100:104168. doi: 10.1016/j.ajp.2024.104168. Epub 2024 Jul 25.
9
Can large language models replace humans in systematic reviews? Evaluating GPT-4's efficacy in screening and extracting data from peer-reviewed and grey literature in multiple languages.大型语言模型能否在系统评价中取代人类?评估 GPT-4 从多种语言的同行评议文献和灰色文献中进行筛选和提取数据的效果。
Res Synth Methods. 2024 Jul;15(4):616-626. doi: 10.1002/jrsm.1715. Epub 2024 Mar 14.
10
The Rapid Development of Artificial Intelligence: GPT-4's Performance on Orthopedic Surgery Board Questions.人工智能的快速发展:GPT-4 在骨科手术委员会问题上的表现。
Orthopedics. 2024 Mar-Apr;47(2):e85-e89. doi: 10.3928/01477447-20230922-05. Epub 2023 Sep 27.

引用本文的文献

1
Application of image enhancement in the auxiliary diagnosis of oral potentially malignant disorders.图像增强在口腔潜在恶性疾病辅助诊断中的应用。
Clin Oral Investig. 2025 Apr 25;29(5):270. doi: 10.1007/s00784-025-06357-7.

本文引用的文献

1
Performance of ChatGPT 3.5 and 4 on U.S. dental examinations: the INBDE, ADAT, and DAT.ChatGPT 3.5和4在美国牙科考试中的表现:国际牙科执照考试(INBDE)、高级牙科能力倾向测试(ADAT)和牙科入学考试(DAT)
Imaging Sci Dent. 2024 Sep;54(3):271-275. doi: 10.5624/isd.20240037. Epub 2024 Jul 2.
2
GPT-4 Vision: Multi-Modal Evolution of ChatGPT and Potential Role in Radiology.GPT-4视觉:ChatGPT的多模态演进及其在放射学中的潜在作用。
Cureus. 2024 Aug 31;16(8):e68298. doi: 10.7759/cureus.68298. eCollection 2024 Aug.
3
Hidden flaws behind expert-level accuracy of multimodal GPT-4 vision in medicine.
医学领域多模态GPT-4视觉专家级准确性背后的隐藏缺陷。
NPJ Digit Med. 2024 Jul 23;7(1):190. doi: 10.1038/s41746-024-01185-7.
4
Oral mucosal lesions triage via YOLOv7 models.通过YOLOv7模型进行口腔黏膜病变分类
J Formos Med Assoc. 2024 Jul 12. doi: 10.1016/j.jfma.2024.07.010.
5
Early evaluation of a natural language processing tool to improve access to educational resources for surgical patients.早期评估自然语言处理工具,以改善手术患者获取教育资源的途径。
Eur Spine J. 2024 Jul;33(7):2545-2552. doi: 10.1007/s00586-024-08315-5. Epub 2024 May 30.
6
Potential of GPT-4 for Detecting Errors in Radiology Reports: Implications for Reporting Accuracy.GPT-4 在检测放射科报告错误方面的潜力:对报告准确性的影响。
Radiology. 2024 Apr;311(1):e232714. doi: 10.1148/radiol.232714.
7
Oral mucosal disease recognition based on dynamic self-attention and feature discriminant loss.基于动态自注意力和特征判别损失的口腔黏膜疾病识别。
Oral Dis. 2024 Jul;30(5):3094-3107. doi: 10.1111/odi.14732. Epub 2023 Sep 20.
8
How much can we rely on artificial intelligence chatbots such as the ChatGPT software program to assist with scientific writing?我们能在多大程度上依靠诸如ChatGPT软件程序这样的人工智能聊天机器人来辅助科学写作?
J Prosthet Dent. 2025 Apr;133(4):1082-1088. doi: 10.1016/j.prosdent.2023.05.023. Epub 2023 Jul 10.
9
Leveraging GPT-4 for Post Hoc Transformation of Free-text Radiology Reports into Structured Reporting: A Multilingual Feasibility Study.利用GPT-4将自由文本放射学报告进行事后转换为结构化报告:一项多语言可行性研究。
Radiology. 2023 May;307(4):e230725. doi: 10.1148/radiol.230725. Epub 2023 Apr 4.
10
Multimodal biomedical AI.多模态生物医学人工智能。
Nat Med. 2022 Sep;28(9):1773-1784. doi: 10.1038/s41591-022-01981-2. Epub 2022 Sep 15.