Exploring ChatGPT's potential in diagnosing oral and maxillofacial pathologies: a study of 123 challenging cases.

Author information

Tassoker Melek

Affiliation

Department of Dentomaxillofacial Radiology, Faculty of Dentistry, Necmettin Erbakan University, Baglarbasi sk, Meram, Konya, 42050, Türkiye.

Publication information

BMC Oral Health. 2025 Jul 17;25(1):1187. doi: 10.1186/s12903-025-06444-x.

DOI:10.1186/s12903-025-06444-x

PMID:40676533

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC12272973/

Abstract

OBJECTIVE

This study aimed to evaluate the diagnostic performance of ChatGPT-4o, a large language model developed by OpenAI, on challenging cases of oral and maxillofacial diseases presented in a dedicated section of the journal [journal name].

MATERIALS AND METHODS

A total of 123 diagnostically challenging oral and maxillofacial cases published in the aforementioned journal were retrospectively collected. The case presentations, which included detailed clinical, radiographic, and sometimes histopathologic descriptions, were input into ChatGPT-4o. The model was prompted to provide a single most likely diagnosis for each case. These outputs were then compared to the final diagnoses established by expert consensus in each original case report. The accuracy of ChatGPT-4o was calculated based on exact diagnostic matches.

RESULTS

ChatGPT-4o correctly diagnosed 96 of 123 cases, an overall diagnostic accuracy of 78%. In the remaining 27 cases, where it did not give the exact diagnosis, the model often suggested one of the clinically reasonable differential diagnoses.
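
The exact-match scoring described above can be illustrated with a minimal Python sketch. It is not the study's actual procedure: the abstract does not say how matches were judged, so the `normalize` helper (case and whitespace folding) and the function name `exact_match_accuracy` are assumptions made for illustration, and the diagnoses in the usage lines are hypothetical placeholders rather than cases from the study. Only the counts 96 and 123 come from the abstract.

```python
from typing import Sequence, Tuple


def normalize(diagnosis: str) -> str:
    """Lowercase and collapse whitespace (an assumed, simplistic normalization)."""
    return " ".join(diagnosis.lower().split())


def exact_match_accuracy(
    predicted: Sequence[str], reference: Sequence[str]
) -> Tuple[int, float]:
    """Return (number of exact matches, accuracy) for paired diagnoses."""
    if len(predicted) != len(reference):
        raise ValueError("predicted and reference must have the same length")
    if not reference:
        raise ValueError("at least one case is required")
    matches = sum(
        normalize(p) == normalize(r) for p, r in zip(predicted, reference)
    )
    return matches, matches / len(reference)


# Hypothetical placeholder cases, not data from the study.
model_output = ["Ameloblastoma", "odontogenic  keratocyst", "Osteosarcoma"]
final_diagnosis = ["ameloblastoma", "Odontogenic keratocyst", "Chondrosarcoma"]
matches, accuracy = exact_match_accuracy(model_output, final_diagnosis)
print(f"{matches}/{len(final_diagnosis)} exact matches")

# Counts reported in the abstract: 96 correct out of 123 cases.
reported_correct, reported_total = 96, 123
print(f"{reported_correct / reported_total:.1%}")
```

Strict string equality is the simplest reading of "exact diagnostic matches"; a real evaluation would likely need expert judgment to decide whether synonymous diagnostic terms count as a match.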

CONCLUSIONS

ChatGPT-4o demonstrates a promising ability to assist in the diagnostic process of complex maxillofacial conditions, with a relatively high accuracy rate in challenging cases. While it is not a replacement for expert clinical judgment, large language models may offer valuable decision support in oral and maxillofacial radiology, particularly in educational or consultative contexts.

CLINICAL TRIAL NUMBER

Not applicable.
