Tassoker Melek
Department of Dentomaxillofacial Radiology, Faculty of Dentistry, Necmettin Erbakan University, Baglarbasi sk, Meram, Konya, 42050, Türkiye.
BMC Oral Health. 2025 Jul 17;25(1):1187. doi: 10.1186/s12903-025-06444-x.
This study aimed to evaluate the diagnostic performance of ChatGPT-4o, a large language model developed by OpenAI, in diagnostically challenging cases of oral and maxillofacial diseases published in a journal case section.
A total of 123 diagnostically challenging oral and maxillofacial cases published in the aforementioned journal were retrospectively collected. The case presentations, which included detailed clinical, radiographic, and sometimes histopathologic descriptions, were input into ChatGPT-4o. The model was prompted to provide a single most likely diagnosis for each case. These outputs were then compared to the final diagnoses established by expert consensus in each original case report. The accuracy of ChatGPT-4o was calculated based on exact diagnostic matches.
ChatGPT-4o correctly diagnosed 96 of 123 cases, an overall diagnostic accuracy of 78%. In the remaining 27 cases, where it did not give the exact diagnosis, the model often suggested one of the clinically reasonable differential diagnoses.
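The exact-match scoring described above can be illustrated with a minimal sketch. It is not the author's actual procedure: the abstract does not say how matches were judged, so the `normalize` helper, the `exact_match_accuracy` function, and the four example diagnoses are all hypothetical. The sketch assumes a match means the two diagnosis strings are identical after lowercasing and collapsing whitespace.

```python
def normalize(diagnosis: str) -> str:
    """Lowercase and collapse whitespace (an assumed matching rule, not from the study)."""
    return " ".join(diagnosis.lower().split())


def exact_match_accuracy(predicted: list[str], reference: list[str]) -> tuple[int, float]:
    """Return (number of exact matches, accuracy) for paired diagnosis lists."""
    if len(predicted) != len(reference):
        raise ValueError("predicted and reference must have the same length")
    if not reference:
        raise ValueError("at least one case is required")
    matches = sum(normalize(p) == normalize(r) for p, r in zip(predicted, reference))
    return matches, matches / len(reference)


# Hypothetical toy data: four cases, one of which the model gets wrong.
model_output = ["Ameloblastoma", "odontogenic  keratocyst", "Dentigerous cyst", "Osteosarcoma"]
final_diagnosis = ["ameloblastoma", "Odontogenic keratocyst", "Radicular cyst", "Osteosarcoma"]

matches, accuracy = exact_match_accuracy(model_output, final_diagnosis)
print(f"{matches}/{len(final_diagnosis)} correct, accuracy = {accuracy:.0%}")

# The reported figure follows from the same ratio: 96 correct out of 123 cases.
print(f"reported accuracy = {96 / 123:.0%}")
```

In practice, string equality is a strict criterion. Synonymous diagnoses would need expert adjudication, which a simple comparison like this cannot capture.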
ChatGPT-4o demonstrates a promising ability to assist in the diagnostic process of complex maxillofacial conditions, with a relatively high accuracy rate in challenging cases. While it is not a replacement for expert clinical judgment, large language models may offer valuable decision support in oral and maxillofacial radiology, particularly in educational or consultative contexts.