Fanelli Francesco, Saleh Muhammad, Santamaria Pasquale, Zhurakivska Khrystyna, Nibali Luigi, Troiano Giuseppe
Department of Clinical and Experimental Medicine, University of Foggia, Foggia, Italy.
Department of Periodontics and Oral Medicine, University of Michigan School of Dentistry, Ann Arbor, Michigan, USA.
J Clin Periodontol. 2025 May;52(5):707-716. doi: 10.1111/jcpe.14101. Epub 2024 Dec 26.
Artificial intelligence (AI) has the potential to enhance healthcare practices, including periodontology, by improving diagnostics, treatment planning and patient care. This study introduces 'PerioGPT', a specialized AI model designed to provide up-to-date periodontal knowledge using GPT-4o and a novel retrieval-augmented generation (RAG) system.
PerioGPT was evaluated in two phases. First, its performance was compared against those of five other chatbots using 50 periodontal questions from specialists, followed by a validation with 71 questions from the 2023-2024 'In-Service Examination' of the American Academy of Periodontology (AAP). The second phase focused on assessing PerioGPT's generative capacity, specifically its ability to create complex and accurate periodontal questions.
PerioGPT outperformed other chatbots, achieving a higher accuracy rate (81.16%) and generating more complex and precise questions with a mean complexity score of 3.81 ± 0.965 and an accuracy score of 4.35 ± 0.898. These results demonstrate PerioGPT's potential as a leading tool for creating reliable clinical queries in periodontology.
This study underscores the transformative potential of AI in periodontology, illustrating that specialized models can offer significant advantages over general language models for both educational and clinical applications. The findings highlight that tailoring AI technologies to specific medical fields may improve performance and relevance.
人工智能(AI)有潜力通过改善诊断、治疗计划和患者护理来提升包括牙周病学在内的医疗实践。本研究介绍了“PerioGPT”,这是一种专门的人工智能模型,旨在使用GPT - 4o和一种新颖的检索增强生成(RAG)系统提供最新的牙周知识。
PerioGPT分两个阶段进行评估。首先,使用来自专家的50个牙周问题将其性能与其他五个聊天机器人的性能进行比较,随后用来自美国牙周病学会(AAP)2023 - 2024年“在职考试”的71个问题进行验证。第二阶段重点评估PerioGPT的生成能力,特别是其创建复杂且准确的牙周问题的能力。
PerioGPT的表现优于其他聊天机器人,准确率更高(81.16%),生成的问题更复杂、精确,平均复杂度评分为3.81 ± 0.965,准确率评分为4.35 ± 0.898。这些结果证明了PerioGPT作为牙周病学中创建可靠临床问题的领先工具的潜力。
本研究强调了人工智能在牙周病学中的变革潜力,表明专门模型在教育和临床应用方面比通用语言模型具有显著优势。研究结果突出了针对特定医学领域定制人工智能技术可能会提高性能和相关性。