一种经重新训练的牙周病学专用GPT-4o模型的开发与比较评估

Development and Comparative Evaluation of a Reinstructed GPT-4o Model Specialized in Periodontology.

作者信息

Fanelli Francesco, Saleh Muhammad, Santamaria Pasquale, Zhurakivska Khrystyna, Nibali Luigi, Troiano Giuseppe

机构信息

Department of Clinical and Experimental Medicine, University of Foggia, Foggia, Italy.

Department of Periodontics and Oral Medicine, University of Michigan School of Dentistry, Ann Arbor, Michigan, USA.

出版信息

J Clin Periodontol. 2025 May;52(5):707-716. doi: 10.1111/jcpe.14101. Epub 2024 Dec 26.

DOI:10.1111/jcpe.14101

PMID:39723544

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12003056/

Abstract

BACKGROUND

Artificial intelligence (AI) has the potential to enhance healthcare practices, including periodontology, by improving diagnostics, treatment planning and patient care. This study introduces 'PerioGPT', a specialized AI model designed to provide up-to-date periodontal knowledge using GPT-4o and a novel retrieval-augmented generation (RAG) system.

METHODS

PerioGPT was evaluated in two phases. First, its performance was compared against those of five other chatbots using 50 periodontal questions from specialists, followed by a validation with 71 questions from the 2023-2024 'In-Service Examination' of the American Academy of Periodontology (AAP). The second phase focused on assessing PerioGPT's generative capacity, specifically its ability to create complex and accurate periodontal questions.

RESULTS

PerioGPT outperformed other chatbots, achieving a higher accuracy rate (81.16%) and generating more complex and precise questions with a mean complexity score of 3.81 ± 0.965 and an accuracy score of 4.35 ± 0.898. These results demonstrate PerioGPT's potential as a leading tool for creating reliable clinical queries in periodontology.

CONCLUSIONS

This study underscores the transformative potential of AI in periodontology, illustrating that specialized models can offer significant advantages over general language models for both educational and clinical applications. The findings highlight that tailoring AI technologies to specific medical fields may improve performance and relevance.

摘要

背景

人工智能（AI）有潜力通过改善诊断、治疗计划和患者护理来提升包括牙周病学在内的医疗实践。本研究介绍了“PerioGPT”，这是一种专门的人工智能模型，旨在使用GPT - 4o和一种新颖的检索增强生成（RAG）系统提供最新的牙周知识。

方法

PerioGPT分两个阶段进行评估。首先，使用来自专家的50个牙周问题将其性能与其他五个聊天机器人的性能进行比较，随后用来自美国牙周病学会（AAP）2023 - 2024年“在职考试”的71个问题进行验证。第二阶段重点评估PerioGPT的生成能力，特别是其创建复杂且准确的牙周问题的能力。

结果

PerioGPT的表现优于其他聊天机器人，准确率更高（81.16%），生成的问题更复杂、精确，平均复杂度评分为3.81 ± 0.965，准确率评分为4.35 ± 0.898。这些结果证明了PerioGPT作为牙周病学中创建可靠临床问题的领先工具的潜力。

结论

本研究强调了人工智能在牙周病学中的变革潜力，表明专门模型在教育和临床应用方面比通用语言模型具有显著优势。研究结果突出了针对特定医学领域定制人工智能技术可能会提高性能和相关性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/332f/12003056/32fc9448db8c/JCPE-52-707-g004.jpg

相似文献

Development and Comparative Evaluation of a Reinstructed GPT-4o Model Specialized in Periodontology.一种经重新训练的牙周病学专用GPT-4o模型的开发与比较评估

J Clin Periodontol. 2025 May;52(5):707-716. doi: 10.1111/jcpe.14101. Epub 2024 Dec 26.

Performance of three artificial intelligence (AI)-based large language models in standardized testing; implications for AI-assisted dental education.三种基于人工智能（AI）的大语言模型在标准化测试中的表现；对人工智能辅助牙科教育的启示。

J Periodontal Res. 2025 Feb;60(2):121-133. doi: 10.1111/jre.13323. Epub 2024 Jul 18.

Comparative performance of artificial intelligence models in rheumatology board-level questions: evaluating Google Gemini and ChatGPT-4o.人工智能模型在风湿病委员会级问题中的比较性能：评估 Google Gemini 和 ChatGPT-4o。

Clin Rheumatol. 2024 Nov;43(11):3507-3513. doi: 10.1007/s10067-024-07154-5. Epub 2024 Sep 28.

Empowering personalized pharmacogenomics with generative AI solutions.利用生成式人工智能解决方案增强个性化药物基因组学。

J Am Med Inform Assoc. 2024 May 20;31(6):1356-1366. doi: 10.1093/jamia/ocae039.

Artificial intelligence in dental education: ChatGPT's performance on the periodontic in-service examination.人工智能在口腔医学教育中的应用：ChatGPT 在牙周在职考试中的表现。

J Periodontol. 2024 Jul;95(7):682-687. doi: 10.1002/JPER.23-0514. Epub 2024 Jan 10.

Assessing Generative Pretrained Transformers (GPT) in Clinical Decision-Making: Comparative Analysis of GPT-3.5 and GPT-4.评估生成式预训练转换器（GPT）在临床决策中的应用：GPT-3.5 和 GPT-4 的对比分析。

J Med Internet Res. 2024 Jun 27;26:e54571. doi: 10.2196/54571.

Performance of artificial intelligence on Turkish dental specialization exam: can ChatGPT-4.0 and gemini advanced achieve comparable results to humans?人工智能在土耳其牙科专业考试中的表现：ChatGPT-4.0和Gemini Advanced能否取得与人类相当的成绩？

BMC Med Educ. 2025 Feb 10;25(1):214. doi: 10.1186/s12909-024-06389-9.

Evaluating ChatGPT and Google Gemini Performance and Implications in Turkish Dental Education.评估ChatGPT和谷歌Gemini在土耳其牙科教育中的性能及影响

Cureus. 2025 Jan 11;17(1):e77292. doi: 10.7759/cureus.77292. eCollection 2025 Jan.

The future of AI clinicians: assessing the modern standard of chatbots and their approach to diagnostic uncertainty.人工智能临床医生的未来：评估现代聊天机器人的标准及其对诊断不确定性的处理方法。

BMC Med Educ. 2024 Oct 11;24(1):1133. doi: 10.1186/s12909-024-06115-5.

GPT-4o vs. Human Candidates: Performance Analysis in the Polish Final Dentistry Examination.GPT-4o与人类考生：波兰牙科最终考试中的表现分析

Cureus. 2024 Sep 6;16(9):e68813. doi: 10.7759/cureus.68813. eCollection 2024 Sep.

引用本文的文献

Enhancing patient-centered information on implant dentistry through prompt engineering: a comparison of four large language models.通过提示工程增强种植牙科以患者为中心的信息：四种大语言模型的比较

Front Oral Health. 2025 Apr 7;6:1566221. doi: 10.3389/froh.2025.1566221. eCollection 2025.

The Transformative Role of Artificial Intelligence in Dentistry: A Comprehensive Overview. Part 1: Fundamentals of AI, and its Contemporary Applications in Dentistry.人工智能在牙科领域的变革性作用：全面概述。第1部分：人工智能基础及其在牙科领域的当代应用。

Int Dent J. 2025 Apr;75(2):383-396. doi: 10.1016/j.identj.2025.02.005. Epub 2025 Mar 11.

本文引用的文献

J Periodontal Res. 2025 Feb;60(2):121-133. doi: 10.1111/jre.13323. Epub 2024 Jul 18.

Performance of ChatGPT in classifying periodontitis according to the 2018 classification of periodontal diseases.ChatGPT 在根据 2018 年牙周病分类法对牙周炎进行分类方面的表现。

Clin Oral Investig. 2024 Jun 29;28(7):407. doi: 10.1007/s00784-024-05799-9.

GastroBot: a Chinese gastrointestinal disease chatbot based on the retrieval-augmented generation.GastroBot：一个基于检索增强生成技术的中文胃肠疾病聊天机器人。

Front Med (Lausanne). 2024 May 22;11:1392555. doi: 10.3389/fmed.2024.1392555. eCollection 2024.

A retrieval-augmented chatbot based on GPT-4 provides appropriate differential diagnosis in gastrointestinal radiology: a proof of concept study.基于 GPT-4 的检索增强型聊天机器人可在胃肠放射学中提供适当的鉴别诊断：概念验证研究。

Eur Radiol Exp. 2024 May 17;8(1):60. doi: 10.1186/s41747-024-00457-x.

Integrating Retrieval-Augmented Generation with Large Language Models in Nephrology: Advancing Practical Applications.将检索增强生成与大型语言模型在肾脏病学中的整合：推进实际应用。

Medicina (Kaunas). 2024 Mar 8;60(3):445. doi: 10.3390/medicina60030445.

ChatGPT to Enhance Learning in Dental Education at a Historically Black Medical College.ChatGPT助力一所历史悠久的黑人医学院提升牙科教育水平。

Dent Res Oral Health. 2024;7(1):8-14. doi: 10.26502/droh.0069. Epub 2024 Jan 25.

ChatGPT performance in prosthodontics: Assessment of accuracy and repeatability in answer generation.ChatGPT 在口腔修复学中的表现：评估其在回答生成中的准确性和可重复性。

J Prosthet Dent. 2024 Apr;131(4):659.e1-659.e6. doi: 10.1016/j.prosdent.2024.01.018. Epub 2024 Feb 2.

J Periodontol. 2024 Jul;95(7):682-687. doi: 10.1002/JPER.23-0514. Epub 2024 Jan 10.

A content-aware chatbot based on GPT 4 provides trustworthy recommendations for Cone-Beam CT guidelines in dental imaging.基于GPT 4的内容感知聊天机器人为牙科成像中的锥形束CT指南提供可靠建议。

Dentomaxillofac Radiol. 2024 Feb 8;53(2):109-114. doi: 10.1093/dmfr/twad015.

A systematic review and meta-analysis on ChatGPT and its utilization in medical and dental research.关于ChatGPT及其在医学和牙科研究中的应用的系统评价和荟萃分析。

Heliyon. 2023 Nov 29;9(12):e23050. doi: 10.1016/j.heliyon.2023.e23050. eCollection 2023 Dec.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

一种经重新训练的牙周病学专用GPT-4o模型的开发与比较评估

Development and Comparative Evaluation of a Reinstructed GPT-4o Model Specialized in Periodontology.

作者信息

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSIONS

背景

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献