文献检索文档翻译深度研究
Suppr Zotero 插件Zotero 插件
邀请有礼套餐&价格历史记录

新学期,新优惠

限时优惠:9月1日-9月22日

30天高级会员仅需29元

1天体验卡首发特惠仅需5.99元

了解详情
不再提醒
插件&应用
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
高级版
套餐订阅购买积分包
AI 工具
文献检索文档翻译深度研究
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2025

评估ChatGPT-4和Gemini对世界牙科联盟关于口腔健康常见问题的回答的准确性。

Evaluation of the accuracy of ChatGPT-4 and Gemini's responses to the World Dental Federation's frequently asked questions on oral health.

作者信息

Arpaci Aysenur, Ozturk Asel Usdat, Okur Ismail, Sadry Sanaz

机构信息

Faculty of Dentistry, Department of Periodontology, Istanbul Aydin University, Istanbul, Turkey.

Faculty of Dentistry, Department of Maxillofacial Radiology, Istanbul Atlas University, Istanbul, Turkey.

出版信息

BMC Oral Health. 2025 Aug 2;25(1):1293. doi: 10.1186/s12903-025-06624-9.


DOI:10.1186/s12903-025-06624-9
PMID:40753419
Abstract

BACKGROUND: The field of artificial intelligence (AI) has experienced considerable growth in recent years, with the advent of technologies that are transforming a range of industries, including healthcare and dentistry. Large language models (LLMs) and natural language processing (NLP) are pivotal to this transformation. This study aimed to assess the efficacy of AI-supported chatbots in responding to questions frequently asked by patients to their doctors regarding oral health. METHODS: Frequently asked questions in the oral health section of the World Dental Federation FDI website were asked about Google-Gemini Trends and ChatGPT-4 chatbots on July 9, 2024. Responses from ChatGPT and Gemini, as well as those from the FDI webpage, were recorded. The accuracy of the responses given by ChatGPT-4 and Gemini to the four specified questions, the detection of similarities and differences, and the comprehensive examination of ChatGPT-4 and Gemini's capabilities were analyzed and reported by the researchers. Furthermore, the content of the texts was evaluated in terms of their similarity with respect to the following criteria: "Main Idea," "Quality Analysis," "Common Ideas," and "Inconsistent Ideas." RESULTS: It was observed that both ChatGPT-4 and Gemini exhibited performance comparable to that of the FDI responses in terms of completeness and clarity. Compared with Gemini, ChatGPT-4 provided responses that were more similar to the FDI responses in terms of relevance. Furthermore, ChatGPT-4 provided responses that were more accurate than those of Gemini in terms of the "Accuracy" criterion. CONCLUSIONS: This study demonstrated that, according to the assessment conducted by FDI, the ChatGPT-4 and Gemini applications contain contemporary and comprehensible information in response to general inquiries concerning oral health. These applications are regarded as a prevalent and dependable source of information for individuals seeking to access such data.

摘要

背景:近年来,随着人工智能(AI)技术的出现,该领域取得了显著发展,这些技术正在改变包括医疗保健和牙科在内的一系列行业。大语言模型(LLMs)和自然语言处理(NLP)对这一转变至关重要。本研究旨在评估人工智能支持的聊天机器人在回答患者向医生频繁询问的有关口腔健康问题方面的效果。 方法:2024年7月9日,在世界牙科联盟(FDI)网站的口腔健康板块中常见的问题被输入到谷歌Gemini Trends和ChatGPT-4聊天机器人中。记录了ChatGPT和Gemini的回答,以及FDI网页的回答。研究人员分析并报告了ChatGPT-4和Gemini对四个指定问题的回答准确性、异同检测以及对ChatGPT-4和Gemini能力的综合考察。此外,还根据以下标准对文本内容的相似性进行了评估:“主要观点”、“质量分析”、“共同观点”和“不一致观点”。 结果:观察到ChatGPT-4和Gemini在完整性和清晰度方面的表现与FDI的回答相当。与Gemini相比,ChatGPT-4在相关性方面提供的回答与FDI的回答更相似。此外,在“准确性”标准方面,ChatGPT-4提供的回答比Gemini更准确。 结论:本研究表明根据FDI的评估,ChatGPT-4和Gemini应用程序包含了当代且易于理解的信息,以回应有关口腔健康的一般询问。这些应用程序被视为寻求此类数据的个人普遍且可靠的信息来源。

相似文献

[1]
Evaluation of the accuracy of ChatGPT-4 and Gemini's responses to the World Dental Federation's frequently asked questions on oral health.

BMC Oral Health. 2025-8-2

[2]
Artificial Intelligence in Peripheral Artery Disease Education: A Battle Between ChatGPT and Google Gemini.

Cureus. 2025-6-1

[3]
Accuracy and Reliability of Artificial Intelligence Chatbots as Public Information Sources in Implant Dentistry.

Int J Oral Maxillofac Implants. 2025-6-25

[4]
Performance of 3 Conversational Generative Artificial Intelligence Models for Computing Maximum Safe Doses of Local Anesthetics: Comparative Analysis.

JMIR AI. 2025-5-13

[5]
Evaluating the readability, quality, and reliability of responses generated by ChatGPT, Gemini, and Perplexity on the most commonly asked questions about Ankylosing spondylitis.

PLoS One. 2025-6-18

[6]
Comparison of Responses from ChatGPT-4, Google Gemini, and Google Search to Common Patient Questions About Ankle Sprains: A Readability Analysis.

J Am Acad Orthop Surg. 2025-7-3

[7]
Evaluating the Performance of State-of-the-Art Artificial Intelligence Chatbots Based on the WHO Global Guidelines for the Prevention of Surgical Site Infection: Cross-Sectional Study.

J Med Internet Res. 2025-7-31

[8]
Psychological First Aid by AI: Proof-of-Concept and Comparative Performance of ChatGPT-4 and Gemini in Different Disaster Scenarios.

J Clin Psychol. 2025-8

[9]
A multi-dimensional performance evaluation of large language models in dental implantology: comparison of ChatGPT, DeepSeek, Grok, Gemini and Qwen across diverse clinical scenarios.

BMC Oral Health. 2025-7-28

[10]
Thyroid Eye Disease and Artificial Intelligence: A Comparative Study of ChatGPT-3.5, ChatGPT-4o, and Gemini in Patient Information Delivery.

Ophthalmic Plast Reconstr Surg. 2024-12-24

本文引用的文献

[1]
Large language models in health care: Development, applications, and challenges.

Health Care Sci. 2023-7-24

[2]
Evaluating the accuracy of Chat Generative Pre-trained Transformer version 4 (ChatGPT-4) responses to United States Food and Drug Administration (FDA) frequently asked questions about dental amalgam.

BMC Oral Health. 2024-5-24

[3]
ChatGPT performance in prosthodontics: Assessment of accuracy and repeatability in answer generation.

J Prosthet Dent. 2024-4

[4]
Examination of the reliability and readability of Chatbot Generative Pretrained Transformer's (ChatGPT) responses to questions about orthodontics and the evolution of these responses in an updated version.

Am J Orthod Dentofacial Orthop. 2024-5

[5]
Performance of Generative Artificial Intelligence in Dental Licensing Examinations.

Int Dent J. 2024-6

[6]
A systematic review and meta-analysis on ChatGPT and its utilization in medical and dental research.

Heliyon. 2023-11-29

[7]
Unveiling the ChatGPT phenomenon: Evaluating the consistency and accuracy of endodontic question answers.

Int Endod J. 2024-1

[8]
ChatGPT- versus human-generated answers to frequently asked questions about diabetes: A Turing test-inspired survey among employees of a Danish diabetes center.

PLoS One. 2023

[9]
Accuracy of ChatGPT-Generated Information on Head and Neck and Oromaxillofacial Surgery: A Multicenter Collaborative Analysis.

Otolaryngol Head Neck Surg. 2024-6

[10]
Artificial Intelligence and Public Health: Evaluating ChatGPT Responses to Vaccination Myths and Misconceptions.

Vaccines (Basel). 2023-7-7

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

推荐工具

医学文档翻译智能文献检索