Reliability of Large Language Model-Based Chatbots Versus Clinicians as Sources of Information on Orthodontics: A Comparative Analysis.

Authors

Martina Stefano, Cannatà Davide, Paduano Teresa, Schettino Valentina, Giordano Francesco, Galdi Marzio

Affiliation

Department of Medicine, Surgery and Dentistry "Scuola Medica Salernitana", University of Salerno, Via Allende, 84081 Baronissi, Italy.

Publication information

Dent J (Basel). 2025 Jul 24;13(8):343. doi: 10.3390/dj13080343.

DOI: 10.3390/dj13080343
PMID: 40863046
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12385111/
Abstract

Objectives: The present cross-sectional analysis aimed to investigate whether Large Language Model-based chatbots can be used as reliable sources of information in orthodontics by evaluating chatbot responses and comparing them to those of dental practitioners with different levels of knowledge. Methods: Eight frequently asked true/false orthodontic questions were submitted to five leading chatbots (ChatGPT-4, Claude-3-Opus, Gemini 2.0 Flash Experimental, Microsoft Copilot, and DeepSeek). The consistency of the answers given by each chatbot at four different times was assessed using Cronbach's α. The chi-squared test was used to compare chatbot responses with those given by two groups of clinicians, i.e., general dental practitioners (GDPs) and orthodontic specialists (Os), recruited in an online survey via social media; differences were considered significant when p < 0.05. Additionally, chatbots were asked to justify their dichotomous responses using a chain-of-thought prompting approach, and the educational value of these responses was rated according to the Global Quality Scale (GQS). Results: A high degree of consistency in answering was found for all analyzed chatbots (α > 0.80). When chatbot answers were compared with those of GDPs and Os, statistically significant differences were found for almost all the questions (p < 0.05). When the educational value of chatbot responses was evaluated, DeepSeek achieved the highest GQS score (median 4.00; interquartile range 0.00), whereas Copilot had the lowest (median 2.00; interquartile range 2.00). Conclusions: Although chatbots yield somewhat useful information about orthodontics, they can provide misleading information when dealing with controversial topics.
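The two statistics named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' analysis code: the function names, the coding of answers as 1 (true) and 0 (false), the 2x2 table layout, and every number in the example are assumptions made up for illustration. Cronbach's α is computed here by treating the four repeated runs as "items" and the eight questions as "cases"; the chi-squared test is the uncorrected Pearson test on a 2x2 table, whose p-value for one degree of freedom equals erfc(sqrt(χ²/2)).

```python
import math
from statistics import pvariance


def cronbach_alpha(runs):
    """Cronbach's alpha for k repeated runs (lists of equal length, one score per question)."""
    k = len(runs)
    if k < 2:
        raise ValueError("at least two runs are required")
    # Sum of the variances of each run across questions.
    item_variance_sum = sum(pvariance(run) for run in runs)
    # Variance of the per-question totals across runs.
    totals = [sum(scores) for scores in zip(*runs)]
    total_variance = pvariance(totals)
    if total_variance == 0:
        raise ValueError("alpha is undefined when the totals do not vary")
    return k / (k - 1) * (1 - item_variance_sum / total_variance)


def chi_squared_2x2(a, b, c, d):
    """Pearson chi-squared (no continuity correction) for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    denominator = (a + b) * (c + d) * (a + c) * (b + d)
    if denominator == 0:
        raise ValueError("a row or column total is zero")
    chi2 = n * (a * d - b * c) ** 2 / denominator
    # Survival function of the chi-squared distribution with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value


# Hypothetical answers of one chatbot to 8 questions at 4 time points (1 = true, 0 = false).
example_runs = [
    [1, 0, 1, 1, 0, 0, 1, 0],
    [1, 0, 1, 1, 0, 0, 1, 0],
    [1, 0, 1, 1, 0, 1, 1, 0],  # one answer differs on the sixth question
    [1, 0, 1, 1, 0, 0, 1, 0],
]
alpha = cronbach_alpha(example_runs)

# Hypothetical counts for one question: rows = chatbots, clinicians; columns = "true", "false".
chi2, p_value = chi_squared_2x2(20, 0, 30, 20)
```

With these made-up inputs, a single changed answer across the four runs still leaves α above the 0.80 level the abstract reports as high consistency, and the invented 2x2 table gives a difference that is significant at p < 0.05.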

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eff2/12385111/31c083170b69/dentistry-13-00343-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/eff2/12385111/b528a089457d/dentistry-13-00343-g001.jpg