文献检索文档翻译深度研究
Suppr Zotero 插件Zotero 插件
邀请有礼套餐&价格历史记录

新学期,新优惠

限时优惠:9月1日-9月22日

30天高级会员仅需29元

1天体验卡首发特惠仅需5.99元

了解详情
不再提醒
插件&应用
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
高级版
套餐订阅购买积分包
AI 工具
文献检索文档翻译深度研究
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2025

大语言模型能否作为牙科领域可靠的信息工具?一项系统综述。

Can Large Language Models Serve as Reliable Tools for Information in Dentistry? A Systematic Review.

作者信息

Alhazmi Nora, Alshehri Aram, BaHammam Fahad, Philip Manju, Nadeem Muhammad, Khanagar Sanjeev

机构信息

Department of Preventive Dental Sciences, College of Dentistry, King Saud bin Abdulaziz University for Health Sciences, King Abdullah International Medical Research Center, Ministry of the National Guard Health Affairs, Riyadh, Saudi Arabia.

Department of Restorative and Prosthetic Dental Sciences, College of Dentistry, King Saud bin Abdulaziz University for Health Sciences, King Abdullah International Medical Research Center, Ministry of the National Guard Health Affairs, Riyadh, Saudi Arabia.

出版信息

Int Dent J. 2025 May 16;75(4):100835. doi: 10.1016/j.identj.2025.04.015.


DOI:10.1016/j.identj.2025.04.015
PMID:40382915
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12146530/
Abstract

Large language models (LLMs) have gained popularity among dental students for generating subject-related answers. However, their widespread use raises significant concerns about misinformation. This systematic review aims to critically evaluate studies assessing the performance of LLMs in dentistry. A comprehensive electronic search was conducted in PubMed/Medline, Scopus, Embase, Web of Science, Google Scholar, and the Saudi Digital Library to identify studies published up to September 2024. The study quality was assessed using the Prediction Model Risk of Bias Assessment Tool (PROBAST). A total of 2030 studies have been identified. After removing 907 duplicate records, 1123 studies remained for screening. Ultimately, 31 studies met the inclusion criteria. Approximately half of these studies were classified as "high risk," while the remainder were classified as "low risk." The applicability of the findings was rated as "low concern." The primary limitations of LLMs include their inability to specify information sources and their tendency to generate fabricated citations. Based on this review, LLMs hold promise as supplementary educational tools in dentistry. Evidence indicates that students using LLMs may achieve improved academic performance compared to traditional methods. However, concerns about occasional inaccuracies and unreliable citations underscore the need for further research, integration with validated sources, and adherence to ethical guidelines. Ultimately, LLMs should be viewed as complementary tools within dental education, with careful consideration of their limitations.

摘要

大语言模型(LLMs)在牙科学生中因能生成与学科相关的答案而颇受欢迎。然而,它们的广泛使用引发了对错误信息的重大担忧。本系统综述旨在严格评估评估大语言模型在牙科领域表现的研究。在PubMed/Medline、Scopus、Embase、Web of Science、谷歌学术和沙特数字图书馆中进行了全面的电子检索,以识别截至2024年9月发表的研究。使用预测模型偏倚风险评估工具(PROBAST)评估研究质量。共识别出2030项研究。去除907条重复记录后,剩余1123项研究进行筛选。最终,31项研究符合纳入标准。其中约一半的研究被归类为“高风险”,其余的被归类为“低风险”。研究结果的适用性被评为“低关注度”。大语言模型的主要局限性包括无法指明信息来源以及有生成虚假引用的倾向。基于本综述,大语言模型有望成为牙科领域的辅助教育工具。有证据表明,与传统方法相比,使用大语言模型的学生可能会取得更好的学业成绩。然而,对偶尔出现的不准确信息和不可靠引用的担忧凸显了进一步研究、与经过验证的来源整合以及遵守道德准则的必要性。最终,大语言模型应被视为牙科教育中的补充工具,并需仔细考虑其局限性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16a7/12146530/5d760e8f4c25/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16a7/12146530/a589d915dc5e/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16a7/12146530/5d760e8f4c25/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16a7/12146530/a589d915dc5e/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/16a7/12146530/5d760e8f4c25/gr2.jpg

相似文献

[1]
Can Large Language Models Serve as Reliable Tools for Information in Dentistry? A Systematic Review.

Int Dent J. 2025-5-16

[2]
Generative AI/LLMs for Plain Language Medical Information for Patients, Caregivers and General Public: Opportunities, Risks and Ethics.

Patient Prefer Adherence. 2025-7-31

[3]
Applications and Concerns of ChatGPT and Other Conversational Large Language Models in Health Care: Systematic Review.

J Med Internet Res. 2024-11-7

[4]
Interventions to improve safe and effective medicines use by consumers: an overview of systematic reviews.

Cochrane Database Syst Rev. 2014-4-29

[5]
The educational effects of portfolios on undergraduate student learning: a Best Evidence Medical Education (BEME) systematic review. BEME Guide No. 11.

Med Teach. 2009-4

[6]
Factors within the clinical encounter that impact upon risk assessment within child and adolescent mental health services: a rapid realist synthesis.

Health Soc Care Deliv Res. 2024-1

[7]
Sexual Harassment and Prevention Training

2025-1

[8]
The measurement of collaboration within healthcare settings: a systematic review of measurement properties of instruments.

JBI Database System Rev Implement Rep. 2016-4

[9]
Stench of Errors or the Shine of Potential: The Challenge of (Ir)Responsible Use of ChatGPT in Speech-Language Pathology.

Int J Lang Commun Disord. 2025

[10]
What is the value of routinely testing full blood count, electrolytes and urea, and pulmonary function tests before elective surgery in patients with no apparent clinical indication and in subgroups of patients with common comorbidities: a systematic review of the clinical and cost-effective literature.

Health Technol Assess. 2012-12

本文引用的文献

[1]
The Transformative Role of Artificial Intelligence in Dentistry: A Comprehensive Overview. Part 1: Fundamentals of AI, and its Contemporary Applications in Dentistry.

Int Dent J. 2025-4

[2]
The Transformative Role of Artificial Intelligence in Dentistry: A Comprehensive Overview Part 2: The Promise and Perils, and the International Dental Federation Communique.

Int Dent J. 2025-4

[3]
Performance of ChatGPT 3.5 and 4 on U.S. dental examinations: the INBDE, ADAT, and DAT.

Imaging Sci Dent. 2024-9

[4]
How reliable is the artificial intelligence product large language model ChatGPT in orthodontics?

Angle Orthod. 2024-11-1

[5]
Performance of three artificial intelligence (AI)-based large language models in standardized testing; implications for AI-assisted dental education.

J Periodontal Res. 2025-2

[6]
Assessing the Accuracy of AI Models in Orthodontic Knowledge: A Comparative Study Between ChatGPT-4 and Google Bard.

J Coll Physicians Surg Pak. 2024-7

[7]
Performance of ChatGPT in classifying periodontitis according to the 2018 classification of periodontal diseases.

Clin Oral Investig. 2024-6-29

[8]
Performance of large language models in oral and maxillofacial surgery examinations.

Int J Oral Maxillofac Surg. 2024-10

[9]
How well do large language model-based chatbots perform in oral and maxillofacial radiology?

Dentomaxillofac Radiol. 2024-9-1

[10]
Assessing ChatGPT's Diagnostic Accuracy and Therapeutic Strategies in Oral Pathologies: A Cross-Sectional Study.

Cureus. 2024-4-19

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

推荐工具

医学文档翻译智能文献检索