• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

ChatGPT 能否成为智齿拔除患者的准确且易读的辅助工具?

Is ChatGPT an Accurate and Readable Patient Aid for Third Molar Extractions?

机构信息

Resident, Department of Oral and Maxillofacial Surgery, Rutgers School of Dental Medicine, Newark, NJ.

Candidate, Rutgers New Jersey Medical School, Newark, NJ.

出版信息

J Oral Maxillofac Surg. 2024 Oct;82(10):1239-1245. doi: 10.1016/j.joms.2024.06.177. Epub 2024 Jul 2.

DOI:10.1016/j.joms.2024.06.177
PMID:39019079
Abstract

BACKGROUND

Artificial intelligence (AI) platforms such as Chat Generative Pre-Trained Transformer (ChatGPT) (Open AI, San Francisco, California, USA) have the capacity to answer health-related questions. It remains unknown whether AI can be a patient-friendly and accurate resource regarding third molar extraction.

PURPOSE

The purpose was to determine the accuracy and readability of AI responses to common patient questions regarding third molar extraction.

STUDY DESIGN, SETTING, SAMPLE: This is a cross sectional in-silico assessment of readability and soundness of a computer-generated report.

INDEPENDENT VARIABLE

Not applicable.

MAIN OUTCOME VARIABLES

Accuracy, or the ability to provide clinically correct and relevant information, was determined subjectively by 2 reviewers using a 5-point Likert scale, and objectively by comparing responses to American Association of Oral and Maxillofacial Surgeons (AAOMS) clinical consensus papers. Readability, or how easy a piece of text is to read, was assessed using the Flesch Kincaid Reading Ease (FKRE) and Flesch Kincaid Grade Level (FKGL). Both assess readability based on mean number of syllables per word, and words per sentence. To be deemed readable, FKRE should be >60 and FKGL should be <8.

COVARIATES

Not applicable.

ANALYSES

Descriptive statistics were used to analyze the findings of this study.

RESULTS

AI-generated responses above the recommended level for the average patient (FKRE: 52; FKGL: 10). The average Likert score was 4.36, suggesting that most responses were accurate with minor inaccuracies or missing information. AI correctly deferred to the provider in instances where no definitive answer exists. Of the responses that addressed content in AAOMS consensus papers, 18/19 responses closely aligned with them. All prompts did not provide citations or references.

CONCLUSION AND RELEVANCE

AI was able to provide mostly accurate responses, and content was closely aligned with AAOMS guidelines. However, responses were too complex for the average third molar extraction patient, and were deficient in citations and references. It is important for providers to educate patients on the utility of AI, and to decide whether to recommend using it for information. Ultimately, the best resource for answers is from the practitioners themselves because the AI platform lacks clinical experience.

摘要

背景

人工智能(AI)平台,如 Chat Generative Pre-Trained Transformer(ChatGPT)(美国加利福尼亚州旧金山的 Open AI),有能力回答与健康相关的问题。目前尚不清楚 AI 是否可以成为一种针对第三磨牙拔除的患者友好且准确的资源。

目的

旨在确定 AI 对常见的与第三磨牙拔除相关的患者问题的回答的准确性和可读性。

研究设计、地点、样本:这是对计算机生成报告的可读性和合理性的横截面计算机模拟评估。

自变量

不适用。

主要观察指标

准确性,即提供临床正确和相关信息的能力,由 2 位评审员使用 5 分李克特量表进行主观判断,并通过与美国口腔颌面外科学会(AAOMS)临床共识文件的比较来客观判断。可读性,即文本的易读程度,使用弗莱什金纳阅读舒适度(FKRE)和弗莱什金纳等级(FKGL)进行评估。两者都基于每个单词的平均音节数和每个句子的单词数来评估可读性。为了被认为是可读的,FKRE 应该大于 60,FKGL 应该小于 8。

协变量

不适用。

分析

使用描述性统计来分析本研究的结果。

结果

AI 生成的回复高于推荐给普通患者的水平(FKRE:52;FKGL:10)。平均李克特分数为 4.36,这表明大多数回复是准确的,只有一些细微的不准确或缺失的信息。在没有明确答案的情况下,AI 正确地将回复转交给了提供者。在回答 AAOMS 共识文件中包含的内容的回复中,18/19 个回复与它们密切一致。所有的提示都没有提供引文或参考文献。

结论和相关性

AI 能够提供大多准确的回复,内容与 AAOMS 指南密切一致。然而,回复对于普通的第三磨牙拔除患者来说过于复杂,并且缺乏引文和参考文献。重要的是,提供者应该教育患者 AI 的实用性,并决定是否推荐他们使用 AI 来获取信息。最终,回答问题的最佳资源来自从业者本身,因为 AI 平台缺乏临床经验。

相似文献

1
Is ChatGPT an Accurate and Readable Patient Aid for Third Molar Extractions?ChatGPT 能否成为智齿拔除患者的准确且易读的辅助工具?
J Oral Maxillofac Surg. 2024 Oct;82(10):1239-1245. doi: 10.1016/j.joms.2024.06.177. Epub 2024 Jul 2.
2
Accuracy and Readability of Artificial Intelligence Chatbot Responses to Vasectomy-Related Questions: Public Beware.人工智能聊天机器人对输精管切除术相关问题回答的准确性和可读性:公众需谨慎。
Cureus. 2024 Aug 28;16(8):e67996. doi: 10.7759/cureus.67996. eCollection 2024 Aug.
3
Dr. Google to Dr. ChatGPT: assessing the content and quality of artificial intelligence-generated medical information on appendicitis.谷歌博士对 ChatGPT 博士:评估人工智能生成的关于阑尾炎的医学信息的内容和质量。
Surg Endosc. 2024 May;38(5):2887-2893. doi: 10.1007/s00464-024-10739-5. Epub 2024 Mar 5.
4
Can Artificial Intelligence Improve the Readability of Patient Education Materials?人工智能能否提高患者教育材料的可读性?
Clin Orthop Relat Res. 2023 Nov 1;481(11):2260-2267. doi: 10.1097/CORR.0000000000002668. Epub 2023 Apr 28.
5
Both Patients and Plastic Surgeons Prefer Artificial Intelligence-Generated Microsurgical Information.患者和整形外科医生都更喜欢人工智能生成的显微手术信息。
J Reconstr Microsurg. 2024 Nov;40(9):657-664. doi: 10.1055/a-2273-4163. Epub 2024 Feb 21.
6
Evaluating the Efficacy of ChatGPT as a Patient Education Tool in Prostate Cancer: Multimetric Assessment.评估 ChatGPT 在前列腺癌患者教育中的疗效:多指标评估。
J Med Internet Res. 2024 Aug 14;26:e55939. doi: 10.2196/55939.
7
Evaluating the accuracy and readability of ChatGPT in providing parental guidance for adenoidectomy, tonsillectomy, and ventilation tube insertion surgery.评估 ChatGPT 在提供腺样体切除术、扁桃体切除术和通气管插入手术的家长指导方面的准确性和可读性。
Int J Pediatr Otorhinolaryngol. 2024 Jun;181:111998. doi: 10.1016/j.ijporl.2024.111998. Epub 2024 May 31.
8
Artificial intelligence insights into osteoporosis: assessing ChatGPT's information quality and readability.人工智能在骨质疏松症中的应用:评估 ChatGPT 的信息质量和可读性。
Arch Osteoporos. 2024 Mar 19;19(1):17. doi: 10.1007/s11657-024-01376-5.
9
The Use of Large Language Models to Generate Education Materials about Uveitis.使用大型语言模型生成有关葡萄膜炎的教育材料。
Ophthalmol Retina. 2024 Feb;8(2):195-201. doi: 10.1016/j.oret.2023.09.008. Epub 2023 Sep 15.
10
Evaluation of Generative Language Models in Personalizing Medical Information: Instrument Validation Study.生成式语言模型在个性化医疗信息方面的评估:工具验证研究
JMIR AI. 2024 Aug 13;3:e54371. doi: 10.2196/54371.

引用本文的文献

1
Effect of AI-based chatbots in promoting oral health awareness among rural populations.基于人工智能的聊天机器人对提高农村人口口腔健康意识的影响。
Bioinformation. 2025 Apr 30;21(4):827-831. doi: 10.6026/973206300210827. eCollection 2025.
2
Performance of AI Chatbots in Preliminary Diagnosis of Maxillofacial Pathologies.人工智能聊天机器人在颌面疾病初步诊断中的表现。
Med Sci Monit. 2025 Jul 9;31:e949076. doi: 10.12659/MSM.949076.
3
Comparative evaluation of responses from DeepSeek-R1, ChatGPT-o1, ChatGPT-4, and dental GPT chatbots to patient inquiries about dental and maxillofacial prostheses.
对DeepSeek-R1、ChatGPT-o1、ChatGPT-4和牙科GPT聊天机器人针对患者有关口腔颌面修复体询问的回复进行比较评估。
BMC Oral Health. 2025 May 31;25(1):871. doi: 10.1186/s12903-025-06267-w.
4
Artificial intelligence-based chatbot assistance in clinical decision-making for medically complex patients in oral surgery: a comparative study.基于人工智能的聊天机器人在口腔外科复杂病情患者临床决策中的辅助作用:一项对比研究。
BMC Oral Health. 2025 Mar 7;25(1):351. doi: 10.1186/s12903-025-05732-w.
5
Chat Generative Pre-Trained Transformer (ChatGPT) in Oral and Maxillofacial Surgery: A Narrative Review on Its Research Applications and Limitations.口腔颌面外科中的聊天生成预训练变换器(ChatGPT):关于其研究应用和局限性的叙述性综述
J Clin Med. 2025 Feb 18;14(4):1363. doi: 10.3390/jcm14041363.
6
Performance of large language artificial intelligence models on solving restorative dentistry and endodontics student assessments.大型语言人工智能模型在解决修复牙科和牙髓学生评估方面的性能。
Clin Oral Investig. 2024 Oct 7;28(11):575. doi: 10.1007/s00784-024-05968-w.