• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于人工智能的聊天机器人在口腔外科复杂病情患者临床决策中的辅助作用:一项对比研究。

Artificial intelligence-based chatbot assistance in clinical decision-making for medically complex patients in oral surgery: a comparative study.

作者信息

Şişman Alanur Çiftçi, Acar Ahmet Hüseyin

机构信息

Hamidiye Faculty of Dental Medicine, Department of Oral and Maxillofacial Surgery, University of Health Sciences, Istanbul, Türkiye.

Faculty of Dentistry, Department of Oral and Maxillofacial Surgery, Istanbul Medeniyet University, Istanbul, Türkiye.

出版信息

BMC Oral Health. 2025 Mar 7;25(1):351. doi: 10.1186/s12903-025-05732-w.

DOI:10.1186/s12903-025-05732-w
PMID:40055745
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11887094/
Abstract

AIM

This study aims to evaluate the potential of AI-based chatbots in assisting with clinical decision-making in the management of medically complex patients in oral surgery.

MATERIALS AND METHODS

A team of oral and maxillofacial surgeons developed a pool of open-ended questions de novo. The validity of the questions was assessed using Lawshe's Content Validity Index. The questions, which focused on systemic diseases and common conditions that may raise concerns during oral surgery, were presented to ChatGPT 3.5 and Claude-instant in two separate sessions, spaced one week apart. Two experienced maxillofacial surgeons, blinded to the chatbots, assessed the responses for quality, accuracy, and completeness using a modified DISCERN tool and Likert scale. Intraclass correlation, Mann-Whitney U test, skewness, and kurtosis coefficients were employed to compare the performances of the chatbots.

RESULTS

Most responses were high quality: 86% and 79.6% for ChatGPT, and 81.25% and 89% for Claude-instant in sessions 1 and 2, respectively. In terms of accuracy, ChatGPT had 92% and 93.4% of its responses rated as completely correct in sessions 1 and 2, respectively, while Claude-instant had 95.2% and 89%. For completeness, ChatGPT had 88.5% and 86.8% of its responses rated as adequate or comprehensive in sessions 1 and 2, respectively, while Claude-instant had 95.2% and 86%.

CONCLUSION

Ongoing software developments and the increasing acceptance of chatbots among healthcare professionals hold promise that these tools can provide rapid solutions to the high demand for medical care, ease professionals' workload, reduce costs, and save time.

摘要

目的

本研究旨在评估基于人工智能的聊天机器人在口腔外科复杂患者管理中辅助临床决策的潜力。

材料与方法

一组口腔颌面外科医生重新编制了一系列开放式问题。使用劳希内容效度指数评估问题的有效性。这些聚焦于全身疾病以及口腔外科手术中可能引发关注的常见病症的问题,在两个独立的环节中分别呈现给ChatGPT 3.5和Claude-instant,两个环节间隔一周。两名经验丰富的颌面外科医生在对聊天机器人不知情的情况下,使用改良的DISCERN工具和李克特量表评估回答的质量、准确性和完整性。采用组内相关系数、曼-惠特尼U检验、偏度和峰度系数来比较聊天机器人的表现。

结果

大多数回答质量较高:在环节1中,ChatGPT的高质量回答率为86%,Claude-instant为81.25%;在环节2中,ChatGPT为79.6%,Claude-instant为89%。在准确性方面,ChatGPT在环节1和环节2中分别有92%和93.4%的回答被评为完全正确,而Claude-instant分别为 95.2%和89%。在完整性方面,ChatGPT在环节1和环节2中分别有88.5%和86.8%的回答被评为足够或全面,而Claude-instant分别为95.2%和86%。

结论

持续的软件开发以及医疗保健专业人员对聊天机器人的接受度不断提高,预示着这些工具能够为医疗护理的高需求提供快速解决方案,减轻专业人员的工作量,降低成本并节省时间。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4baf/11887094/4d3ed7a653b6/12903_2025_5732_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4baf/11887094/4d3ed7a653b6/12903_2025_5732_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4baf/11887094/4d3ed7a653b6/12903_2025_5732_Fig1_HTML.jpg

相似文献

1
Artificial intelligence-based chatbot assistance in clinical decision-making for medically complex patients in oral surgery: a comparative study.基于人工智能的聊天机器人在口腔外科复杂病情患者临床决策中的辅助作用:一项对比研究。
BMC Oral Health. 2025 Mar 7;25(1):351. doi: 10.1186/s12903-025-05732-w.
2
Evaluation of AI-generated responses by different artificial intelligence chatbots to the clinical decision-making case-based questions in oral and maxillofacial surgery.评估不同人工智能聊天机器人对口腔颌面外科基于临床决策案例问题的人工智能生成回复。
Oral Surg Oral Med Oral Pathol Oral Radiol. 2024 Jun;137(6):587-593. doi: 10.1016/j.oooo.2024.02.018. Epub 2024 Mar 6.
3
Accuracy of Prospective Assessments of 4 Large Language Model Chatbot Responses to Patient Questions About Emergency Care: Experimental Comparative Study.前瞻性评估 4 种大型语言模型聊天机器人对患者关于急救护理问题的回答的准确性:实验性对比研究。
J Med Internet Res. 2024 Nov 4;26:e60291. doi: 10.2196/60291.
4
Quality of Information Provided by Artificial Intelligence Chatbots Surrounding the Management of Vestibular Schwannomas: A Comparative Analysis Between ChatGPT-4 and Claude 2.人工智能聊天机器人提供的关于前庭神经鞘瘤管理的信息质量:ChatGPT-4与Claude 2的比较分析
Otol Neurotol. 2025 Apr 1;46(4):432-436. doi: 10.1097/MAO.0000000000004410. Epub 2025 Feb 4.
5
Comparative assessment of artificial intelligence chatbots' performance in responding to healthcare professionals' and caregivers' questions about Dravet syndrome.人工智能聊天机器人在回答医疗专业人员和护理人员有关德雷维特综合征问题时的性能比较评估。
Epilepsia Open. 2025 Apr 1. doi: 10.1002/epi4.70022.
6
Performance of Artificial Intelligence Chatbots in Responding to Patient Queries Related to Traumatic Dental Injuries: A Comparative Study.人工智能聊天机器人在回应与创伤性牙损伤相关的患者咨询中的表现:一项比较研究。
Dent Traumatol. 2025 Jun;41(3):338-347. doi: 10.1111/edt.13020. Epub 2024 Nov 22.
7
Evaluating the Quality and Readability of Generative Artificial Intelligence (AI) Chatbot Responses in the Management of Achilles Tendon Rupture.评估生成式人工智能(AI)聊天机器人在跟腱断裂管理中的回复质量和可读性。
Cureus. 2025 Jan 31;17(1):e78313. doi: 10.7759/cureus.78313. eCollection 2025 Jan.
8
Accuracy and Readability of Artificial Intelligence Chatbot Responses to Vasectomy-Related Questions: Public Beware.人工智能聊天机器人对输精管切除术相关问题回答的准确性和可读性:公众需谨慎。
Cureus. 2024 Aug 28;16(8):e67996. doi: 10.7759/cureus.67996. eCollection 2024 Aug.
9
Evaluation of validity and reliability of AI Chatbots as public sources of information on dental trauma.评估人工智能聊天机器人作为牙科创伤公共信息来源的有效性和可靠性。
Dent Traumatol. 2025 Apr;41(2):187-193. doi: 10.1111/edt.13000. Epub 2024 Oct 17.
10
Assessing the Accuracy of Information on Medication Abortion: A Comparative Analysis of ChatGPT and Google Bard AI.评估药物流产信息的准确性:ChatGPT与谷歌巴德人工智能的比较分析
Cureus. 2024 Jan 2;16(1):e51544. doi: 10.7759/cureus.51544. eCollection 2024 Jan.

引用本文的文献

1
Evaluation of deepseek, gemini, ChatGPT-4o, and perplexity in responding to salivary gland cancer.评估DeepSeek、Gemini、ChatGPT-4o和Perplexity对涎腺癌的回答。
BMC Oral Health. 2025 Aug 23;25(1):1358. doi: 10.1186/s12903-025-06726-4.
2
Readability of AI-Generated Patient Information Leaflets on Alzheimer's, Vascular Dementia, and Delirium.关于阿尔茨海默病、血管性痴呆和谵妄的人工智能生成的患者信息手册的可读性。
Cureus. 2025 Jun 6;17(6):e85463. doi: 10.7759/cureus.85463. eCollection 2025 Jun.

本文引用的文献

1
Evaluating ChatGPT as a patient resource for frequently asked questions about lung cancer surgery-a pilot study.评估ChatGPT作为肺癌手术常见问题患者资源的可行性——一项试点研究。
J Thorac Cardiovasc Surg. 2025 Apr;169(4):1174-1180.e18. doi: 10.1016/j.jtcvs.2024.09.030. Epub 2024 Sep 24.
2
Can artificial intelligence models serve as patient information consultants in orthodontics?人工智能模型能否在正畸学中充当患者信息顾问?
BMC Med Inform Decis Mak. 2024 Jul 29;24(1):211. doi: 10.1186/s12911-024-02619-8.
3
Is ChatGPT an Accurate and Readable Patient Aid for Third Molar Extractions?
ChatGPT 能否成为智齿拔除患者的准确且易读的辅助工具?
J Oral Maxillofac Surg. 2024 Oct;82(10):1239-1245. doi: 10.1016/j.joms.2024.06.177. Epub 2024 Jul 2.
4
Assessing the utility of artificial intelligence throughout the triage outpatients: a prospective randomized controlled clinical study.评估人工智能在分诊门诊中的效用:一项前瞻性随机对照临床研究。
Front Public Health. 2024 May 30;12:1391906. doi: 10.3389/fpubh.2024.1391906. eCollection 2024.
5
Evaluation of AI-generated responses by different artificial intelligence chatbots to the clinical decision-making case-based questions in oral and maxillofacial surgery.评估不同人工智能聊天机器人对口腔颌面外科基于临床决策案例问题的人工智能生成回复。
Oral Surg Oral Med Oral Pathol Oral Radiol. 2024 Jun;137(6):587-593. doi: 10.1016/j.oooo.2024.02.018. Epub 2024 Mar 6.
6
Evaluation of information provided to patients by ChatGPT about chronic diseases in Spanish language.对ChatGPT以西班牙语向患者提供的有关慢性病信息的评估。
Digit Health. 2024 Jan 2;10:20552076231224603. doi: 10.1177/20552076231224603. eCollection 2024 Jan-Dec.
7
Evaluation of the reliability and readability of ChatGPT-4 responses regarding hypothyroidism during pregnancy.评估 ChatGPT-4 在妊娠期间甲状腺功能减退症相关问题的回复的可靠性和可读性。
Sci Rep. 2024 Jan 2;14(1):243. doi: 10.1038/s41598-023-50884-w.
8
Beyond the Scalpel: Assessing ChatGPT's potential as an auxiliary intelligent virtual assistant in oral surgery.超越手术刀:评估ChatGPT作为口腔外科辅助智能虚拟助手的潜力。
Comput Struct Biotechnol J. 2023 Dec 6;24:46-52. doi: 10.1016/j.csbj.2023.11.058. eCollection 2024 Dec.
9
Can natural language processing serve as a consultant in oral surgery?自然语言处理能否在口腔外科中充当顾问?
J Stomatol Oral Maxillofac Surg. 2024 Jun;125(3):101724. doi: 10.1016/j.jormas.2023.101724. Epub 2023 Dec 3.
10
Evaluation of the Performance of Generative AI Large Language Models ChatGPT, Google Bard, and Microsoft Bing Chat in Supporting Evidence-Based Dentistry: Comparative Mixed Methods Study.评估生成式 AI 大语言模型 ChatGPT、Google Bard 和 Microsoft Bing Chat 在支持循证牙科方面的性能:比较混合方法研究。
J Med Internet Res. 2023 Dec 28;25:e51580. doi: 10.2196/51580.