生成式人工智能在口腔医学领域的应用价值。

Application value of generative artificial intelligence in the field of stomatology.

机构信息

State Key Laboratory of Oral Diseases & National Center for Stomatology & National Clinical Research Center for Oral Diseases & Dept. of Traumatic and Plastic Surgery, West China Hospital of Stomatology, Sichuan University, Chengdu 610041, China.

出版信息

Hua Xi Kou Qiang Yi Xue Za Zhi. 2024 Dec 1;42(6):810-815. doi: 10.7518/hxkq.2024.2024144.

DOI:10.7518/hxkq.2024.2024144

PMID:39610079

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11669926/

Abstract

OBJECTIVES

This study aims to compare and analyze three types of generative artificial intelligence (GAI) and explore their application value and existing problems in the field of stomatology in the Chinese context.

METHODS

A total of 36 questions were designed, covering all the professional areas of stomatology. The questions encompassed various aspects including medical records, professional knowledge, and translation and editing. These questions were submitted to ChatGPT4-turbo, Gemini (2024.2) and ERNIE Bot 4.0. After obtaining the answers, a blinded evaluation was conducted by three experienced oral medicine physicians using a four-point Likert scale. The value of GAI in various application scenarios was evaluated.

RESULTS

Gemini scored 45, ERNIE Bot scored 38, and ChatGPT scored 33 for clinical documentation and image production. For research assistance, Gemini achieved 45, ERNIE Bot had 39, and ChatGPT scored 35. Teaching assistance capabilities were rated at 54 for ERNIE Bot, 50 for Gemini, and 48 for ChatGPT. In patient consultation and guidance, Gemini scored 78, ERNIE Bot scored 59, and ChatGPT scored 48. Overall, the total scores were 218, 190, and 164 for Gemini, ERNIE Bot, and ChatGPT, respectively. Among GAI applications, the top scoring categories were article translation and polishing (26), patient-doctor communication documentation (23), and popular science content creation (23). The lowest scoring categories were literature search and reporting (13) and image generation (12).

CONCLUSIONS

In the Chinese context, the application value of GAI is the highest for Gemini, followed by ERNIE Bot and ChatGPT. GAI shows significant value in translation, patient-doctor communication, and popular science writing. However, its value in literature search, reporting, and image generation remains limited.

摘要

目的

本研究旨在比较和分析三种生成式人工智能（GAI），并探讨它们在中国语境下在口腔医学领域的应用价值和存在的问题。

方法

共设计 36 个问题，涵盖口腔医学的所有专业领域。这些问题包括病历、专业知识以及翻译和编辑等各个方面。将这些问题提交给 ChatGPT4-turbo、Gemini（2024.2）和 ERNIE Bot 4.0。在获得答案后，由三位有经验的口腔医学医生使用四点 Likert 量表进行盲法评估。评估 GAI 在各种应用场景中的价值。

结果

在临床文档和图像生成方面，Gemini 得分为 45，ERNIE Bot 得分为 38，ChatGPT 得分为 33。在研究辅助方面，Gemini 得分为 45，ERNIE Bot 得分为 39，ChatGPT 得分为 35。教学辅助能力方面，ERNIE Bot 得分为 54，Gemini 得分为 50，ChatGPT 得分为 48。在患者咨询和指导方面，Gemini 得分为 78，ERNIE Bot 得分为 59，ChatGPT 得分为 48。总体而言，Gemini、ERNIE Bot 和 ChatGPT 的总分为 218、190 和 164。在 GAI 应用中，得分最高的类别是文章翻译和润色（26）、医患沟通文档（23）和科普内容创作（23）。得分最低的类别是文献检索和报告（13）和图像生成（12）。

结论

在中国语境下，GAI 的应用价值以 Gemini 最高，其次是 ERNIE Bot 和 ChatGPT。GAI 在翻译、医患沟通和科普写作方面具有显著价值。然而，它在文献检索、报告和图像生成方面的价值仍然有限。

相似文献

Application value of generative artificial intelligence in the field of stomatology.生成式人工智能在口腔医学领域的应用价值。

Hua Xi Kou Qiang Yi Xue Za Zhi. 2024 Dec 1;42(6):810-815. doi: 10.7518/hxkq.2024.2024144.

Physician Versus Large Language Model Chatbot Responses to Web-Based Questions From Autistic Patients in Chinese: Cross-Sectional Comparative Analysis.中文自闭症患者网络问诊中，医生与大型语言模型聊天机器人回复的对比分析：横断面研究。

J Med Internet Res. 2024 Apr 30;26:e54706. doi: 10.2196/54706.

Comparing the performance of ChatGPT and ERNIE Bot in answering questions regarding liver cancer interventional radiology in Chinese and English contexts: A comparative study.比较ChatGPT和文心一言在中英文语境下回答肝癌介入放射学相关问题的性能：一项比较研究。

Digit Health. 2025 Jan 23;11:20552076251315511. doi: 10.1177/20552076251315511. eCollection 2025 Jan-Dec.

The performance of ChatGPT and ERNIE Bot in surgical resident examinations.ChatGPT和文心一言在外科住院医师考试中的表现。

Int J Med Inform. 2025 Aug;200:105906. doi: 10.1016/j.ijmedinf.2025.105906. Epub 2025 Apr 4.

Performance of Artificial Intelligence Chatbots on Ultrasound Examinations: Cross-Sectional Comparative Analysis.人工智能聊天机器人在超声检查中的表现：横断面比较分析。

JMIR Med Inform. 2025 Jan 9;13:e63924. doi: 10.2196/63924.

Comparison of artificial intelligence-generated and physician-generated patient education materials on early diabetic kidney disease.人工智能生成与医生生成的早期糖尿病肾病患者教育材料的比较

Front Endocrinol (Lausanne). 2025 Apr 22;16:1559265. doi: 10.3389/fendo.2025.1559265. eCollection 2025.

Evidence-Based Potential of Generative Artificial Intelligence Large Language Models on Dental Avulsion: ChatGPT Versus Gemini.生成式人工智能大语言模型在牙脱位方面基于证据的潜力：ChatGPT与Gemini对比

Dent Traumatol. 2025 Apr;41(2):178-186. doi: 10.1111/edt.12999. Epub 2024 Nov 2.

Assessing the performance of large language models (LLMs) in answering medical questions regarding breast cancer in the Chinese context.评估大语言模型（LLMs）在中国背景下回答有关乳腺癌医学问题的表现。

Digit Health. 2024 Oct 7;10:20552076241284771. doi: 10.1177/20552076241284771. eCollection 2024 Jan-Dec.

The use of ChatGPT and Google Gemini in responding to orthognathic surgery-related questions: A comparative study.ChatGPT与谷歌Gemini在回答正颌外科相关问题中的应用：一项比较研究。

J World Fed Orthod. 2025 Feb;14(1):20-26. doi: 10.1016/j.ejwf.2024.09.004. Epub 2024 Oct 28.

Evaluating ChatGPT and Google Gemini Performance and Implications in Turkish Dental Education.评估ChatGPT和谷歌Gemini在土耳其牙科教育中的性能及影响

Cureus. 2025 Jan 11;17(1):e77292. doi: 10.7759/cureus.77292. eCollection 2025 Jan.

本文引用的文献

Peer review of GPT-4 technical report and systems card.GPT-4技术报告和系统卡片的同行评审。

PLOS Digit Health. 2024 Jan 18;3(1):e0000417. doi: 10.1371/journal.pdig.0000417. eCollection 2024 Jan.

Evaluating GPT as an Adjunct for Radiologic Decision Making: GPT-4 Versus GPT-3.5 in a Breast Imaging Pilot.评估 GPT 作为放射学决策辅助工具：GPT-4 与 GPT-3.5 在乳腺成像试点中的比较。

J Am Coll Radiol. 2023 Oct;20(10):990-997. doi: 10.1016/j.jacr.2023.05.003. Epub 2023 Jun 21.

Evaluating the Performance of ChatGPT in Ophthalmology: An Analysis of Its Successes and Shortcomings.评估ChatGPT在眼科领域的表现：对其优缺点的分析。

Ophthalmol Sci. 2023 May 5;3(4):100324. doi: 10.1016/j.xops.2023.100324. eCollection 2023 Dec.

Is Chat-GPT4 a qualified surgical oncologist?Chat-GPT4是一名合格的外科肿瘤学家吗？

Int J Surg. 2023 Sep 1;109(9):2846-2848. doi: 10.1097/JS9.0000000000000504.

ChatGPT Utility in Healthcare Education, Research, and Practice: Systematic Review on the Promising Perspectives and Valid Concerns.ChatGPT在医学教育、研究与实践中的应用：对其前景与合理担忧的系统评价

Healthcare (Basel). 2023 Mar 19;11(6):887. doi: 10.3390/healthcare11060887.

Assessing the performance of ChatGPT in answering questions regarding cirrhosis and hepatocellular carcinoma.评估 ChatGPT 在回答肝硬化和肝细胞癌相关问题方面的表现。

Clin Mol Hepatol. 2023 Jul;29(3):721-732. doi: 10.3350/cmh.2023.0089. Epub 2023 Mar 22.

ChatGPT and other artificial intelligence applications speed up scientific writing.ChatGPT和其他人工智能应用程序加快了科学写作的速度。

J Chin Med Assoc. 2023 Apr 1;86(4):351-353. doi: 10.1097/JCMA.0000000000000900. Epub 2023 Feb 14.

Exploring ChatGPT for information of cardiopulmonary resuscitation.探索ChatGPT以获取心肺复苏相关信息。

Resuscitation. 2023 Apr;185:109729. doi: 10.1016/j.resuscitation.2023.109729. Epub 2023 Feb 10.

ChatGPT: the future of discharge summaries?ChatGPT：出院小结的未来？

Lancet Digit Health. 2023 Mar;5(3):e107-e108. doi: 10.1016/S2589-7500(23)00021-3. Epub 2023 Feb 6.

ChatGPT: can artificial intelligence language models be of value for cardiovascular nurses and allied health professionals.ChatGPT：人工智能语言模型对心血管护士和相关健康专业人员有价值吗？

Eur J Cardiovasc Nurs. 2023 Oct 19;22(7):e55-e59. doi: 10.1093/eurjcn/zvad022.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验