State Key Laboratory of Oral Diseases & National Center for Stomatology & National Clinical Research Center for Oral Diseases & Dept. of Traumatic and Plastic Surgery, West China Hospital of Stomatology, Sichuan University, Chengdu 610041, China.
Hua Xi Kou Qiang Yi Xue Za Zhi. 2024 Dec 1;42(6):810-815. doi: 10.7518/hxkq.2024.2024144.
This study aims to compare and analyze three types of generative artificial intelligence (GAI) and explore their application value and existing problems in the field of stomatology in the Chinese context.
A total of 36 questions were designed, covering all the professional areas of stomatology. The questions encompassed various aspects including medical records, professional knowledge, and translation and editing. These questions were submitted to ChatGPT4-turbo, Gemini (2024.2) and ERNIE Bot 4.0. After obtaining the answers, a blinded evaluation was conducted by three experienced oral medicine physicians using a four-point Likert scale. The value of GAI in various application scenarios was evaluated.
Gemini scored 45, ERNIE Bot scored 38, and ChatGPT scored 33 for clinical documentation and image production. For research assistance, Gemini achieved 45, ERNIE Bot had 39, and ChatGPT scored 35. Teaching assistance capabilities were rated at 54 for ERNIE Bot, 50 for Gemini, and 48 for ChatGPT. In patient consultation and guidance, Gemini scored 78, ERNIE Bot scored 59, and ChatGPT scored 48. Overall, the total scores were 218, 190, and 164 for Gemini, ERNIE Bot, and ChatGPT, respectively. Among GAI applications, the top scoring categories were article translation and polishing (26), patient-doctor communication documentation (23), and popular science content creation (23). The lowest scoring categories were literature search and reporting (13) and image generation (12).
In the Chinese context, the application value of GAI is the highest for Gemini, followed by ERNIE Bot and ChatGPT. GAI shows significant value in translation, patient-doctor communication, and popular science writing. However, its value in literature search, reporting, and image generation remains limited.
本研究旨在比较和分析三种生成式人工智能(GAI),并探讨它们在中国语境下在口腔医学领域的应用价值和存在的问题。
共设计 36 个问题,涵盖口腔医学的所有专业领域。这些问题包括病历、专业知识以及翻译和编辑等各个方面。将这些问题提交给 ChatGPT4-turbo、Gemini(2024.2)和 ERNIE Bot 4.0。在获得答案后,由三位有经验的口腔医学医生使用四点 Likert 量表进行盲法评估。评估 GAI 在各种应用场景中的价值。
在临床文档和图像生成方面,Gemini 得分为 45,ERNIE Bot 得分为 38,ChatGPT 得分为 33。在研究辅助方面,Gemini 得分为 45,ERNIE Bot 得分为 39,ChatGPT 得分为 35。教学辅助能力方面,ERNIE Bot 得分为 54,Gemini 得分为 50,ChatGPT 得分为 48。在患者咨询和指导方面,Gemini 得分为 78,ERNIE Bot 得分为 59,ChatGPT 得分为 48。总体而言,Gemini、ERNIE Bot 和 ChatGPT 的总分为 218、190 和 164。在 GAI 应用中,得分最高的类别是文章翻译和润色(26)、医患沟通文档(23)和科普内容创作(23)。得分最低的类别是文献检索和报告(13)和图像生成(12)。
在中国语境下,GAI 的应用价值以 Gemini 最高,其次是 ERNIE Bot 和 ChatGPT。GAI 在翻译、医患沟通和科普写作方面具有显著价值。然而,它在文献检索、报告和图像生成方面的价值仍然有限。