人工智能生成的龋齿多项选择题的质量：ChatGPT和谷歌巴德语言模型的比较分析

The Quality of AI-Generated Dental Caries Multiple Choice Questions: A Comparative Analysis of ChatGPT and Google Bard Language Models.

作者信息

Ahmed Walaa Magdy, Azhari Amr Ahmed, Alfaraj Amal, Alhamadani Abdulaziz, Zhang Min, Lu Chang-Tien

机构信息

Department of Restorative Dentistry, Faculty of Dentistry, King Abdulaziz University, Jeddah, Saudi Arabia.

Department of Prosthodontics, School of Dentistry, King Faisal Universality, Al Ahsa, Saudi Arabia.

出版信息

Heliyon. 2024 Mar 19;10(7):e28198. doi: 10.1016/j.heliyon.2024.e28198. eCollection 2024 Apr 15.

DOI:10.1016/j.heliyon.2024.e28198

PMID:38596020

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11002540/

Abstract

STATEMENT OF PROBLEM

AI technology presents a variety of benefits and challenges for educators.

PURPOSE

To investigate whether ChatGPT and Google Bard (now is named Gemini) are valuable resources for generating multiple-choice questions for educators of dental caries.

MATERIAL AND METHODS

A book on dental caries was used. Sixteen paragraphs were extracted by an expert consultant based on applicability and potential for developing multiple-choice questions. ChatGPT and Bard language models were used to produce multiple-choice questions based on this input, and 64 questions were generated. Three dental specialists assessed the relevance, accuracy, and complexity of the generated questions. The questions were qualitatively evaluated using cognitive learning objectives and item writing flaws. Paired sample t-tests and two-way analysis of variance (ANOVA) were used to compare the generated multiple-choice questions and answers between ChatGPT and Bard.

RESULTS

There were no significant differences between the questions generated by ChatGPT and Bard. Moreover, the analysis of variance found no significant differences in question quality. Bard-generated questions tended to have higher cognitive levels than those of ChatGPT. Format error was predominant in ChatGPT-generated questions. Finally, Bard exhibited more absolute terms than ChatGPT.

CONCLUSIONS

ChatGPT and Bard could generate questions related to dental caries, mainly at the cognitive level of knowledge and comprehension.

CLINICAL SIGNIFICANCE

Language models are crucial for generating subject-specific questions used in quizzes, tests, and education. By using these models, educators can save time and focus on lesson preparation and student engagement instead of solely focusing on assessment creation. Additionally, language models are adept at generating numerous questions, making them particularly valuable for large-scale exams. However, educators must carefully review and adapt the questions to ensure they align with their learning goals.

摘要

问题陈述

人工智能技术给教育工作者带来了各种好处和挑战。

目的

研究ChatGPT和谷歌巴德（现名为Gemini）是否是为龋齿教育工作者生成多项选择题的宝贵资源。

材料与方法

使用一本关于龋齿的书籍。一位专家顾问根据开发多项选择题的适用性和潜力提取了16个段落。使用ChatGPT和巴德语言模型根据这些输入生成多项选择题，共生成了64个问题。三位牙科专家评估了所生成问题的相关性、准确性和复杂性。使用认知学习目标和题目编写缺陷对问题进行定性评估。使用配对样本t检验和双向方差分析（ANOVA）来比较ChatGPT和巴德生成的多项选择题及答案。

结果

ChatGPT和巴德生成的问题之间没有显著差异。此外，方差分析发现问题质量没有显著差异。巴德生成的问题往往比ChatGPT生成的问题具有更高的认知水平。格式错误在ChatGPT生成的问题中占主导地位。最后，巴德使用的绝对术语比ChatGPT更多。

结论

ChatGPT和巴德可以生成与龋齿相关的问题，主要处于知识和理解的认知水平。

临床意义

语言模型对于生成用于测验、考试和教育的特定学科问题至关重要。通过使用这些模型，教育工作者可以节省时间，将精力集中在课程准备和学生参与上，而不是仅仅专注于创建评估。此外，语言模型擅长生成大量问题，使其在大规模考试中特别有价值。然而，教育工作者必须仔细审查并调整问题，以确保它们符合学习目标。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33ab/11002540/b4d01262b4ad/gr1.jpg

相似文献

The Quality of AI-Generated Dental Caries Multiple Choice Questions: A Comparative Analysis of ChatGPT and Google Bard Language Models.人工智能生成的龋齿多项选择题的质量：ChatGPT和谷歌巴德语言模型的比较分析

Heliyon. 2024 Mar 19;10(7):e28198. doi: 10.1016/j.heliyon.2024.e28198. eCollection 2024 Apr 15.

Assessing the Accuracy of Information on Medication Abortion: A Comparative Analysis of ChatGPT and Google Bard AI.评估药物流产信息的准确性：ChatGPT与谷歌巴德人工智能的比较分析

Cureus. 2024 Jan 2;16(1):e51544. doi: 10.7759/cureus.51544. eCollection 2024 Jan.

Evaluation of the Current Status of Artificial Intelligence for Endourology Patient Education: A Blind Comparison of ChatGPT and Google Bard Against Traditional Information Resources.评估人工智能在泌尿内镜患者教育中的现状：ChatGPT 和 Google Bard 与传统信息资源的盲对比。

J Endourol. 2024 Aug;38(8):843-851. doi: 10.1089/end.2023.0696. Epub 2024 May 17.

Analysing the Applicability of ChatGPT, Bard, and Bing to Generate Reasoning-Based Multiple-Choice Questions in Medical Physiology.分析ChatGPT、Bard和必应在医学生理学中生成基于推理的多项选择题的适用性。

Cureus. 2023 Jun 26;15(6):e40977. doi: 10.7759/cureus.40977. eCollection 2023 Jun.

How AI Responds to Common Lung Cancer Questions: ChatGPT vs Google Bard.人工智能如何回答常见肺癌问题：ChatGPT 与 Google Bard 对比。

Radiology. 2023 Jun;307(5):e230922. doi: 10.1148/radiol.230922.

The performance of artificial intelligence models in generating responses to general orthodontic questions: ChatGPT vs Google Bard.人工智能模型在生成正畸常见问题回答方面的表现：ChatGPT与谷歌巴德的对比

Am J Orthod Dentofacial Orthop. 2024 Jun;165(6):652-662. doi: 10.1016/j.ajodo.2024.01.012. Epub 2024 Mar 15.

Large Language Models in Hematology Case Solving: A Comparative Study of ChatGPT-3.5, Google Bard, and Microsoft Bing.大语言模型在血液学病例解决中的应用：ChatGPT-3.5、谷歌巴德和微软必应的比较研究

Cureus. 2023 Aug 21;15(8):e43861. doi: 10.7759/cureus.43861. eCollection 2023 Aug.

Assessing the Reproducibility of the Structured Abstracts Generated by ChatGPT and Bard Compared to Human-Written Abstracts in the Field of Spine Surgery: Comparative Analysis.评估 ChatGPT 和 Bard 生成的结构化摘要与脊柱外科领域人类撰写的摘要在可重复性方面的比较：对比分析。

J Med Internet Res. 2024 Jun 26;26:e52001. doi: 10.2196/52001.

Comparison of Large Language Models in Answering Immuno-Oncology Questions: A Cross-Sectional Study.大型语言模型在回答免疫肿瘤学问题中的比较：一项横断面研究。

medRxiv. 2023 Oct 31:2023.10.31.23297825. doi: 10.1101/2023.10.31.23297825.

Evidence-based potential of generative artificial intelligence large language models in orthodontics: a comparative study of ChatGPT, Google Bard, and Microsoft Bing.生成式人工智能大语言模型在正畸学中的循证潜力：ChatGPT、谷歌巴德和微软必应的比较研究

Eur J Orthod. 2024 Apr 13. doi: 10.1093/ejo/cjae017.

引用本文的文献

Comparison of artificial intelligence systems in answering prosthodontics questions from the dental specialty exam in Turkey.土耳其牙科专业考试中人工智能系统回答口腔修复学问题的比较

J Dent Sci. 2025 Jul;20(3):1454-1459. doi: 10.1016/j.jds.2025.01.025. Epub 2025 Jan 31.

Evaluation of Large Language Model Performance in Answering Clinical Questions on Periodontal Furcation Defect Management.大语言模型在回答牙周根分叉病变管理临床问题中的性能评估

Dent J (Basel). 2025 Jun 18;13(6):271. doi: 10.3390/dj13060271.

本文引用的文献

Future Potential Challenges of Using Large Language Models Like ChatGPT in Daily Medical Practice.在日常医疗实践中使用像ChatGPT这样的大语言模型未来可能面临的挑战。

J Am Coll Radiol. 2024 Feb;21(2):344-345. doi: 10.1016/j.jacr.2023.10.019. Epub 2023 Nov 3.

ChatGPT for shaping the future of dentistry: the potential of multi-modal large language model.ChatGPT 塑造牙科的未来：多模态大语言模型的潜力。

Int J Oral Sci. 2023 Jul 28;15(1):29. doi: 10.1038/s41368-023-00239-y.

Early applications of ChatGPT in medical practice, education and research.ChatGPT 在医疗实践、教育和研究中的早期应用。

Clin Med (Lond). 2023 May;23(3):278-279. doi: 10.7861/clinmed.2023-0078. Epub 2023 Apr 21.

Implications of large language models such as ChatGPT for dental medicine.ChatGPT 等大型语言模型对牙科医学的影响。

J Esthet Restor Dent. 2023 Oct;35(7):1098-1102. doi: 10.1111/jerd.13046. Epub 2023 Apr 5.

ChatGPT and the Future of Medical Writing.ChatGPT与医学写作的未来。

Radiology. 2023 Apr;307(2):e223312. doi: 10.1148/radiol.223312. Epub 2023 Feb 2.

Software Systems Security Vulnerabilities Management by Exploring the Capabilities of Language Models Using NLP.利用自然语言处理探索语言模型的能力进行软件系统安全漏洞管理。

Comput Intell Neurosci. 2021 Dec 27;2021:8522839. doi: 10.1155/2021/8522839. eCollection 2021.

Fairness and accountability of AI in disaster risk management: Opportunities and challenges.人工智能在灾害风险管理中的公平性与问责制：机遇与挑战。

Patterns (N Y). 2021 Nov 12;2(11):100363. doi: 10.1016/j.patter.2021.100363.

Privacy and artificial intelligence: challenges for protecting health information in a new era.隐私与人工智能：新时代保护健康信息的挑战。

BMC Med Ethics. 2021 Sep 15;22(1):122. doi: 10.1186/s12910-021-00687-3.

Application of Artificial Intelligence in Dentistry.人工智能在牙科中的应用。

J Dent Res. 2021 Mar;100(3):232-244. doi: 10.1177/0022034520969115. Epub 2020 Oct 29.

Are Multiple Choice Questions for Post Graduate Dental Entrance Examinations Spot On?-Item Analysis of MCQs in Prosthodontics in India.多选择题是否适用于牙科学位入学考试？——印度修复学多项选择题的项目分析。

J Natl Med Assoc. 2018 Oct;110(5):455-458. doi: 10.1016/j.jnma.2017.11.001. Epub 2017 Dec 6.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

人工智能生成的龋齿多项选择题的质量：ChatGPT和谷歌巴德语言模型的比较分析

The Quality of AI-Generated Dental Caries Multiple Choice Questions: A Comparative Analysis of ChatGPT and Google Bard Language Models.

作者信息

机构信息

出版信息

STATEMENT OF PROBLEM

PURPOSE

MATERIAL AND METHODS

RESULTS

CONCLUSIONS

CLINICAL SIGNIFICANCE

问题陈述

目的

材料与方法

结果

结论

临床意义

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献