Suppr超能文献

人工智能生成的龋齿多项选择题的质量:ChatGPT和谷歌巴德语言模型的比较分析

The Quality of AI-Generated Dental Caries Multiple Choice Questions: A Comparative Analysis of ChatGPT and Google Bard Language Models.

作者信息

Ahmed Walaa Magdy, Azhari Amr Ahmed, Alfaraj Amal, Alhamadani Abdulaziz, Zhang Min, Lu Chang-Tien

机构信息

Department of Restorative Dentistry, Faculty of Dentistry, King Abdulaziz University, Jeddah, Saudi Arabia.

Department of Prosthodontics, School of Dentistry, King Faisal Universality, Al Ahsa, Saudi Arabia.

出版信息

Heliyon. 2024 Mar 19;10(7):e28198. doi: 10.1016/j.heliyon.2024.e28198. eCollection 2024 Apr 15.

Abstract

STATEMENT OF PROBLEM

AI technology presents a variety of benefits and challenges for educators.

PURPOSE

To investigate whether ChatGPT and Google Bard (now is named Gemini) are valuable resources for generating multiple-choice questions for educators of dental caries.

MATERIAL AND METHODS

A book on dental caries was used. Sixteen paragraphs were extracted by an expert consultant based on applicability and potential for developing multiple-choice questions. ChatGPT and Bard language models were used to produce multiple-choice questions based on this input, and 64 questions were generated. Three dental specialists assessed the relevance, accuracy, and complexity of the generated questions. The questions were qualitatively evaluated using cognitive learning objectives and item writing flaws. Paired sample t-tests and two-way analysis of variance (ANOVA) were used to compare the generated multiple-choice questions and answers between ChatGPT and Bard.

RESULTS

There were no significant differences between the questions generated by ChatGPT and Bard. Moreover, the analysis of variance found no significant differences in question quality. Bard-generated questions tended to have higher cognitive levels than those of ChatGPT. Format error was predominant in ChatGPT-generated questions. Finally, Bard exhibited more absolute terms than ChatGPT.

CONCLUSIONS

ChatGPT and Bard could generate questions related to dental caries, mainly at the cognitive level of knowledge and comprehension.

CLINICAL SIGNIFICANCE

Language models are crucial for generating subject-specific questions used in quizzes, tests, and education. By using these models, educators can save time and focus on lesson preparation and student engagement instead of solely focusing on assessment creation. Additionally, language models are adept at generating numerous questions, making them particularly valuable for large-scale exams. However, educators must carefully review and adapt the questions to ensure they align with their learning goals.

摘要

问题陈述

人工智能技术给教育工作者带来了各种好处和挑战。

目的

研究ChatGPT和谷歌巴德(现名为Gemini)是否是为龋齿教育工作者生成多项选择题的宝贵资源。

材料与方法

使用一本关于龋齿的书籍。一位专家顾问根据开发多项选择题的适用性和潜力提取了16个段落。使用ChatGPT和巴德语言模型根据这些输入生成多项选择题,共生成了64个问题。三位牙科专家评估了所生成问题的相关性、准确性和复杂性。使用认知学习目标和题目编写缺陷对问题进行定性评估。使用配对样本t检验和双向方差分析(ANOVA)来比较ChatGPT和巴德生成的多项选择题及答案。

结果

ChatGPT和巴德生成的问题之间没有显著差异。此外,方差分析发现问题质量没有显著差异。巴德生成的问题往往比ChatGPT生成的问题具有更高的认知水平。格式错误在ChatGPT生成的问题中占主导地位。最后,巴德使用的绝对术语比ChatGPT更多。

结论

ChatGPT和巴德可以生成与龋齿相关的问题,主要处于知识和理解的认知水平。

临床意义

语言模型对于生成用于测验、考试和教育的特定学科问题至关重要。通过使用这些模型,教育工作者可以节省时间,将精力集中在课程准备和学生参与上,而不是仅仅专注于创建评估。此外,语言模型擅长生成大量问题,使其在大规模考试中特别有价值。然而,教育工作者必须仔细审查并调整问题,以确保它们符合学习目标。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/33ab/11002540/b4d01262b4ad/gr1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验