Ahmed Walaa Magdy, Azhari Amr Ahmed, Alfaraj Amal, Alhamadani Abdulaziz, Zhang Min, Lu Chang-Tien
Department of Restorative Dentistry, Faculty of Dentistry, King Abdulaziz University, Jeddah, Saudi Arabia.
Department of Prosthodontics, School of Dentistry, King Faisal University, Al Ahsa, Saudi Arabia.
Heliyon. 2024 Mar 19;10(7):e28198. doi: 10.1016/j.heliyon.2024.e28198. eCollection 2024 Apr 15.
AI technology presents a variety of benefits and challenges for educators.
To investigate whether ChatGPT and Google Bard (now named Gemini) are valuable resources for educators generating multiple-choice questions on dental caries.
A book on dental caries was used. Sixteen paragraphs were extracted by an expert consultant based on applicability and potential for developing multiple-choice questions. ChatGPT and Bard language models were used to produce multiple-choice questions based on this input, and 64 questions were generated. Three dental specialists assessed the relevance, accuracy, and complexity of the generated questions. The questions were qualitatively evaluated using cognitive learning objectives and item writing flaws. Paired sample t-tests and two-way analysis of variance (ANOVA) were used to compare the generated multiple-choice questions and answers between ChatGPT and Bard.
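As an illustration of the paired comparison described above, the following is a minimal sketch of a paired-sample t statistic computed with the Python standard library only. It is not the authors' analysis code. The function name `paired_t_statistic` and the ratings shown are hypothetical, and the sketch assumes each ChatGPT question is paired with the Bard question generated from the same paragraph.

```python
import math
import statistics


def paired_t_statistic(scores_a, scores_b):
    """Return (t, degrees_of_freedom) for a paired-sample t-test.

    scores_a and scores_b are equal-length sequences of ratings,
    where position i in both refers to the same paired item.
    """
    if len(scores_a) != len(scores_b):
        raise ValueError("paired samples must have the same length")
    if len(scores_a) < 2:
        raise ValueError("at least two pairs are required")

    # Work on the per-pair differences, as the paired test does.
    diffs = [a - b for a, b in zip(scores_a, scores_b)]
    mean_diff = statistics.mean(diffs)
    sd_diff = statistics.stdev(diffs)  # sample standard deviation (n - 1)
    if sd_diff == 0:
        raise ValueError("all differences are identical; t is undefined")

    standard_error = sd_diff / math.sqrt(len(diffs))
    return mean_diff / standard_error, len(diffs) - 1


# Hypothetical ratings for illustration only (not data from the study).
chatgpt_ratings = [4, 5, 3, 4]
bard_ratings = [3, 5, 2, 4]
t_value, dof = paired_t_statistic(chatgpt_ratings, bard_ratings)
print(f"t = {t_value:.3f}, df = {dof}")
```

The resulting t value would then be compared against a t distribution with the returned degrees of freedom to obtain a p-value; a statistics package would normally handle that step.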
There were no significant differences between the questions generated by ChatGPT and Bard. Moreover, the analysis of variance found no significant differences in question quality. Bard-generated questions tended to have higher cognitive levels than those of ChatGPT. Format errors were the predominant item-writing flaw in ChatGPT-generated questions. Finally, Bard-generated questions contained more absolute terms than ChatGPT-generated questions.
ChatGPT and Bard could generate questions related to dental caries, mainly at the cognitive level of knowledge and comprehension.
Language models are crucial for generating subject-specific questions used in quizzes, tests, and education. By using these models, educators can save time and focus on lesson preparation and student engagement instead of solely focusing on assessment creation. Additionally, language models are adept at generating numerous questions, making them particularly valuable for large-scale exams. However, educators must carefully review and adapt the questions to ensure they align with their learning goals.