Department of Pharmacotherapy, University of Utah College of Pharmacy, Salt Lake City, UT, USA.
Department of Pharmacotherapy, University of Utah College of Pharmacy, Salt Lake City, UT, USA.
Am J Pharm Educ. 2024 Oct;88(10):101266. doi: 10.1016/j.ajpe.2024.101266. Epub 2024 Aug 15.
This study aimed to develop a prompt engineering procedure for test question mapping and then determine the effectiveness of test question mapping using Chat Generative Pre-Trained Transformer (ChatGPT) compared to human faculty mapping.
We conducted a cross-sectional study to compare ChatGPT and human mapping using a sample of 139 test questions from modules within the Integrated Pharmacotherapeutics course series. The test questions were mapped by 3 faculty members to both module objectives and the Accreditation Council for Pharmacy Education Standards 2016 (Standards 2016) to create the "correct answer". Prompt engineering procedures were created to facilitate mapping with ChatGPT, and ChatGPT mapping results were compared with human mapping.
ChatGPT mapped test questions directly to the "correct answer" based on human consensus in 68.0% of cases, and the program matched with at least one individual human response in another 20.1% of cases for a total of 88.1% agreement with human mappers. When humans fully agreed with the mapping decision, ChatGPT was more likely to map correctly.
This study presents a practical use case with prompt engineering tailored for college assessment or curriculum committees to facilitate efficient test questions and educational outcomes mapping.
本研究旨在开发一种用于测试问题映射的提示工程程序,然后确定使用 ChatGPT(Chat Generative Pre-Trained Transformer)进行测试问题映射的有效性,与人类教师映射进行比较。
我们进行了一项横断面研究,使用综合药物治疗课程系列模块中的 139 个测试问题样本,比较了 ChatGPT 和人类的映射。将这些测试问题由 3 名教师分别映射到模块目标和药学教育认证委员会 2016 年标准(2016 年标准),以创建“正确答案”。创建了提示工程程序来促进与 ChatGPT 的映射,比较了 ChatGPT 映射结果与人类映射结果。
ChatGPT 根据人类共识直接将测试问题映射到“正确答案”,在 68.0%的情况下,程序与至少一名人类个体的反应相匹配,总共有 88.1%的案例与人类映射者达成一致。当人类完全同意映射决策时,ChatGPT 更有可能正确映射。
本研究提出了一个实用案例,提示工程针对学院评估或课程委员会进行了定制,以促进高效的测试问题和教育成果映射。