Nissen Leon, Rother Johanna Flora, Heinemann Marie, Reimer Lara Marie, Jonas Stephan, Raupach Tobias
Institute for Digital Medicine, University Hospital Bonn, Bonn, Germany.
Institute of Medical Education, University Hospital Bonn, Bonn, Germany.
Med Teach. 2025 Sep;47(9):1544-1550. doi: 10.1080/0142159X.2025.2451870. Epub 2025 Jan 20.
Self-testing has been proven to significantly improve not only simple learning outcomes, but also higher-order skills such as clinical reasoning in medical students. Previous studies have shown that self-testing was especially beneficial when it was presented with feedback, which leaves the question whether an immediate and personalized feedback further encourages this effect. Therefore, we hypothesised that individual feedback has a greater effect on learning outcomes, compared to generic feedback.
In a randomised cross-over trial, German medical students were invited to voluntarily answer daily key-feature questions an App. For half of the items they received a generalised feedback by an expert, while the feedback on the other half was generated immediately through ChatGPT. After the intervention, the students participated in a mandatory exit exam.
Those participants who used the app more frequently experienced a better learning outcome compared to those who did not use it frequently, even though this finding was only examined in a correlative nature. The individual ChatGPT generated feedback did not show a greater effect on exit exam scores compared to the expert comment (51.8 ± 22.0% vs. 55.8 ± 22.8%; = 0.06).
This study proves the concept of providing personalised feedback on medical questions. Despite the promising results, improved prompting and further development of the application seems necessary to strengthen the possible impact of the personalised feedback. Our study closes a research gap and holds great potential for further use not only in medicine but also in other academic fields.
自我测试已被证明不仅能显著提高简单的学习成果,还能提升医学生的高阶技能,如临床推理能力。先前的研究表明,自我测试在提供反馈时尤其有益,这就引出了一个问题,即即时和个性化的反馈是否会进一步增强这种效果。因此,我们假设与一般性反馈相比,个性化反馈对学习成果的影响更大。
在一项随机交叉试验中,邀请德国医学生自愿通过一款应用程序回答每日关键特征问题。对于一半的题目,他们收到专家提供的一般性反馈,而另一半题目的反馈则通过ChatGPT即时生成。干预结束后,学生们参加了一场强制性的结业考试。
与不常使用该应用程序的参与者相比,那些更频繁使用该应用程序的参与者取得了更好的学习成果,尽管这一发现仅以相关性方式进行了检验。与专家评论相比,ChatGPT生成的个性化反馈对结业考试成绩并未显示出更大的影响(51.8±22.0%对55.8±22.8%;P = 0.06)。
本研究证明了针对医学问题提供个性化反馈的概念。尽管取得了有前景的结果,但似乎有必要改进提示方式并进一步开发该应用程序,以增强个性化反馈可能产生的影响。我们的研究填补了一个研究空白,不仅在医学领域,而且在其他学术领域都具有进一步应用的巨大潜力。