The Advanced Reasoning Capabilities of Large Language Models for Detecting Contraindicated Options in Medical Exams.

Authors

Yano Yuichiro, Ohashi Mizuki, Miyagami Taiju, Mori Hirotake, Nishizaki Yuji, Daida Hiroyuki, Naito Toshio

Affiliations

Department of General Medicine, Juntendo University Faculty of Medicine, 2-1-1, Hongo, Bunkyo-Ku, Tokyo, 113-8421, Japan, 81 3-3813-3111.

AI Incubation Farm, Juntendo University Faculty of Medicine, Tokyo, Japan.

Publication information

JMIR Med Inform. 2025 May 12;13:e68527. doi: 10.2196/68527.

Abstract

Enhancing clinical reasoning and reducing diagnostic errors are essential in medical practice. On 15 Japanese National Medical Licensing Examination questions, OpenAI-o1, a model with advanced reasoning capabilities, outperformed GPT-4 in both accuracy (100% vs 80%) and detection of contraindicated options (87% vs 73%). These findings are preliminary because of the small sample size.
