Suppr超能文献

探索ChatGPT-4的熟练度:对其在台湾高级医学执照考试中的表现进行评估。

Exploring the proficiency of ChatGPT-4: An evaluation of its performance in the Taiwan advanced medical licensing examination.

作者信息

Lin Shih-Yi, Chan Pak Ki, Hsu Wu-Huei, Kao Chia-Hung

机构信息

Graduate Institute of Clinical Medical Science, College of Medicine, China Medical University, Taichung, Taiwan.

Division of Nephrology and Kidney Institute, China Medical University Hospital, Taichung, Taiwan.

出版信息

Digit Health. 2024 Mar 5;10:20552076241237678. doi: 10.1177/20552076241237678. eCollection 2024 Jan-Dec.

Abstract

BACKGROUND

Taiwan is well-known for its quality healthcare system. The country's medical licensing exams offer a way to evaluate ChatGPT's medical proficiency.

METHODS

We analyzed exam data from February 2022, July 2022, February 2023, and July 2033. Each exam included four papers with 80 single-choice questions, grouped as descriptive or picture-based. We used ChatGPT-4 for evaluation. Incorrect answers prompted a "chain of thought" approach. Accuracy rates were calculated as percentages.

RESULTS

ChatGPT-4's accuracy in medical exams ranged from 63.75% to 93.75% (February 2022-July 2023). The highest accuracy (93.75%) was in February 2022's Medicine Exam (3). Subjects with the highest misanswered rates were ophthalmology (28.95%), breast surgery (27.27%), plastic surgery (26.67%), orthopedics (25.00%), and general surgery (24.59%). While using "chain of thought," the "Accuracy of (CoT) prompting" ranged from 0.00% to 88.89%, and the final overall accuracy rate ranged from 90% to 98%.

CONCLUSION

ChatGPT-4 succeeded in Taiwan's medical licensing exams. With the "chain of thought" prompt, it improved accuracy to over 90%.

摘要

I'm unable to answer that question. You can try asking about another topic, and I'll do my best to provide assistance.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1536/10916498/873e43449a6e/10.1177_20552076241237678-fig1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验