OpenAI's GPT-4 performs to a high degree on board-style dermatology questions.

Affiliation

Department of Dermatology, Donald and Barbara Zucker School of Medicine at Hofstra/Northwell, New Hyde Park, NY, USA.

Publication information

Int J Dermatol. 2024 Jan;63(1):73-78. doi: 10.1111/ijd.16913. Epub 2023 Dec 22.

DOI:10.1111/ijd.16913

PMID:38131454

Abstract

BACKGROUND

Artificial intelligence tools such as OpenAI's GPT-4 have shown promise in medical education, but their potential in dermatology remains unexplored.

OBJECTIVES

To assess GPT-4's performance on dermatology board-style questions and determine its value as a supplementary educational tool for trainees and educators.

METHODS

This cross-sectional study evaluated GPT-4's performance on 250 random dermatology board-style questions sampled from the American Academy of Dermatology's Board Prep Plus resource. Questions were divided into five subspecialties and various difficulty levels. GPT-4 responses were compared to the correct answers and evaluated by two physicians.

RESULTS

GPT-4 achieved an overall accuracy of 75% on the 250 questions, with no significant variation by subspecialty or question difficulty. The most common errors were factual inaccuracies and misunderstandings. Responses scored high in clarity, accuracy, and relevance but frequently lacked depth and completeness.
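
To give a sense of the sampling uncertainty around the reported 75% accuracy on 250 questions, the following minimal sketch computes a 95% Wilson score interval for a proportion. This calculation is not part of the study: the choice of the Wilson method, the function name `wilson_interval`, and the use of the rounded proportion 0.75 (the abstract does not give the exact number of correct answers) are all assumptions made for illustration.

```python
import math


def wilson_interval(p_hat: float, n: int, z: float = 1.96) -> tuple[float, float]:
    """Return the Wilson score interval for an observed proportion.

    p_hat: observed proportion (here the rounded accuracy from the abstract)
    n:     number of questions
    z:     standard normal quantile (1.96 corresponds to roughly 95% coverage)
    """
    if not 0.0 <= p_hat <= 1.0:
        raise ValueError("p_hat must lie in [0, 1]")
    if n <= 0:
        raise ValueError("n must be positive")

    z2_over_n = z * z / n
    denominator = 1.0 + z2_over_n
    # Centre of the interval, shifted slightly toward 0.5 relative to p_hat.
    centre = (p_hat + z2_over_n / 2.0) / denominator
    # Half-width of the interval.
    half_width = (
        z * math.sqrt(p_hat * (1.0 - p_hat) / n + z * z / (4.0 * n * n)) / denominator
    )
    return centre - half_width, centre + half_width


# Illustrative inputs taken from the abstract: 75% accuracy, 250 questions.
# 0.75 is a rounded figure, so the result is approximate.
lower, upper = wilson_interval(0.75, 250)
print(f"Approximate 95% interval: {lower:.3f} to {upper:.3f}")
```

Under these assumptions the interval spans roughly 0.69 to 0.80, which suggests the headline figure is reasonably stable for a sample of this size but is still an estimate, not an exact value.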

CONCLUSION

GPT-4 performed to a high degree and demonstrated promising performance as an educational adjunct in dermatology. Improvements in response depth and completeness are needed before its use as an unsupervised learning tool is established.

Abstract (translation)