Suppr超能文献

ChatGPT-4.0 在医学影像评估中的性能:一项横断面研究。

The performance of ChatGPT-4.0o in medical imaging evaluation: a cross-sectional study.

机构信息

UniSA Allied Health & Human Performance, University of South Australia, Adelaide, SA, Australia

Jones Radiology, Eastwood, SA, Australia.

出版信息

J Educ Eval Health Prof. 2024;21:29. doi: 10.3352/jeehp.2024.21.29. Epub 2024 Oct 31.

Abstract

This study investigated the performance of ChatGPT-4.0o in evaluating the quality of positioning in radiographic images. Thirty radiographs depicting a variety of knee, elbow, ankle, hand, pelvis, and shoulder projections were produced using anthropomorphic phantoms and uploaded to ChatGPT-4.0o. The model was prompted to provide a solution to identify any positioning errors with justification and offer improvements. A panel of radiographers assessed the solutions for radiographic quality based on established positioning criteria, with a grading scale of 1–5. In only 20% of projections, ChatGPT-4.0o correctly recognized all errors with justifications and offered correct suggestions for improvement. The most commonly occurring score was 3 (9 cases, 30%), wherein the model recognized at least 1 specific error and provided a correct improvement. The mean score was 2.9. Overall, low accuracy was demonstrated, with most projections receiving only partially correct solutions. The findings reinforce the importance of robust radiography education and clinical experience.

摘要

本研究旨在评估 ChatGPT-4.0o 在评估放射影像定位质量方面的性能。使用人体模型制作了 30 张膝关节、肘关节、踝关节、手部、骨盆和肩部投照的射线照片,并将其上传至 ChatGPT-4.0o。模型被提示提供一种解决方案,以识别任何定位错误,并提供改进建议。一组放射技师根据既定的定位标准,使用 1-5 的评分量表对放射质量的解决方案进行评估。在仅 20%的投影中,ChatGPT-4.0o 正确识别所有错误,并提供了正确的改进建议。最常见的分数是 3(9 例,30%),其中模型识别出至少 1 个特定错误,并提供了正确的改进建议。平均得分为 2.9。总体而言,准确性较低,大多数投影仅获得部分正确的解决方案。研究结果强调了强大的放射学教育和临床经验的重要性。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验