Charles Sturt University, Wagga Wagga, New South Wales, Australia
J Nucl Med Technol. 2023 Dec 5;51(4):314-317. doi: 10.2967/jnmt.123.266485.
The emergence of ChatGPT has challenged academic integrity in teaching institutions, including those providing nuclear medicine training. Although previous evaluations of ChatGPT suggested a limited capacity for academic writing, the March 2023 release of generative pretrained transformer (GPT)-4 promises enhanced capabilities that require evaluation. Examinations (final and calculation) and written assignments for nuclear medicine subjects were attempted using both GPT-3.5 and GPT-4. The GPT-3.5 and GPT-4 responses were evaluated with Turnitin software for artificial intelligence scores, marked against standardized rubrics, and compared with the mean performance of student cohorts. ChatGPT powered by GPT-3.5 performed poorly on calculation examinations (31.4%), whereas GPT-4 scored 59.1%. GPT-3.5 failed each of 3 written tasks (39.9%), whereas GPT-4 passed each task (56.3%). Although GPT-3.5 poses minimal risk to academic integrity, GPT-4 substantially increases ChatGPT's usefulness as a cheating tool, although its output remains prone to hallucination and fabrication.