Suppr超能文献

评估ChatGPT-4o肿瘤学专家与标准医学肿瘤学知识相比的表现:聚焦于与治疗相关的临床问题。

Evaluating the Performance of ChatGPT-4o Oncology Expert in Comparison to Standard Medical Oncology Knowledge: A Focus on Treatment-Related Clinical Questions.

作者信息

Kinikoglu Oguzcan, Isik Deniz

机构信息

Medical Oncology, Kartal Dr. Lütfi Kirdar City Hospital, Health Science University, Istanbul, TUR.

出版信息

Cureus. 2025 Jan 27;17(1):e78076. doi: 10.7759/cureus.78076. eCollection 2025 Jan.

Abstract

Integrating artificial intelligence (AI) into oncology can revolutionize decision-making by providing accurate information. This study evaluates the performance of ChatGPT-4o (OpenAI, San Francisco, CA) Oncology Expert, in addressing open-ended clinical oncology questions. Thirty-seven treatment-related questions on solid organ tumors were selected from a hematology-oncology textbook. Responses from ChatGPT-4o Oncology Expert and the textbook were anonymized and independently evaluated by two medical oncologists using a structured scoring system focused on accuracy and clinical justification. Statistical analysis, including paired t-tests, was conducted to compare scores, and interrater reliability was assessed using Cohen's Kappa. Oncology Expert achieved a significantly higher average score of 7.83 compared to the textbook's 7.0 (p < 0.01). In 10 cases, Oncology Expert provided more accurate and updated answers, demonstrating its ability to integrate recent medical knowledge. In 26 cases, both sources provided equally relevant answers, but the Oncology Expert's responses were clearer and easier to understand. Cohen's Kappa indicated almost perfect agreement (κ = 0.93). Both sources included outdated information for bladder cancer treatment, underscoring the need for regular updates. ChatGPT-4o Oncology Expert shows significant potential as a clinical tool in oncology by offering precise, up-to-date, and user-friendly responses. It could transform oncology practice by enhancing decision-making efficiency, improving educational tools, and serving as a reliable adjunct to clinical workflows. However, its integration requires regular updates, expert validation, and a collaborative approach to ensure reliability and relevance in the rapidly evolving field of oncology.

摘要

将人工智能(AI)整合到肿瘤学中,可以通过提供准确信息来彻底改变决策方式。本研究评估了ChatGPT-4o(OpenAI,加利福尼亚州旧金山)肿瘤学专家在回答开放式临床肿瘤学问题方面的表现。从一本血液学肿瘤学教科书中选取了37个关于实体器官肿瘤的治疗相关问题。ChatGPT-4o肿瘤学专家和教科书的回答进行了匿名处理,并由两名医学肿瘤学家使用侧重于准确性和临床依据的结构化评分系统进行独立评估。进行了包括配对t检验在内的统计分析以比较分数,并使用科恩kappa系数评估评分者间的可靠性。肿瘤学专家的平均得分显著高于教科书,分别为7.83分和7.0分(p < 0.01)。在10个案例中,肿瘤学专家提供了更准确和最新的答案,展示了其整合最新医学知识的能力。在26个案例中,两个来源提供的答案同样相关,但肿瘤学专家的回答更清晰、更易于理解。科恩kappa系数表明几乎完全一致(κ = 0.93)。两个来源都包含了过时的膀胱癌治疗信息,凸显了定期更新的必要性。ChatGPT-4o肿瘤学专家通过提供精确、最新且用户友好的回答,显示出作为肿瘤学临床工具的巨大潜力。它可以通过提高决策效率、改进教育工具以及作为临床工作流程的可靠辅助手段来改变肿瘤学实践。然而,其整合需要定期更新、专家验证以及采用协作方法,以确保在快速发展的肿瘤学领域中的可靠性和相关性。

相似文献

本文引用的文献

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验