• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

人类与机器:整形与重建外科决策的未来

Human vs Machine: The Future of Decision-making in Plastic and Reconstructive Surgery.

作者信息

Duran Alpay, Demiröz Anıl, Çörtük Oguz, Ok Bora, Özten Mustafa, Eroğlu Sinem

出版信息

Aesthet Surg J. 2025 Mar 17;45(4):434-440. doi: 10.1093/asj/sjaf015.

DOI:10.1093/asj/sjaf015
PMID:39862057
Abstract

BACKGROUND

Artificial intelligence-driven technologies offer transformative potential in plastic surgery, spanning preoperative planning, surgical procedures, and postoperative care, with the promise of improved patient outcomes.

OBJECTIVES

To compare the web-based ChatGPT-4o (omni; OpenAI, San Francisco, CA) and Gemini Advanced (Alphabet Inc., Mountain View, CA), focusing on their data upload feature and examining outcomes before and after exposure to continuing medical education (CME) articles, particularly regarding their efficacy relative to human participants.

METHODS

Participants and large language models (LLMs) completed 22 multiple-choice questions to assess baseline knowledge of CME topics. Initially, both LLMs and participants answered without article access. In incognito mode, the LLMs repeated the tests over 6 days. After accessing the articles, responses from both LLMs and participants were extracted and analyzed.

RESULTS

There was a significant increase in mean scores after the article was read in the resident group, indicating a significant rise. In the LLM groups, the ChatGPT-4o (omni) group showed no significant difference between pre- and postarticle scores, but the Gemini Advanced group demonstrated a significant increase. It can be stated that the ChatGPT-4o and Gemini Advanced groups have higher accuracy means compared with the resident group in both pre- and postarticle periods.

CONCLUSIONS

The analysis between human participants and LLMs indicates promising implications for the incorporation of LLMs in medical education. Because these models increase in sophistication, they offer the potential to serve as supplementary tools within traditional learning environments. This could aid in bridging the gap between theoretical knowledge and practical implementation.

摘要

背景

人工智能驱动的技术在整形手术中具有变革潜力,涵盖术前规划、手术过程和术后护理,有望改善患者预后。

目的

比较基于网络的ChatGPT-4o(全能版;OpenAI,加利福尼亚州旧金山)和Gemini Advanced(Alphabet公司,加利福尼亚州山景城),重点关注它们的数据上传功能,并检查接触继续医学教育(CME)文章前后的结果,特别是它们相对于人类参与者的功效。

方法

参与者和大语言模型(LLMs)完成了22道多项选择题,以评估对CME主题的基线知识。最初,LLMs和参与者在无法获取文章的情况下作答。在隐身模式下,LLMs在6天内重复进行测试。在获取文章后,提取并分析LLMs和参与者的回答。

结果

住院医师组在阅读文章后平均分数显著提高,表明有显著提升。在LLM组中,ChatGPT-4o(全能版)组文章前后分数无显著差异,但Gemini Advanced组分数显著提高。可以说,ChatGPT-4o组和Gemini Advanced组在文章前后阶段的准确率均值均高于住院医师组。

结论

人类参与者与LLMs之间的分析表明,将LLMs纳入医学教育具有广阔前景。由于这些模型日益复杂,它们有潜力在传统学习环境中作为辅助工具。这有助于弥合理论知识与实际应用之间的差距。

相似文献

1
Human vs Machine: The Future of Decision-making in Plastic and Reconstructive Surgery.人类与机器:整形与重建外科决策的未来
Aesthet Surg J. 2025 Mar 17;45(4):434-440. doi: 10.1093/asj/sjaf015.
2
Comparison of ChatGPT and Internet Research for Clinical Research and Decision-Making in Occupational Medicine: Randomized Controlled Trial.ChatGPT与互联网搜索用于职业医学临床研究和决策的比较:随机对照试验
JMIR Form Res. 2025 May 20;9:e63857. doi: 10.2196/63857.
3
Performance of ChatGPT-4o and Four Open-Source Large Language Models in Generating Diagnoses Based on China's Rare Disease Catalog: Comparative Study.ChatGPT-4o与四个开源大语言模型基于中国罕见病目录生成诊断的性能:比较研究
J Med Internet Res. 2025 Jun 18;27:e69929. doi: 10.2196/69929.
4
Does Augmenting Irradiated Autografts With Free Vascularized Fibula Graft in Patients With Bone Loss From a Malignant Tumor Achieve Union, Function, and Complication Rate Comparably to Patients Without Bone Loss and Augmentation When Reconstructing Intercalary Resections in the Lower Extremity?对于因恶性肿瘤导致骨缺损的患者,在重建下肢节段性切除时,采用带血管游离腓骨移植来增强照射后的自体骨移植,其骨愈合、功能及并发症发生率与无骨缺损且未进行增强的患者相比是否相当?
Clin Orthop Relat Res. 2025 Jun 26. doi: 10.1097/CORR.0000000000003599.
5
Enhancing the Readability of Online Patient Education Materials Using Large Language Models: Cross-Sectional Study.使用大语言模型提高在线患者教育材料的可读性:横断面研究。
J Med Internet Res. 2025 Jun 4;27:e69955. doi: 10.2196/69955.
6
A rapid and systematic review of the clinical effectiveness and cost-effectiveness of topotecan for ovarian cancer.拓扑替康治疗卵巢癌的临床有效性和成本效益的快速系统评价。
Health Technol Assess. 2001;5(28):1-110. doi: 10.3310/hta5280.
7
Eliciting adverse effects data from participants in clinical trials.从临床试验参与者中获取不良反应数据。
Cochrane Database Syst Rev. 2018 Jan 16;1(1):MR000039. doi: 10.1002/14651858.MR000039.pub2.
8
Intravenous magnesium sulphate and sotalol for prevention of atrial fibrillation after coronary artery bypass surgery: a systematic review and economic evaluation.静脉注射硫酸镁和索他洛尔预防冠状动脉搭桥术后房颤:系统评价与经济学评估
Health Technol Assess. 2008 Jun;12(28):iii-iv, ix-95. doi: 10.3310/hta12280.
9
Home treatment for mental health problems: a systematic review.心理健康问题的居家治疗:一项系统综述
Health Technol Assess. 2001;5(15):1-139. doi: 10.3310/hta5150.
10
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.