• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估人工智能在减重手术中的能力:关于ChatGPT-4和DALL·E 3识别与绘图准确性的研究

Evaluating AI Capabilities in Bariatric Surgery: A Study on ChatGPT-4 and DALL·E 3's Recognition and Illustration Accuracy.

作者信息

Mahjoubi Mohammad, Shahabi Shahab, Sheikhbahaei Saba, Jazi Amir Hossein Davarpanah

机构信息

Minimally Invasive Surgery Research Center, Iran University of Medical Sciences, Tehran, Iran.

Hunter New England Local Health District, Newcastle, Australia.

出版信息

Obes Surg. 2025 Feb;35(2):638-641. doi: 10.1007/s11695-024-07653-z. Epub 2024 Dec 29.

DOI:10.1007/s11695-024-07653-z
PMID:39733375
Abstract

BACKGROUND

With the rise of artificial intelligence (AI) in medical education, tools like OpenAI's ChatGPT-4 and DALL·E 3 have potential applications in enhancing learning materials. This study aims to evaluate ChatGPT-4o's proficiency in recognizing bariatric surgical procedures from illustrations and assess DALL·E 3's effectiveness in generating accurate surgical illustrations.

METHODS

Illustrations of six bariatric surgical procedures (One Anastomosis Gastric Bypass, Roux-en-Y Gastric Bypass, Single Anastomosis Duodeno-Ileal Bypass with Sleeve Gastrectomy, Sleeve Gastrectomy, Biliopancreatic Diversion, and Adjustable Gastric Banding) were sourced from the IFSO Atlas of Metabolic and Bariatric Surgery. ChatGPT-4 was tasked with identifying each procedure based on these illustrations to evaluate its classification accuracy. Simultaneously, DALL·E 3 was prompted with the specific names of each procedure to generate corresponding medical illustrations.

RESULTS

ChatGPT-4 correctly identified only the Adjustable Gastric Banding illustration, misclassifying the other five procedures. DALL·E 3 failed to produce accurate illustrations for all six procedures.

CONCLUSION

The study underscores the need for further evaluation of AI in bariatric surgery. Both ChatGPT-4 and DALL·E 3, while promising, have significant limitations in recognizing and generating accurate illustrations of bariatric surgical procedures. These findings call for continued research and development to make AI models suitable for medical education applications in bariatric surgery.

摘要

背景

随着人工智能(AI)在医学教育中的兴起,像OpenAI的ChatGPT-4和DALL·E 3这样的工具在增强学习材料方面具有潜在应用。本研究旨在评估ChatGPT-4识别减肥手术程序插图的能力,并评估DALL·E 3生成准确手术插图的有效性。

方法

六种减肥手术程序(单吻合口胃旁路术、Roux-en-Y胃旁路术、单吻合口十二指肠-回肠旁路术联合袖状胃切除术、袖状胃切除术、胆胰转流术和可调节胃束带术)的插图来源于国际肥胖与代谢病外科联盟(IFSO)代谢与减肥外科学图谱。ChatGPT-4的任务是根据这些插图识别每种手术程序,以评估其分类准确性。同时,向DALL·E 3输入每种手术程序的具体名称,以生成相应的医学插图。

结果

ChatGPT-4仅正确识别了可调节胃束带术的插图,将其他五种手术程序误分类。DALL·E 3未能为所有六种手术程序生成准确的插图。

结论

该研究强调了在减肥手术中进一步评估人工智能的必要性。ChatGPT-4和DALL·E 3虽然很有前景,但在识别和生成减肥手术程序的准确插图方面存在重大局限性。这些发现呼吁继续进行研究和开发,以使人工智能模型适用于减肥手术的医学教育应用。

相似文献

1
Evaluating AI Capabilities in Bariatric Surgery: A Study on ChatGPT-4 and DALL·E 3's Recognition and Illustration Accuracy.评估人工智能在减重手术中的能力:关于ChatGPT-4和DALL·E 3识别与绘图准确性的研究
Obes Surg. 2025 Feb;35(2):638-641. doi: 10.1007/s11695-024-07653-z. Epub 2024 Dec 29.
2
Evaluating the Accuracy of Artificial Intelligence (AI)-Generated Illustrations for Laser-Assisted In Situ Keratomileusis (LASIK), Photorefractive Keratectomy (PRK), and Small Incision Lenticule Extraction (SMILE).评估人工智能生成的用于准分子原位角膜磨镶术(LASIK)、准分子激光角膜切削术(PRK)和小切口基质透镜切除术(SMILE)的插图的准确性。
Cureus. 2024 Aug 25;16(8):e67747. doi: 10.7759/cureus.67747. eCollection 2024 Aug.
3
Assessment of Generative Artificial Intelligence (AI) Models in Creating Medical Illustrations for Various Corneal Transplant Procedures.生成式人工智能(AI)模型在为各种角膜移植手术创建医学插图方面的评估。
Cureus. 2024 Aug 26;16(8):e67833. doi: 10.7759/cureus.67833. eCollection 2024 Aug.
4
EAES rapid guideline: systematic review, network meta-analysis, CINeMA and GRADE assessment, and European consensus on bariatric surgery-extension 2022.EAES 快速指南:系统评价、网络荟萃分析、CINeMA 和 GRADE 评估以及 2022 年肥胖手术扩展的欧洲共识。
Surg Endosc. 2022 Mar;36(3):1709-1725. doi: 10.1007/s00464-022-09008-0. Epub 2022 Jan 20.
5
Can DALL-E 3 Reliably Generate 12-Lead ECGs and Teaching Illustrations?DALL-E 3能否可靠地生成12导联心电图和教学插图?
Cureus. 2024 Jan 22;16(1):e52748. doi: 10.7759/cureus.52748. eCollection 2024 Jan.
6
Performance of artificial intelligence in bariatric surgery: comparative analysis of ChatGPT-4, Bing, and Bard in the American Society for Metabolic and Bariatric Surgery textbook of bariatric surgery questions.人工智能在减重手术中的表现:ChatGPT-4、Bing 和 Bard 在《美国代谢与减重外科学会减重手术教科书》减重手术问题中的比较分析。
Surg Obes Relat Dis. 2024 Jul;20(7):609-613. doi: 10.1016/j.soard.2024.04.014. Epub 2024 May 8.
7
The Performance of Artificial Intelligence in One Anastomosis Gastric Bypass Surgery: Comparative Efficacy of ChatGPT-4.0, ChatGPT-Omni, and Gemini AI.人工智能在单吻合口胃旁路手术中的表现:ChatGPT-4.0、ChatGPT-Omni和Gemini AI的疗效比较
Obes Surg. 2025 Apr;35(4):1469-1475. doi: 10.1007/s11695-025-07794-9. Epub 2025 Mar 18.
8
Roux-en-Y gastric bypass, adjustable gastric banding, or sleeve gastrectomy for severe obesity (By-Band-Sleeve): a multicentre, open label, three-group, randomised controlled trial.Roux-en-Y胃旁路术、可调节胃束带术或袖状胃切除术治疗重度肥胖(胃旁路术-胃束带术-袖状胃切除术):一项多中心、开放标签、三组随机对照试验
Lancet Diabetes Endocrinol. 2025 May;13(5):410-426. doi: 10.1016/S2213-8587(25)00025-7. Epub 2025 Mar 31.
9
Assessing ChatGPT vs. Standard Medical Resources for Endoscopic Sleeve Gastroplasty Education: A Medical Professional Evaluation Study.评估 ChatGPT 与标准医学资源在经内镜袖状胃切除术教育中的作用:一项医学专业人员评估研究。
Obes Surg. 2024 Jul;34(7):2718-2724. doi: 10.1007/s11695-024-07283-5. Epub 2024 May 17.
10
Bariatric Evaluation Through AI: a Survey of Expert Opinions Versus ChatGPT-4 (BETA-SEOV).通过人工智能进行肥胖评估:专家意见与 ChatGPT-4 (BETA-SEOV) 的对比调查。
Obes Surg. 2023 Dec;33(12):3971-3980. doi: 10.1007/s11695-023-06903-w. Epub 2023 Oct 27.

引用本文的文献

1
Enhancing the Accuracy of Human Phenotype Ontology Identification: Comparative Evaluation of Multimodal Large Language Models.提高人类表型本体识别的准确性:多模态大语言模型的比较评估
J Med Internet Res. 2025 Jun 2;27:e73233. doi: 10.2196/73233.
2
International expert consensus on the current status and future prospects of artificial intelligence in metabolic and bariatric surgery.国际专家对人工智能在代谢与减重手术中的现状及未来前景的共识。
Sci Rep. 2025 Mar 18;15(1):9312. doi: 10.1038/s41598-025-94335-0.
3
AI Capabilities in Bariatric Surgery.

本文引用的文献

1
Art or Artifact: Evaluating the Accuracy, Appeal, and Educational Value of AI-Generated Imagery in DALL·E 3 for Illustrating Congenital Heart Diseases.艺术还是人工制品:评估 DALL·E 3 中人工智能生成图像在阐明先天性心脏病方面的准确性、吸引力和教育价值。
J Med Syst. 2024 May 23;48(1):54. doi: 10.1007/s10916-024-02072-0.
2
Evaluating dermatologic domain knowledge in DALL-E 2 and potential applications for dermatology-specific algorithms.评估DALL-E 2中的皮肤病学领域知识以及皮肤病学特定算法的潜在应用。
Int J Dermatol. 2023 Oct;62(10):e521-e523. doi: 10.1111/ijd.16683. Epub 2023 Apr 14.
3
The history of metabolic and bariatric surgery: Development of standards for patient safety and efficacy.
减肥手术中的人工智能能力。
Obes Surg. 2025 Feb;35(2):666-667. doi: 10.1007/s11695-025-07677-z. Epub 2025 Jan 9.
代谢和减重手术的历史:患者安全和疗效标准的制定。
Metabolism. 2018 Feb;79:97-107. doi: 10.1016/j.metabol.2017.12.010. Epub 2018 Jan 5.