文献检索文档翻译深度研究
Suppr Zotero 插件Zotero 插件
邀请有礼套餐&价格历史记录

新学期,新优惠

限时优惠:9月1日-9月22日

30天高级会员仅需29元

1天体验卡首发特惠仅需5.99元

了解详情
不再提醒
插件&应用
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
高级版
套餐订阅购买积分包
AI 工具
文献检索文档翻译深度研究
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2025

Dall-E在手部外科手术中的应用:探索ChatGPT图像生成的效用。

Dall-E in hand surgery: Exploring the utility of ChatGPT image generation.

作者信息

Soroudi Daniel, Rouhani Daniel S, Patel Alap, Sadjadi Ryan, Behnam-Hanona Reta, Oleck Nicholas C, Falade Israel, Piper Merisa, Hansen Scott L

机构信息

University of California San Francisco, School of Medicine, San Francisco, CA, USA.

University of California San Francisco, Department of Surgery, Division of Plastic and Reconstructive Surgery, San Francisco, CA, USA.

出版信息

Surg Open Sci. 2025 May 10;26:64-78. doi: 10.1016/j.sopen.2025.04.012. eCollection 2025 Jun.


DOI:10.1016/j.sopen.2025.04.012
PMID:40487714
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12143819/
Abstract

BACKGROUND: Artificial intelligence (AI) has significantly influenced various medical fields, including plastic surgery. Large language model (LLM) chatbots such as ChatGPT and text-to-image tools like Dall-E and GPT-4o are gaining broader adoption. This study explores the capabilities and limitations of these tools in hand surgery, focusing on their application in patient and medical education. METHODS: Utilizing Google Trends data, common search terms were identified and queried on ChatGPT-4.5 and ChatGPT-3.5 from the following categories: "Hand Anatomy", "Hand Fracture", "Hand Joint Injury", "Hand Tumor", and "Hand Dislocation". Responses were graded on a 1-5 scale for accuracy and evaluated using the Flesch-Kincaid Grade Level, Patient Education Materials Assessment Tool (PEMAT), and DISCERN instrument. GPT 4o, DALL-E 3, and DALL-E 2 illustrated visual representations of selected ChatGPT responses in each category, which were further evaluated. RESULTS: ChatGPT-4.5 achieved a DISCERN overall score of 3.80 ± 0.23. Its responses averaged 91.67 ± 0.29 for PEMAT understandability and 54.67 ± 0.55 for actionability. Accuracy was 4.47 ± 0.52, with a Flesch-Kincaid Grade Level of 9.26 ± 1.04. ChatGPT-4.5 consistently outperformed ChatGPT-3.5 across all evaluation metrics. For text-to-image generation, GPT-4o produced more accurate visuals compared to DALL-E 3 and DALL-E 2. CONCLUSIONS: This study highlights the strengths and limitations of ChatGPT-4.5 and GPT-4o in hand surgery education. While combining accurate text generation with image creation shows promise, these AI tools still need further refinement before widespread clinical adoption.

摘要

背景:人工智能(AI)已对包括整形手术在内的各个医学领域产生了重大影响。诸如ChatGPT之类的大语言模型(LLM)聊天机器人以及像Dall-E和GPT-4o这样的文本到图像工具正得到更广泛的应用。本研究探讨了这些工具在手外科中的能力和局限性,重点关注它们在患者和医学教育中的应用。 方法:利用谷歌趋势数据,确定了常见搜索词,并在ChatGPT-4.5和ChatGPT-3.5上查询了以下类别:“手部解剖学”、“手部骨折”、“手部关节损伤”、“手部肿瘤”和“手部脱位”。对回答的准确性按1-5级评分,并使用弗莱什-金凯德年级水平、患者教育材料评估工具(PEMAT)和辨别工具进行评估。GPT 4o、DALL-E 3和DALL-E 2对每个类别中选定的ChatGPT回答进行了可视化展示,并进一步进行了评估。 结果:ChatGPT-4.5的辨别总体评分为3.80±0.23。其回答的PEMAT可理解性平均为91.67±0.29,可操作性平均为54.67±0.55。准确性为4.47±0.52,弗莱什-金凯德年级水平为9.26±1.04。在所有评估指标上,ChatGPT-4.5始终优于ChatGPT-3.5。对于文本到图像生成,与DALL-E 3和DALL-E 2相比,GPT-4o生成的视觉效果更准确。 结论:本研究突出了ChatGPT-4.5和GPT-4o在手外科教育中的优势和局限性。虽然将准确的文本生成与图像创建相结合显示出了前景,但这些人工智能工具在广泛临床应用之前仍需进一步完善。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/178d/12143819/4a4e81a3fcda/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/178d/12143819/06c3db93b1ca/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/178d/12143819/6230d9392c68/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/178d/12143819/fa2aa4850e1a/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/178d/12143819/2812d0417d86/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/178d/12143819/4a4e81a3fcda/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/178d/12143819/06c3db93b1ca/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/178d/12143819/6230d9392c68/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/178d/12143819/fa2aa4850e1a/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/178d/12143819/2812d0417d86/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/178d/12143819/4a4e81a3fcda/gr5.jpg

相似文献

[1]
Dall-E in hand surgery: Exploring the utility of ChatGPT image generation.

Surg Open Sci. 2025-5-10

[2]
Evaluating the Efficacy of ChatGPT as a Patient Education Tool in Prostate Cancer: Multimetric Assessment.

J Med Internet Res. 2024-8-14

[3]
Evaluating the Accuracy of Artificial Intelligence (AI)-Generated Illustrations for Laser-Assisted In Situ Keratomileusis (LASIK), Photorefractive Keratectomy (PRK), and Small Incision Lenticule Extraction (SMILE).

Cureus. 2024-8-25

[4]
Large Language Models: Pioneering New Educational Frontiers in Childhood Myopia.

Ophthalmol Ther. 2025-6

[5]
Assessing chatbots ability to produce leaflets on cataract surgery: Bing AI, chatGPT 3.5, chatGPT 4o, ChatSonic, Google Bard, Perplexity, and Pi.

J Cataract Refract Surg. 2025-5-1

[6]
Assessment of Artificial Intelligence Chatbot Responses to Top Searched Queries About Cancer.

JAMA Oncol. 2023-10-1

[7]
Comparative evaluation of responses from DeepSeek-R1, ChatGPT-o1, ChatGPT-4, and dental GPT chatbots to patient inquiries about dental and maxillofacial prostheses.

BMC Oral Health. 2025-5-31

[8]
Assessing the quality and readability of patient education materials on chemotherapy cardiotoxicity from artificial intelligence chatbots: An observational cross-sectional study.

Medicine (Baltimore). 2025-4-11

[9]
Evaluating the role of AI chatbots in patient education for abdominal aortic aneurysms: a comparison of ChatGPT and conventional resources.

ANZ J Surg. 2025-4

[10]
Assessment of Generative Artificial Intelligence (AI) Models in Creating Medical Illustrations for Various Corneal Transplant Procedures.

Cureus. 2024-8-26

本文引用的文献

[1]
Current applications and challenges in large language models for patient care: a systematic review.

Commun Med (Lond). 2025-1-21

[2]
Evaluating Text-to-Image Generated Photorealistic Images of Human Anatomy.

Cureus. 2024-11-21

[3]
Leveraging large language models to improve patient education on dry eye disease.

Eye (Lond). 2025-4

[4]
Large language models in patient education: a scoping review of applications in medicine.

Front Med (Lausanne). 2024-10-29

[5]
Evaluating Large Language Models in Dental Anesthesiology: A Comparative Analysis of ChatGPT-4, Claude 3 Opus, and Gemini 1.0 on the Japanese Dental Society of Anesthesiology Board Certification Exam.

Cureus. 2024-9-27

[6]
Clinician voices on ethics of LLM integration in healthcare: a thematic analysis of ethical concerns and implications.

BMC Med Inform Decis Mak. 2024-9-9

[7]
Prompt engineering on leveraging large language models in generating response to InBasket messages.

J Am Med Inform Assoc. 2024-10-1

[8]
Can artificial intelligence help for scientific illustration? Details matter.

Crit Care. 2024-6-10

[9]
Artificial Intelligence-Generated Facial Images for Medical Education.

Med Sci Educ. 2023-11-14

[10]
Using AI Text-to-Image Generation to Create Novel Illustrations for Medical Education: Current Limitations as Illustrated by Hypothyroidism and Horner Syndrome.

JMIR Med Educ. 2024-2-22

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

推荐工具

医学文档翻译智能文献检索