Department of Plastic and Reconstructive Surgery, Peninsula Health, Melbourne 3199, Australia.
Department of Plastic and Reconstructive Surgery, Odense University Hospital, 5000 Odense, Denmark.
Medicina (Kaunas). 2024 Sep 14;60(9):1500. doi: 10.3390/medicina60091500.
: Despite CTAs being critical for preoperative planning in autologous breast reconstruction, experienced plastic surgeons may have differing preferences for which side of the abdomen to use for unilateral breast reconstruction. Large language models (LLMs) have the potential to assist medical imaging interpretation. This study compares the perforator selection preferences of experienced plastic surgeons with four popular LLMs based on CTA images for breast reconstruction. : Six experienced plastic surgeons from Australia, the US, Italy, Denmark, and Argentina reviewed ten CTA images, indicated their preferred side of the abdomen for unilateral breast reconstruction and recommended the type of autologous reconstruction. The LLMs were prompted to do the same. The average decisions were calculated, recorded in suitable tables, and compared. : The six consultants predominantly recommend the DIEP procedure (83%). This suggests experienced surgeons feel more comfortable raising DIEP than TRAM flaps, which they recommended only 3% of the time. They also favoured MS TRAM and SIEA less frequently (11% and 2%, respectively). Three LLMs-ChatGPT-4o, ChatGPT-4, and Bing CoPilot-exclusively recommended DIEP (100%), while Claude suggested DIEP 90% and MS TRAM 10%. Despite minor variations in side recommendations, consultants and AI models clearly preferred DIEP. : Consultants and LLMs consistently preferred DIEP procedures, indicating strong confidence among experienced surgeons, though LLMs occasionally deviated in recommendations, highlighting limitations in their image interpretation capabilities. This emphasises the need for ongoing refinement of AI-assisted decision support systems to ensure they align more closely with expert clinical judgment and enhance their reliability in clinical practice.
尽管 CTA 对自体乳房重建的术前规划至关重要,但经验丰富的整形外科医生可能对用于单侧乳房重建的腹部哪一侧有不同的偏好。大型语言模型(LLM)有可能辅助医学影像解读。本研究比较了经验丰富的整形外科医生与基于 CTA 图像的四种流行 LLM 在乳房重建方面的穿支选择偏好。
来自澳大利亚、美国、意大利、丹麦和阿根廷的六名经验丰富的整形外科医生对十张 CTA 图像进行了审查,指明了他们对单侧乳房重建的首选腹部侧,并推荐了自体重建的类型。提示 LLM 也这样做。计算出平均决策,记录在合适的表格中并进行比较。
六位顾问主要推荐 DIEP 手术(83%)。这表明经验丰富的外科医生更愿意进行 DIEP 手术,而不是 TRAM 皮瓣,他们只推荐了 3%的时间。他们也不太倾向于 MS TRAM 和 SIEA(分别为 11%和 2%)。三个 LLM——ChatGPT-4o、ChatGPT-4 和 Bing CoPilot——完全推荐 DIEP(100%),而 Claude 则建议 DIEP 90%和 MS TRAM 10%。尽管在侧推荐方面存在细微差异,但顾问和 AI 模型显然更倾向于 DIEP 手术。
顾问和 LLM 一致倾向于 DIEP 手术,这表明经验丰富的外科医生有强烈的信心,尽管 LLM 在建议上偶尔会有所偏离,这突出了它们在图像解读能力方面的局限性。这强调了需要不断改进 AI 辅助决策支持系统,以确保它们更紧密地与专家临床判断保持一致,并提高它们在临床实践中的可靠性。