GPT-4o and Specialized AI in Breast Ultrasound Imaging: A Comparative Study on Accuracy, Agreement, Limitations, and Diagnostic Potential.

Authors

Sanli Deniz Esin Tekcan, Sanli Ahmet Necati, Buyukdereli Atadag Yildiz, Kurt Atakan, Esmerer Emel

Affiliations

Department of Radiology, Faculty of Medicine, Gaziantep University, Gaziantep, Turkey.

Department of General Surgery, Abdulkadir Yuksel State Hospital, Gaziantep, Turkey.

Publication information

J Ultrasound Med. 2025 Jun 23. doi: 10.1002/jum.16749.

DOI: 10.1002/jum.16749
PMID: 40548624

Abstract

OBJECTIVES

This study aimed to evaluate the ability of ChatGPT and Breast Ultrasound Helper, a special ChatGPT-based subprogram trained on ultrasound image analysis, to analyze and differentiate benign and malignant breast lesions on ultrasound images.

METHODS

Ultrasound images of patients with histopathologically confirmed breast cancer or fibroadenoma were read by GPT-4o (the latest ChatGPT version) and Breast Ultrasound Helper (BUH), a tool from the "Explore" section of ChatGPT. Both were prompted in English using ACR BI-RADS Breast Ultrasound Lexicon criteria: lesion shape, orientation, margin, internal echo pattern, echogenicity, posterior acoustic features, microcalcifications or hyperechoic foci, perilesional hyperechoic rim, edema or architectural distortion, lesion size, and BI-RADS category. Two experienced radiologists evaluated the images and the responses of the programs in consensus. The outputs, BI-RADS category agreement, and benign/malignant discrimination were statistically compared.

RESULTS

A total of 232 ultrasound images were analyzed, of which 133 (57.3%) were malignant and 99 (42.7%) benign. In comparative analysis, BUH showed superior performance overall, with higher kappa values and statistically significant results across multiple features (P < .001). However, the overall level of agreement with the radiologists' consensus for all features was similar for BUH (κ: 0.387-0.755) and GPT-4o (κ: 0.317-0.803). On the other hand, BI-RADS category agreement was slightly higher in GPT-4o than in BUH (69.4% versus 65.9%), but BUH was slightly more successful in distinguishing benign lesions from malignant lesions (65.9% versus 67.7%).
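The agreement figures above are reported as percent agreement and kappa (κ). The following is a minimal sketch of how those two statistics are commonly computed, assuming the κ in question is Cohen's kappa for two raters; the abstract does not say which variant or software the authors used. The function names and the eight example labels are hypothetical and are not study data. Only the case-mix counts (133 and 99 of 232) come from the abstract.

```python
from collections import Counter


def percent_agreement(rater_a, rater_b):
    """Fraction of items on which the two raters gave the same label."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must label the same non-empty set of items")
    return sum(x == y for x, y in zip(rater_a, rater_b)) / len(rater_a)


def cohen_kappa(rater_a, rater_b):
    """Cohen's kappa: observed agreement corrected for chance agreement."""
    n = len(rater_a)
    p_observed = percent_agreement(rater_a, rater_b)
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    # Chance agreement: for each label, the product of the two raters'
    # marginal frequencies, summed over all labels.
    labels = set(rater_a) | set(rater_b)
    p_expected = sum(counts_a[c] * counts_b[c] for c in labels) / (n * n)
    if p_expected == 1.0:
        return 1.0  # both raters used a single identical label throughout
    return (p_observed - p_expected) / (1 - p_expected)


# Hypothetical labels for illustration only (NOT data from the study).
radiologists = ["malignant", "malignant", "benign", "benign",
                "malignant", "benign", "malignant", "benign"]
model = ["malignant", "benign", "benign", "benign",
         "malignant", "malignant", "malignant", "benign"]

agreement = percent_agreement(radiologists, model)
kappa = cohen_kappa(radiologists, model)
print(f"percent agreement = {agreement:.1%}, kappa = {kappa:.2f}")

# Case-mix percentages, using the counts reported in the abstract.
total, malignant, benign = 232, 133, 99
print(f"malignant = {malignant / total:.1%}, benign = {benign / total:.1%}")
```

In this toy example the raters agree on 6 of 8 items (75%), but because half of that agreement is expected by chance, κ is only 0.50. This is why a percent agreement near 66-69% can coexist with κ values in the moderate range.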

CONCLUSIONS

Although both AI tools show moderate-to-good performance in ultrasound image analysis, their limited compatibility with radiologists' evaluations and BI-RADS categorization suggests that their clinical application in breast ultrasound interpretation is still at an early stage and not yet reliable.
