• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于神经网络的内镜超声图像分类的语音辅助标注。

Voice-Assisted Image Labeling for Endoscopic Ultrasound Classification Using Neural Networks.

出版信息

IEEE Trans Med Imaging. 2022 Jun;41(6):1311-1319. doi: 10.1109/TMI.2021.3139023. Epub 2022 Jun 1.

DOI:10.1109/TMI.2021.3139023
PMID:34962866
Abstract

Ultrasound imaging is a commonly used technology for visualising patient anatomy in real-time during diagnostic and therapeutic procedures. High operator dependency and low reproducibility make ultrasound imaging and interpretation challenging with a steep learning curve. Automatic image classification using deep learning has the potential to overcome some of these challenges by supporting ultrasound training in novices, as well as aiding ultrasound image interpretation in patient with complex pathology for more experienced practitioners. However, the use of deep learning methods requires a large amount of data in order to provide accurate results. Labelling large ultrasound datasets is a challenging task because labels are retrospectively assigned to 2D images without the 3D spatial context available in vivo or that would be inferred while visually tracking structures between frames during the procedure. In this work, we propose a multi-modal convolutional neural network (CNN) architecture that labels endoscopic ultrasound (EUS) images from raw verbal comments provided by a clinician during the procedure. We use a CNN composed of two branches, one for voice data and another for image data, which are joined to predict image labels from the spoken names of anatomical landmarks. The network was trained using recorded verbal comments from expert operators. Our results show a prediction accuracy of 76% at image level on a dataset with 5 different labels. We conclude that the addition of spoken commentaries can increase the performance of ultrasound image classification, and eliminate the burden of manually labelling large EUS datasets necessary for deep learning applications.

摘要

超声成像是一种在诊断和治疗过程中实时可视化患者解剖结构的常用技术。由于操作人员高度依赖和可重复性低,因此超声成像和解释具有挑战性,学习曲线陡峭。使用深度学习进行自动图像分类有潜力克服其中的一些挑战,例如支持新手进行超声培训,以及为经验丰富的从业者提供具有复杂病理的患者的超声图像解释辅助。然而,深度学习方法的使用需要大量的数据才能提供准确的结果。标记大型超声数据集是一项具有挑战性的任务,因为标签是在没有体内 3D 空间背景或在程序中通过视觉跟踪帧之间的结构时推断出的情况下,从 2D 图像回溯分配的。在这项工作中,我们提出了一种多模态卷积神经网络(CNN)架构,该架构从临床医生在程序中提供的原始口头评论中标记内镜超声(EUS)图像。我们使用由两个分支组成的 CNN,一个用于语音数据,另一个用于图像数据,这两个分支结合起来,根据解剖学地标的口头名称预测图像标签。该网络使用来自专家操作人员的记录口头评论进行训练。我们的结果表明,在具有 5 个不同标签的数据集上,图像级别的预测准确率为 76%。我们得出结论,添加口头评论可以提高超声图像分类的性能,并消除为深度学习应用程序手动标记大型 EUS 数据集的负担。

相似文献

1
Voice-Assisted Image Labeling for Endoscopic Ultrasound Classification Using Neural Networks.基于神经网络的内镜超声图像分类的语音辅助标注。
IEEE Trans Med Imaging. 2022 Jun;41(6):1311-1319. doi: 10.1109/TMI.2021.3139023. Epub 2022 Jun 1.
2
Brain tumor segmentation and detection in MRI using convolutional neural networks and VGG16.使用卷积神经网络和VGG16在磁共振成像(MRI)中进行脑肿瘤分割与检测
Cancer Biomark. 2025 Mar;42(3):18758592241311184. doi: 10.1177/18758592241311184. Epub 2025 Apr 4.
3
Fast interactive medical image segmentation with weakly supervised deep learning method.基于弱监督深度学习方法的快速交互式医学图像分割。
Int J Comput Assist Radiol Surg. 2020 Sep;15(9):1437-1444. doi: 10.1007/s11548-020-02223-x. Epub 2020 Jul 11.
4
Comparison of Deep-Learning and Conventional Machine-Learning Methods for the Automatic Recognition of the Hepatocellular Carcinoma Areas from Ultrasound Images.深度学习与传统机器学习方法在自动识别超声图像肝癌区域的比较。
Sensors (Basel). 2020 May 29;20(11):3085. doi: 10.3390/s20113085.
5
Application of high resolution computed tomography image assisted classification model of middle ear diseases based on 3D-convolutional neural network.基于 3D 卷积神经网络的中耳疾病高分辨率 CT 图像辅助分类模型的应用。
Zhong Nan Da Xue Xue Bao Yi Xue Ban. 2022 Aug 28;47(8):1037-1048. doi: 10.11817/j.issn.1672-7347.2022.210704.
6
MABAL: a Novel Deep-Learning Architecture for Machine-Assisted Bone Age Labeling.MABAL:一种用于机器辅助骨龄标注的新型深度学习架构。
J Digit Imaging. 2018 Aug;31(4):513-519. doi: 10.1007/s10278-018-0053-3.
7
Automated fundus ultrasound image classification based on siamese convolutional neural networks with multi-attention.基于具有多注意力机制的孪生卷积神经网络的眼底超声图像自动分类。
BMC Med Imaging. 2023 Jul 6;23(1):89. doi: 10.1186/s12880-023-01047-w.
8
A novel convolutional neural network for kidney ultrasound images segmentation.一种用于肾脏超声图像分割的新型卷积神经网络。
Comput Methods Programs Biomed. 2022 May;218:106712. doi: 10.1016/j.cmpb.2022.106712. Epub 2022 Feb 26.
9
Landmark tracking in liver US images using cascade convolutional neural networks with long short-term memory.使用带有长短期记忆的级联卷积神经网络对肝脏超声图像进行地标跟踪。
Meas Sci Technol. 2023 May 1;34(5):054002. doi: 10.1088/1361-6501/acb5b3. Epub 2023 Feb 2.
10
Interactively Fusing Global and Local Features for Benign and Malignant Classification of Breast Ultrasound Images.交互式融合全局和局部特征用于乳腺超声图像的良恶性分类
Ultrasound Med Biol. 2025 Mar;51(3):525-534. doi: 10.1016/j.ultrasmedbio.2024.11.014. Epub 2024 Dec 20.

引用本文的文献

1
Applications of Artificial Intelligence in Gastrointestinal Endoscopic Ultrasound: Current Developments, Limitations and Future Directions.人工智能在胃肠道内镜超声中的应用:当前进展、局限性及未来方向
Cancers (Basel). 2024 Dec 17;16(24):4196. doi: 10.3390/cancers16244196.
2
Artificial Intelligence in Pancreatic Image Analysis: A Review.人工智能在胰腺影像分析中的应用:综述
Sensors (Basel). 2024 Jul 22;24(14):4749. doi: 10.3390/s24144749.
3
A Comprehensive Guide to Artificial Intelligence in Endoscopic Ultrasound.《内镜超声人工智能综合指南》
J Clin Med. 2023 May 30;12(11):3757. doi: 10.3390/jcm12113757.
4
Deep clustering for abdominal organ classification in ultrasound imaging.用于超声成像中腹部器官分类的深度聚类
J Med Imaging (Bellingham). 2023 May;10(3):034502. doi: 10.1117/1.JMI.10.3.034502. Epub 2023 May 18.
5
Prediction of hyperkalemia in ESRD patients by identification of multiple leads and multiple features on ECG.通过识别心电图上的多个导联和多个特征预测 ESRD 患者的高钾血症。
Ren Fail. 2023 Dec;45(1):2212800. doi: 10.1080/0886022X.2023.2212800.