• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过模拟超声医师视觉注意力进行超声图像表征学习

Ultrasound Image Representation Learning by Modeling Sonographer Visual Attention.

作者信息

Droste Richard, Cai Yifan, Sharma Harshita, Chatelain Pierre, Drukker Lior, Papageorghiou Aris T, Noble J Alison

机构信息

Department of Engineering Science, University of Oxford, UK.

Nuffield Department of Women's & Reproductive Health, University of Oxford, UK.

出版信息

Inf Process Med Imaging. 2019 Jun;26:592-604. doi: 10.1007/978-3-030-20351-1_46. Epub 2019 May 22.

DOI:10.1007/978-3-030-20351-1_46
PMID:31992944
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6986905/
Abstract

Image representations are commonly learned from class labels, which are a simplistic approximation of human image understanding. In this paper we demonstrate that transferable representations of images can be learned without manual annotations by modeling human visual attention. The basis of our analyses is a unique gaze tracking dataset of sonographers performing routine clinical fetal anomaly screenings. Models of sonographer visual attention are learned by training a convolutional neural network (CNN) to predict gaze on ultrasound video frames through visual saliency prediction or gaze-point regression. We evaluate the transferability of the learned representations to the task of ultrasound standard plane detection in two contexts. Firstly, we perform transfer learning by fine-tuning the CNN with a limited number of labeled standard plane images. We find that fine-tuning the saliency predictor is superior to training from random initialization, with an average F1-score improvement of 9.6% overall and 15.3% for the cardiac planes. Secondly, we train a simple softmax regression on the feature activations of each CNN layer in order to evaluate the representations independently of transfer learning hyper-parameters. We find that the attention models derive strong representations, approaching the precision of a fully-supervised baseline model for all but the last layer.

摘要

图像表示通常是从类别标签中学习的,而类别标签是对人类图像理解的一种简单近似。在本文中,我们证明了通过对人类视觉注意力进行建模,可以在无需人工标注的情况下学习图像的可迁移表示。我们分析的基础是一个独特的注视跟踪数据集,该数据集来自进行常规临床胎儿异常筛查的超声医师。通过训练卷积神经网络(CNN)以通过视觉显著性预测或注视点回归来预测超声视频帧上的注视,从而学习超声医师视觉注意力模型。我们在两种情况下评估所学表示对超声标准平面检测任务的可迁移性。首先,我们通过使用有限数量的带标签的标准平面图像对CNN进行微调来执行迁移学习。我们发现,微调显著性预测器优于从随机初始化开始训练,总体平均F1分数提高了9.6%,心脏平面提高了15.3%。其次,我们在每个CNN层的特征激活上训练一个简单的softmax回归,以便独立于迁移学习超参数来评估这些表示。我们发现注意力模型能够得出强大的表示,除最后一层外,所有层的精度都接近完全监督基线模型。

相似文献

1
Ultrasound Image Representation Learning by Modeling Sonographer Visual Attention.通过模拟超声医师视觉注意力进行超声图像表征学习
Inf Process Med Imaging. 2019 Jun;26:592-604. doi: 10.1007/978-3-030-20351-1_46. Epub 2019 May 22.
2
Discovering Salient Anatomical Landmarks by Predicting Human Gaze.通过预测人类视线来发现显著的解剖学标志。
Proc IEEE Int Symp Biomed Imaging. 2020 Apr 3;2020:1711-1714. doi: 10.1109/ISBI45749.2020.9098505.
3
Spatio-temporal visual attention modelling of standard biometry plane-finding navigation.标准生物测量平面定位导航的时空视觉注意力建模。
Med Image Anal. 2020 Oct;65:101762. doi: 10.1016/j.media.2020.101762. Epub 2020 Jun 20.
4
Multi-task SonoEyeNet: Detection of Fetal Standardized Planes Assisted by Generated Sonographer Attention Maps.多任务超声眼动网络:借助生成的超声医师注意力图检测胎儿标准平面
Med Image Comput Comput Assist Interv. 2018 Sep;11070:871-879. doi: 10.1007/978-3-030-00928-1_98. Epub 2018 Sep 26.
5
First Trimester Gaze Pattern Estimation Using Stochastic Augmentation Policy Search for Single Frame Saliency Prediction.使用随机增强策略搜索进行单帧显著性预测的孕早期注视模式估计
Med Image Underst Anal (2021). 2021 Jul;2021:361-374. doi: 10.1007/978-3-030-80432-9_28. Epub 2021 Jul 6.
6
SonoEyeNet: Standardized Fetal Ultrasound Plane Detection Informed by Eye Tracking.SonoEyeNet:基于眼动追踪的标准化胎儿超声平面检测
Proc IEEE Int Symp Biomed Imaging. 2018 Apr;2018:1475-1478. doi: 10.1109/ISBI.2018.8363851. Epub 2018 May 24.
7
Self-Supervised Representation Learning for Ultrasound Video.超声视频的自监督表征学习
Proc IEEE Int Symp Biomed Imaging. 2020 Apr 3;2020:1847-1850. doi: 10.1109/ISBI45749.2020.9098666.
8
Gaze-assisted automatic captioning of fetal ultrasound videos using three-way multi-modal deep neural networks.使用三向多模态深度神经网络的胎儿超声视频注视辅助自动字幕生成。
Med Image Anal. 2022 Nov;82:102630. doi: 10.1016/j.media.2022.102630. Epub 2022 Sep 17.
9
Self-supervised Contrastive Video-Speech Representation Learning for Ultrasound.用于超声的自监督对比视频-语音表征学习
Med Image Comput Comput Assist Interv. 2020 Oct;12263:534-543. doi: 10.1007/978-3-030-59716-0_51.
10
Correspondence between Monkey Visual Cortices and Layers of a Saliency Map Model Based on a Deep Convolutional Neural Network for Representations of Natural Images.基于深度卷积神经网络的自然图像表示的显著性映射模型的猴子视觉皮层与层之间的对应关系。
eNeuro. 2021 Feb 9;8(1). doi: 10.1523/ENEURO.0200-20.2020. Print 2021 Jan-Feb.

引用本文的文献

1
Audio-visual modelling in a clinical setting.临床环境中的视听建模。
Sci Rep. 2024 Jul 6;14(1):15569. doi: 10.1038/s41598-024-66160-4.
2
The Use of Machine Learning in Eye Tracking Studies in Medical Imaging: A Review.机器学习在医学成像眼动研究中的应用:综述。
IEEE J Biomed Health Inform. 2024 Jun;28(6):3597-3612. doi: 10.1109/JBHI.2024.3371893. Epub 2024 Jun 6.
3
Gaze-probe joint guidance with multi-task learning in obstetric ultrasound scanning.基于多任务学习的产科超声扫描中注视-探头联合引导
Med Image Anal. 2023 Dec;90:102981. doi: 10.1016/j.media.2023.102981. Epub 2023 Sep 29.
4
Anatomy-Aware Contrastive Representation Learning for Fetal Ultrasound.用于胎儿超声的解剖学感知对比表示学习
Comput Vis ECCV. 2022 Oct;2022:422-436. doi: 10.1007/978-3-031-25066-8_23.
5
Self-supervised learning for medical image classification: a systematic review and implementation guidelines.用于医学图像分类的自监督学习:系统综述与实施指南
NPJ Digit Med. 2023 Apr 26;6(1):74. doi: 10.1038/s41746-023-00811-0.
6
Multimodal-GuideNet: Gaze-Probe Bidirectional Guidance in Obstetric Ultrasound Scanning.多模态引导网络:产科超声扫描中的注视探测双向引导
Med Image Comput Comput Assist Interv. 2022 Sep 17;13437:94-103. doi: 10.1007/978-3-031-16449-1_10.
7
Multimodal Continual Learning with Sonographer Eye-Tracking in Fetal Ultrasound.基于超声医师眼动追踪的胎儿超声多模态持续学习
Simpl Med Ultrasound (2021). 2021 Sep 21;12967:14-24. doi: 10.1007/978-3-030-87583-1_2.
8
Transforming obstetric ultrasound into data science using eye tracking, voice recording, transducer motion and ultrasound video.利用眼动追踪、语音记录、探头运动和超声视频将产科超声转化为数据科学。
Sci Rep. 2021 Jul 8;11(1):14109. doi: 10.1038/s41598-021-92829-1.
9
Artificial Intelligence-Based Multiclass Classification of Benign or Malignant Mucosal Lesions of the Stomach.基于人工智能的胃黏膜良性或恶性病变的多分类
Front Pharmacol. 2020 Oct 2;11:572372. doi: 10.3389/fphar.2020.572372. eCollection 2020.
10
Self-Supervised Representation Learning for Ultrasound Video.超声视频的自监督表征学习
Proc IEEE Int Symp Biomed Imaging. 2020 Apr 3;2020:1847-1850. doi: 10.1109/ISBI45749.2020.9098666.

本文引用的文献

1
Squeeze-and-Excitation Networks.挤压激励网络。
IEEE Trans Pattern Anal Mach Intell. 2020 Aug;42(8):2011-2023. doi: 10.1109/TPAMI.2019.2913372. Epub 2019 Apr 29.
2
Multi-task SonoEyeNet: Detection of Fetal Standardized Planes Assisted by Generated Sonographer Attention Maps.多任务超声眼动网络:借助生成的超声医师注意力图检测胎儿标准平面
Med Image Comput Comput Assist Interv. 2018 Sep;11070:871-879. doi: 10.1007/978-3-030-00928-1_98. Epub 2018 Sep 26.
3
Evaluation of Gaze Tracking Calibration for Longitudinal Biomedical Imaging Studies.纵向生物医学成像研究中的注视跟踪校准评估。
IEEE Trans Cybern. 2020 Jan;50(1):153-163. doi: 10.1109/TCYB.2018.2866274. Epub 2018 Sep 5.
4
SonoNet: Real-Time Detection and Localisation of Fetal Standard Scan Planes in Freehand Ultrasound.SonoNet:徒手超声中胎儿标准扫描平面的实时检测与定位
IEEE Trans Med Imaging. 2017 Nov;36(11):2204-2215. doi: 10.1109/TMI.2017.2712367. Epub 2017 Jul 11.
5
DeepFix: A Fully Convolutional Neural Network for Predicting Human Eye Fixations.DeepFix:一种用于预测人眼注视点的全卷积神经网络。
IEEE Trans Image Process. 2017 Sep;26(9):4446-4456. doi: 10.1109/TIP.2017.2710620.
6
Discriminative Unsupervised Feature Learning with Exemplar Convolutional Neural Networks.基于示例卷积神经网络的判别式无监督特征学习。
IEEE Trans Pattern Anal Mach Intell. 2016 Sep;38(9):1734-47. doi: 10.1109/TPAMI.2015.2496141. Epub 2015 Oct 29.
7
Guidance of visual attention by semantic information in real-world scenes.现实场景中语义信息对视觉注意力的引导
Front Psychol. 2014 Feb 6;5:54. doi: 10.3389/fpsyg.2014.00054. eCollection 2014.