• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

EnNet:具有零阶优化的增强型交互式信息网络。

EnNet: Enhanced Interactive Information Network with Zero-Order Optimization.

作者信息

Shao Yingzhao, Chen Yanxin, Yang Pengfei, Cheng Fei

机构信息

State Key Laboratory of Integrated Services Networks, Xidian University, Xi'an 710071, China.

Key Laboratory of Smart Human-Computer Interaction and Wearable Technology of Shaanxi Province, Xi'an 710071, China.

出版信息

Sensors (Basel). 2024 Sep 30;24(19):6361. doi: 10.3390/s24196361.

DOI:10.3390/s24196361
PMID:39409398
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11478929/
Abstract

Interactive image segmentation extremely accelerates the generation of high-quality annotation image datasets, which are the pillars of the applications of deep learning. However, these methods suffer from the insignificance of interaction information and excessively high optimization costs, resulting in unexpected segmentation outcomes and increased computational burden. To address these issues, this paper focuses on interactive information mining from the network architecture and optimization procedure. In terms of network architecture, the issue mentioned above arises from two perspectives: the less representative feature of interactive regions in each layer and the interactive information weakened by the network hierarchy structure. Therefore, the paper proposes a network called EnNet. The network addresses the two aforementioned issues by employing attention mechanisms to integrate user interaction information across the entire image and incorporating interaction information twice in a design that progresses from coarse to fine. In terms of optimization, this paper proposes a method of using zero-order optimization during the first four iterations of training. This approach can reduce computational overhead with only a minimal reduction in accuracy. The experimental results on GrabCut, Berkeley, DAVIS, and SBD datasets validate the effectiveness of the proposed method, with our approach achieving an average NOC@90 that surpasses RITM by 0.35.

摘要

交互式图像分割极大地加速了高质量标注图像数据集的生成,而这些数据集是深度学习应用的支柱。然而,这些方法存在交互信息不显著以及优化成本过高的问题,导致分割结果不理想且计算负担增加。为了解决这些问题,本文着重于从网络架构和优化过程中挖掘交互信息。在网络架构方面,上述问题从两个角度产生:每层中交互区域的特征代表性不足,以及网络层次结构削弱了交互信息。因此,本文提出了一种名为EnNet的网络。该网络通过采用注意力机制在整个图像上整合用户交互信息,并在从粗到精的设计中两次纳入交互信息,解决了上述两个问题。在优化方面,本文提出了一种在训练的前四次迭代中使用零阶优化的方法。这种方法可以减少计算开销,同时精度仅有极小的降低。在GrabCut、Berkeley、DAVIS和SBD数据集上的实验结果验证了所提方法的有效性,我们的方法实现的平均NOC@90比RITM高出0.35。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ffd/11478929/111ad7683f7f/sensors-24-06361-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ffd/11478929/4ef3cb376abb/sensors-24-06361-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ffd/11478929/4d8405be79a5/sensors-24-06361-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ffd/11478929/c492a003db1e/sensors-24-06361-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ffd/11478929/c3f76e2e85f6/sensors-24-06361-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ffd/11478929/cdbae26dd131/sensors-24-06361-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ffd/11478929/da3e4468fd15/sensors-24-06361-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ffd/11478929/0148ec4e6267/sensors-24-06361-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ffd/11478929/b2eeb75564b6/sensors-24-06361-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ffd/11478929/dacb1b1a2196/sensors-24-06361-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ffd/11478929/5785f9be0735/sensors-24-06361-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ffd/11478929/463acbf77e96/sensors-24-06361-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ffd/11478929/111ad7683f7f/sensors-24-06361-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ffd/11478929/4ef3cb376abb/sensors-24-06361-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ffd/11478929/4d8405be79a5/sensors-24-06361-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ffd/11478929/c492a003db1e/sensors-24-06361-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ffd/11478929/c3f76e2e85f6/sensors-24-06361-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ffd/11478929/cdbae26dd131/sensors-24-06361-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ffd/11478929/da3e4468fd15/sensors-24-06361-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ffd/11478929/0148ec4e6267/sensors-24-06361-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ffd/11478929/b2eeb75564b6/sensors-24-06361-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ffd/11478929/dacb1b1a2196/sensors-24-06361-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ffd/11478929/5785f9be0735/sensors-24-06361-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ffd/11478929/463acbf77e96/sensors-24-06361-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4ffd/11478929/111ad7683f7f/sensors-24-06361-g012.jpg

相似文献

1
EnNet: Enhanced Interactive Information Network with Zero-Order Optimization.EnNet:具有零阶优化的增强型交互式信息网络。
Sensors (Basel). 2024 Sep 30;24(19):6361. doi: 10.3390/s24196361.
2
Interactive segmentation of medical images using deep learning.基于深度学习的医学图像交互式分割。
Phys Med Biol. 2024 Feb 5;69(4). doi: 10.1088/1361-6560/ad1cf8.
3
Semi-supervised abdominal multi-organ segmentation by object-redrawing.通过对象重绘实现半监督腹部多器官分割
Med Phys. 2024 Nov;51(11):8334-8347. doi: 10.1002/mp.17364. Epub 2024 Aug 21.
4
An Interactive Image Segmentation Method Based on Multi-Level Semantic Fusion.一种基于多级语义融合的交互式图像分割方法。
Sensors (Basel). 2023 Jul 14;23(14):6394. doi: 10.3390/s23146394.
5
Interactive Few-Shot Learning: Limited Supervision, Better Medical Image Segmentation.交互式Few-Shot 学习:有限监督,更好的医学图像分割。
IEEE Trans Med Imaging. 2021 Oct;40(10):2575-2588. doi: 10.1109/TMI.2021.3060551. Epub 2021 Sep 30.
6
Interactive Cascaded Network for Prostate Cancer Segmentation from Multimodality MRI with Automated Quality Assessment.用于多模态磁共振成像前列腺癌分割及自动质量评估的交互式级联网络
Bioengineering (Basel). 2024 Aug 6;11(8):796. doi: 10.3390/bioengineering11080796.
7
A dual autoencoder and singular value decomposition based feature optimization for the segmentation of brain tumor from MRI images.基于双自动编码器和奇异值分解的特征优化在 MRI 图像脑部肿瘤分割中的应用。
BMC Med Imaging. 2021 May 13;21(1):82. doi: 10.1186/s12880-021-00614-3.
8
Feature relocation network for fine-grained image classification.用于细粒度图像分类的特征重定位网络。
Neural Netw. 2023 Apr;161:306-317. doi: 10.1016/j.neunet.2023.01.050. Epub 2023 Feb 4.
9
Dual-branch Transformer for semi-supervised medical image segmentation.双分支Transformer 用于半监督医学图像分割。
J Appl Clin Med Phys. 2024 Oct;25(10):e14483. doi: 10.1002/acm2.14483. Epub 2024 Aug 12.
10
Semi-supervised breast cancer pathology image segmentation based on fine-grained classification guidance.基于细粒度分类指导的半监督乳腺癌病理图像分割。
Med Biol Eng Comput. 2024 Mar;62(3):901-912. doi: 10.1007/s11517-023-02970-4. Epub 2023 Dec 12.

本文引用的文献

1
A fully convolutional two-stream fusion network for interactive image segmentation.用于交互式图像分割的全卷积双流融合网络。
Neural Netw. 2019 Jan;109:31-42. doi: 10.1016/j.neunet.2018.10.009. Epub 2018 Oct 21.
2
Robust interactive image segmentation using convex active contours.基于凸主动轮廓的鲁棒交互式图像分割。
IEEE Trans Image Process. 2012 Aug;21(8):3734-43. doi: 10.1109/TIP.2012.2191566. Epub 2012 Mar 21.
3
Isoperimetric graph partitioning for image segmentation.用于图像分割的等周图划分
IEEE Trans Pattern Anal Mach Intell. 2006 Mar;28(3):469-75. doi: 10.1109/TPAMI.2006.57.