• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种结合了迁移学习的图像标注本地化策略。

A localization strategy combined with transfer learning for image annotation.

机构信息

Information Technology, SEGi University, Kota Damansara, Petaling Jaya, Selangor, Malaysia.

Department of Computer Science and Technology, Xinzhou Teachers University, Xinzhou, China.

出版信息

PLoS One. 2021 Dec 8;16(12):e0260758. doi: 10.1371/journal.pone.0260758. eCollection 2021.

DOI:10.1371/journal.pone.0260758
PMID:34879097
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8654185/
Abstract

This study aims to solve the overfitting problem caused by insufficient labeled images in the automatic image annotation field. We propose a transfer learning model called CNN-2L that incorporates the label localization strategy described in this study. The model consists of an InceptionV3 network pretrained on the ImageNet dataset and a label localization algorithm. First, the pretrained InceptionV3 network extracts features from the target dataset that are used to train a specific classifier and fine-tune the entire network to obtain an optimal model. Then, the obtained model is used to derive the probabilities of the predicted labels. For this purpose, we introduce a squeeze and excitation (SE) module into the network architecture that augments the useful feature information, inhibits useless feature information, and conducts feature reweighting. Next, we perform label localization to obtain the label probabilities and determine the final label set for each image. During this process, the number of labels must be determined. The optimal K value is obtained experimentally and used to determine the number of predicted labels, thereby solving the empty label set problem that occurs when the predicted label values of images are below a fixed threshold. Experiments on the Corel5k multilabel image dataset verify that CNN-2L improves the labeling precision by 18% and 15% compared with the traditional multiple-Bernoulli relevance model (MBRM) and joint equal contribution (JEC) algorithms, respectively, and it improves the recall by 6% compared with JEC. Additionally, it improves the precision by 20% and 11% compared with the deep learning methods Weight-KNN and adaptive hypergraph learning (AHL), respectively. Although CNN-2L fails to improve the recall compared with the semantic extension model (SEM), it improves the comprehensive index of the F1 value by 1%. The experimental results reveal that the proposed transfer learning model based on a label localization strategy is effective for automatic image annotation and substantially boosts the multilabel image annotation performance.

摘要

这项研究旨在解决自动图像标注领域中因标注图像不足而导致的过拟合问题。我们提出了一种称为 CNN-2L 的迁移学习模型,该模型结合了本研究中描述的标签定位策略。该模型由在 ImageNet 数据集上预训练的 InceptionV3 网络和标签定位算法组成。首先,预训练的 InceptionV3 网络从目标数据集提取特征,用于训练特定的分类器,并微调整个网络以获得最佳模型。然后,使用获得的模型来推导出预测标签的概率。为此,我们在网络架构中引入了一个挤压激励 (SE) 模块,该模块增强了有用的特征信息,抑制了无用的特征信息,并进行了特征重新加权。接下来,我们执行标签定位以获得标签概率,并确定每张图像的最终标签集。在此过程中,必须确定标签的数量。通过实验获得最佳 K 值,并用于确定预测标签的数量,从而解决了当图像的预测标签值低于固定阈值时出现的空标签集问题。在 Corel5k 多标签图像数据集上的实验验证了,与传统的多伯努利关联模型(MBRM)和联合均等贡献(JEC)算法相比,CNN-2L 分别提高了 18%和 15%的标注精度,与 JEC 相比,召回率提高了 6%。此外,与深度学习方法 Weight-KNN 和自适应超图学习(AHL)相比,精度分别提高了 20%和 11%。虽然 CNN-2L 在与语义扩展模型(SEM)的召回率相比没有提高,但它将 F1 值的综合指标提高了 1%。实验结果表明,所提出的基于标签定位策略的迁移学习模型对于自动图像标注是有效的,并大大提高了多标签图像标注性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/118a/8654185/1cf8e70e311f/pone.0260758.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/118a/8654185/c951df333915/pone.0260758.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/118a/8654185/99b6a57250be/pone.0260758.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/118a/8654185/d723f9b767f7/pone.0260758.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/118a/8654185/e33172f1c298/pone.0260758.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/118a/8654185/5cb8998cc110/pone.0260758.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/118a/8654185/c6054d0e7ce0/pone.0260758.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/118a/8654185/1cf8e70e311f/pone.0260758.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/118a/8654185/c951df333915/pone.0260758.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/118a/8654185/99b6a57250be/pone.0260758.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/118a/8654185/d723f9b767f7/pone.0260758.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/118a/8654185/e33172f1c298/pone.0260758.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/118a/8654185/5cb8998cc110/pone.0260758.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/118a/8654185/c6054d0e7ce0/pone.0260758.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/118a/8654185/1cf8e70e311f/pone.0260758.g007.jpg

相似文献

1
A localization strategy combined with transfer learning for image annotation.一种结合了迁移学习的图像标注本地化策略。
PLoS One. 2021 Dec 8;16(12):e0260758. doi: 10.1371/journal.pone.0260758. eCollection 2021.
2
Automatic image annotation method based on a convolutional neural network with threshold optimization.基于阈值优化卷积神经网络的自动图像标注方法。
PLoS One. 2020 Sep 23;15(9):e0238956. doi: 10.1371/journal.pone.0238956. eCollection 2020.
3
FlyIT: Drosophila Embryogenesis Image Annotation based on Image Tiling and Convolutional Neural Networks.FlyIT:基于图像分块和卷积神经网络的果蝇胚胎发育图像标注
IEEE/ACM Trans Comput Biol Bioinform. 2021 Jan-Feb;18(1):194-204. doi: 10.1109/TCBB.2019.2935723. Epub 2021 Feb 3.
4
Evaluation of a multiview architecture for automatic vertebral labeling of palliative radiotherapy simulation CT images.评估一种多视图架构,用于自动标记姑息性放疗模拟 CT 图像的椎体。
Med Phys. 2020 Nov;47(11):5592-5608. doi: 10.1002/mp.14415. Epub 2020 Sep 15.
5
CM-supplement network model for reducing the memory consumption during multilabel image annotation.基于 CM 补充网络模型降低多标签图像标注中的内存消耗。
PLoS One. 2020 Jun 1;15(6):e0234014. doi: 10.1371/journal.pone.0234014. eCollection 2020.
6
Deep-LIFT: Deep Label-Specific Feature Learning for Image Annotation.深度提升:用于图像标注的深度特定标签特征学习
IEEE Trans Cybern. 2022 Aug;52(8):7732-7741. doi: 10.1109/TCYB.2021.3049630. Epub 2022 Jul 19.
7
Targeted transfer learning to improve performance in small medical physics datasets.靶向迁移学习以提高小型医学物理数据集的性能。
Med Phys. 2020 Dec;47(12):6246-6256. doi: 10.1002/mp.14507. Epub 2020 Oct 25.
8
MAIA-A machine learning assisted image annotation method for environmental monitoring and exploration.MAIA-一种用于环境监测和探索的机器学习辅助图像标注方法。
PLoS One. 2018 Nov 16;13(11):e0207498. doi: 10.1371/journal.pone.0207498. eCollection 2018.
9
A deep learning- and partial least square regression-based model observer for a low-contrast lesion detection task in CT.基于深度学习和偏最小二乘回归的 CT 低对比度病灶检测任务模型观察器。
Med Phys. 2019 May;46(5):2052-2063. doi: 10.1002/mp.13500. Epub 2019 Apr 1.
10
The Role of Knowledge Creation-Oriented Convolutional Neural Network in Learning Interaction.面向知识创造的卷积神经网络在学习交互中的作用。
Comput Intell Neurosci. 2022 Mar 16;2022:6493311. doi: 10.1155/2022/6493311. eCollection 2022.

本文引用的文献

1
Squeeze-and-Excitation Networks.挤压激励网络。
IEEE Trans Pattern Anal Mach Intell. 2020 Aug;42(8):2011-2023. doi: 10.1109/TPAMI.2019.2913372. Epub 2019 Apr 29.
2
Using texture to analyze and manage large collections of remote sensed image and video data.利用纹理分析和管理大量遥感图像和视频数据。
Appl Opt. 2004 Jan 10;43(2):210-7. doi: 10.1364/ao.43.000210.