• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于局部或全局不变量的卷积神经网络在几何形状分类中的学习迁移

Transfer of Learning in the Convolutional Neural Networks on Classifying Geometric Shapes Based on Local or Global Invariants.

作者信息

Zheng Yufeng, Huang Jun, Chen Tianwen, Ou Yang, Zhou Wu

机构信息

Department of Data Science, University of Mississippi Medical Centre, Jackson, MS, United States.

Department of Otolaryngology-Head and Neck Surgery, University of Mississippi Medical Centre, Jackson, MS, United States.

出版信息

Front Comput Neurosci. 2021 Feb 19;15:637144. doi: 10.3389/fncom.2021.637144. eCollection 2021.

DOI:10.3389/fncom.2021.637144
PMID:33679359
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7935523/
Abstract

The convolutional neural networks (CNNs) are a powerful tool of image classification that has been widely adopted in applications of automated scene segmentation and identification. However, the mechanisms underlying CNN image classification remain to be elucidated. In this study, we developed a new approach to address this issue by investigating transfer of learning in representative CNNs (AlexNet, VGG, ResNet-101, and Inception-ResNet-v2) on classifying geometric shapes based on local/global features or invariants. While the local features are based on simple components, such as orientation of line segment or whether two lines are parallel, the global features are based on the whole object such as whether an object has a hole or whether an object is inside of another object. Six experiments were conducted to test two hypotheses on CNN shape classification. The first hypothesis is that transfer of learning based on local features is higher than transfer of learning based on global features. The second hypothesis is that the CNNs with more layers and advanced architectures have higher transfer of learning based global features. The first two experiments examined how the CNNs transferred learning of discriminating local features (square, rectangle, trapezoid, and parallelogram). The other four experiments examined how the CNNs transferred learning of discriminating global features (presence of a hole, connectivity, and inside/outside relationship). While the CNNs exhibited robust learning on classifying shapes, transfer of learning varied from task to task, and model to model. The results rejected both hypotheses. First, some CNNs exhibited lower transfer of learning based on local features than that based on global features. Second the advanced CNNs exhibited lower transfer of learning on global features than that of the earlier models. Among the tested geometric features, we found that learning of discriminating inside/outside relationship was the most difficult to be transferred, indicating an effective benchmark to develop future CNNs. In contrast to the "ImageNet" approach that employs natural images to train and analyze the CNNs, the results show proof of concept for the "ShapeNet" approach that employs well-defined geometric shapes to elucidate the strengths and limitations of the computation in CNN image classification. This "ShapeNet" approach will also provide insights into understanding visual information processing the primate visual systems.

摘要

卷积神经网络(CNNs)是一种强大的图像分类工具,已广泛应用于自动场景分割和识别领域。然而,CNN图像分类背后的机制仍有待阐明。在本研究中,我们开发了一种新方法来解决这个问题,即通过研究代表性CNN(AlexNet、VGG、ResNet - 101和Inception - ResNet - v2)在基于局部/全局特征或不变量对几何形状进行分类时的学习迁移情况。局部特征基于简单的组件,如线段的方向或两条线是否平行,而全局特征基于整个对象,如对象是否有孔或一个对象是否在另一个对象内部。进行了六个实验来检验关于CNN形状分类的两个假设。第一个假设是基于局部特征的学习迁移高于基于全局特征的学习迁移。第二个假设是具有更多层和先进架构的CNN基于全局特征的学习迁移更高。前两个实验研究了CNN如何迁移区分局部特征(正方形、长方形、梯形和平行四边形)的学习。另外四个实验研究了CNN如何迁移区分全局特征(是否有孔、连通性以及内外关系)的学习。虽然CNN在形状分类上表现出强大的学习能力,但学习迁移因任务和模型而异。结果否定了这两个假设。首先,一些CNN基于局部特征的学习迁移低于基于全局特征的学习迁移。其次,先进的CNN基于全局特征的学习迁移低于早期模型。在测试的几何特征中,我们发现区分内外关系的学习最难迁移,这为未来CNN的发展提供了一个有效的基准。与采用自然图像训练和分析CNN的“ImageNet”方法不同,结果证明了采用定义明确的几何形状来阐明CNN图像分类中计算的优势和局限性的“ShapeNet”方法的概念验证。这种“ShapeNet”方法也将为理解灵长类视觉系统中的视觉信息处理提供见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ef6/7935523/f51302d7456b/fncom-15-637144-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ef6/7935523/16c2626ddd94/fncom-15-637144-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ef6/7935523/e2a4fc061ed6/fncom-15-637144-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ef6/7935523/f880d40719fb/fncom-15-637144-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ef6/7935523/84da41413897/fncom-15-637144-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ef6/7935523/620e41ac6205/fncom-15-637144-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ef6/7935523/50d3253a9390/fncom-15-637144-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ef6/7935523/f51302d7456b/fncom-15-637144-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ef6/7935523/16c2626ddd94/fncom-15-637144-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ef6/7935523/e2a4fc061ed6/fncom-15-637144-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ef6/7935523/f880d40719fb/fncom-15-637144-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ef6/7935523/84da41413897/fncom-15-637144-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ef6/7935523/620e41ac6205/fncom-15-637144-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ef6/7935523/50d3253a9390/fncom-15-637144-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ef6/7935523/f51302d7456b/fncom-15-637144-g0007.jpg

相似文献

1
Transfer of Learning in the Convolutional Neural Networks on Classifying Geometric Shapes Based on Local or Global Invariants.基于局部或全局不变量的卷积神经网络在几何形状分类中的学习迁移
Front Comput Neurosci. 2021 Feb 19;15:637144. doi: 10.3389/fncom.2021.637144. eCollection 2021.
2
A failure to learn object shape geometry: Implications for convolutional neural networks as plausible models of biological vision.未能学习物体形状几何:对卷积神经网络作为生物视觉合理模型的影响。
Vision Res. 2021 Dec;189:81-92. doi: 10.1016/j.visres.2021.09.004. Epub 2021 Oct 8.
3
Representations of regular and irregular shapes by deep Convolutional Neural Networks, monkey inferotemporal neurons and human judgments.深度卷积神经网络、猴子下颞叶神经元和人类判断对规则和不规则形状的表示。
PLoS Comput Biol. 2018 Oct 26;14(10):e1006557. doi: 10.1371/journal.pcbi.1006557. eCollection 2018 Oct.
4
Local features and global shape information in object classification by deep convolutional neural networks.深度卷积神经网络在目标分类中的局部特征和全局形状信息。
Vision Res. 2020 Jul;172:46-61. doi: 10.1016/j.visres.2020.04.003. Epub 2020 May 12.
5
A deep learning framework for automatic detection of arbitrarily shaped fiducial markers in intrafraction fluoroscopic images.一种用于在分次透视图像中自动检测任意形状基准标记的深度学习框架。
Med Phys. 2019 May;46(5):2286-2297. doi: 10.1002/mp.13519. Epub 2019 Apr 15.
6
Exploring convolutional neural networks with transfer learning for diagnosing Lyme disease from skin lesion images.利用迁移学习探索卷积神经网络,以从皮肤损伤图像诊断莱姆病。
Comput Methods Programs Biomed. 2022 Mar;215:106624. doi: 10.1016/j.cmpb.2022.106624. Epub 2022 Jan 10.
7
Evaluation of Transfer Learning with Deep Convolutional Neural Networks for Screening Osteoporosis in Dental Panoramic Radiographs.基于深度卷积神经网络的迁移学习在牙科全景X光片中筛查骨质疏松症的评估
J Clin Med. 2020 Feb 1;9(2):392. doi: 10.3390/jcm9020392.
8
An Ensemble of Fine-Tuned Convolutional Neural Networks for Medical Image Classification.用于医学图像分类的微调卷积神经网络集成
IEEE J Biomed Health Inform. 2017 Jan;21(1):31-40. doi: 10.1109/JBHI.2016.2635663. Epub 2016 Dec 5.
9
Study of the Application of Deep Convolutional Neural Networks (CNNs) in Processing Sensor Data and Biomedical Images.深度学习卷积神经网络(CNNs)在传感器数据和生物医学图像处理中的应用研究。
Sensors (Basel). 2019 Aug 17;19(16):3584. doi: 10.3390/s19163584.
10
Convolutional neural networks for Alzheimer's disease detection on MRI images.用于基于MRI图像检测阿尔茨海默病的卷积神经网络。
J Med Imaging (Bellingham). 2021 Mar;8(2):024503. doi: 10.1117/1.JMI.8.2.024503. Epub 2021 Apr 29.

本文引用的文献

1
Rapid Processing of a Global Feature in the ON Visual Pathways of Behaving Monkeys.行为猕猴视通路中全局特征的快速处理
Front Neurosci. 2017 Aug 25;11:474. doi: 10.3389/fnins.2017.00474. eCollection 2017.
2
Topological structure in visual perception.视觉感知中的拓扑结构。
Science. 1982 Nov 12;218(4573):699-700. doi: 10.1126/science.7134969.
3
Visual detection of line segments: an object-superiority effect.线段的视觉检测:一种客体优势效应。
Science. 1974 Nov 22;186(4165):752-5. doi: 10.1126/science.186.4165.752.
4
Holes and wholes: a reply to Rubin and Kanwisher.空洞与整体:对鲁宾和坎维舍的回应
Percept Psychophys. 1990 Jan;47(1):47-53. doi: 10.3758/bf03208163.
5
Ferrier lecture. Functional architecture of macaque monkey visual cortex.费里尔讲座。猕猴视觉皮层的功能结构
Proc R Soc Lond B Biol Sci. 1977 Jul 28;198(1130):1-59. doi: 10.1098/rspb.1977.0085.