• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于卷积神经网络的多视图分类。

Multi-view classification with convolutional neural networks.

机构信息

Institute for Computer and Systems Engineering, Technische Universität Ilmenau, Ilmenau, Germany.

出版信息

PLoS One. 2021 Jan 12;16(1):e0245230. doi: 10.1371/journal.pone.0245230. eCollection 2021.

DOI:10.1371/journal.pone.0245230
PMID:33434208
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7802953/
Abstract

Humans' decision making process often relies on utilizing visual information from different views or perspectives. However, in machine-learning-based image classification we typically infer an object's class from just a single image showing an object. Especially for challenging classification problems, the visual information conveyed by a single image may be insufficient for an accurate decision. We propose a classification scheme that relies on fusing visual information captured through images depicting the same object from multiple perspectives. Convolutional neural networks are used to extract and encode visual features from the multiple views and we propose strategies for fusing these information. More specifically, we investigate the following three strategies: (1) fusing convolutional feature maps at differing network depths; (2) fusion of bottleneck latent representations prior to classification; and (3) score fusion. We systematically evaluate these strategies on three datasets from different domains. Our findings emphasize the benefit of integrating information fusion into the network rather than performing it by post-processing of classification scores. Furthermore, we demonstrate through a case study that already trained networks can be easily extended by the best fusion strategy, outperforming other approaches by large margin.

摘要

人类的决策过程通常依赖于利用来自不同视角的视觉信息。然而,在基于机器学习的图像分类中,我们通常仅从显示对象的单个图像推断对象的类别。特别是对于具有挑战性的分类问题,单个图像传达的视觉信息可能不足以做出准确的决策。我们提出了一种分类方案,该方案依赖于融合通过从多个视角拍摄的同一对象的图像捕获的视觉信息。使用卷积神经网络从多个视图中提取和编码视觉特征,并提出了融合这些信息的策略。更具体地说,我们研究了以下三种策略:(1)融合在不同网络深度的卷积特征图;(2)在分类之前融合瓶颈潜在表示;以及(3)得分融合。我们在来自不同领域的三个数据集上系统地评估了这些策略。我们的研究结果强调了将信息融合到网络中而不是通过分类分数的后处理来执行的好处。此外,我们通过案例研究证明,已经训练好的网络可以通过最佳融合策略轻松扩展,并且大大优于其他方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a1c2/7802953/e64d27056dc6/pone.0245230.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a1c2/7802953/5368155eef2c/pone.0245230.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a1c2/7802953/cf24ff2b74ec/pone.0245230.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a1c2/7802953/50355c55b96a/pone.0245230.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a1c2/7802953/4d5aaaa389ea/pone.0245230.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a1c2/7802953/e64d27056dc6/pone.0245230.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a1c2/7802953/5368155eef2c/pone.0245230.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a1c2/7802953/cf24ff2b74ec/pone.0245230.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a1c2/7802953/50355c55b96a/pone.0245230.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a1c2/7802953/4d5aaaa389ea/pone.0245230.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a1c2/7802953/e64d27056dc6/pone.0245230.g005.jpg

相似文献

1
Multi-view classification with convolutional neural networks.基于卷积神经网络的多视图分类。
PLoS One. 2021 Jan 12;16(1):e0245230. doi: 10.1371/journal.pone.0245230. eCollection 2021.
2
Decision Fusion-Based Fetal Ultrasound Image Plane Classification Using Convolutional Neural Networks.基于决策融合的卷积神经网络胎儿超声图像平面分类
Ultrasound Med Biol. 2019 May;45(5):1259-1273. doi: 10.1016/j.ultrasmedbio.2018.11.016. Epub 2019 Feb 27.
3
HTC-retina: A hybrid retinal diseases classification model using transformer-Convolutional Neural Network from optical coherence tomography images.HTC-retina:一种使用来自光学相干断层扫描图像的变压器-卷积神经网络的混合视网膜疾病分类模型。
Comput Biol Med. 2024 Aug;178:108726. doi: 10.1016/j.compbiomed.2024.108726. Epub 2024 Jun 9.
4
Transfer learning using a multi-scale and multi-network ensemble for skin lesion classification.使用多尺度和多网络集成的迁移学习进行皮肤病变分类。
Comput Methods Programs Biomed. 2020 Sep;193:105475. doi: 10.1016/j.cmpb.2020.105475. Epub 2020 Mar 21.
5
A novel fused convolutional neural network for biomedical image classification.一种用于生物医学图像分类的新型融合卷积神经网络。
Med Biol Eng Comput. 2019 Jan;57(1):107-121. doi: 10.1007/s11517-018-1819-y. Epub 2018 Jul 12.
6
Multi-Organ Plant Classification Based on Convolutional and Recurrent Neural Networks.基于卷积和循环神经网络的多器官植物分类。
IEEE Trans Image Process. 2018 Sep;27(9):4287-4301. doi: 10.1109/TIP.2018.2836321.
7
Breast ultrasound image segmentation: A coarse-to-fine fusion convolutional neural network.乳腺超声图像分割:一种粗到细融合的卷积神经网络。
Med Phys. 2021 Aug;48(8):4262-4278. doi: 10.1002/mp.15006. Epub 2021 Jul 29.
8
Skin lesion classification with ensembles of deep convolutional neural networks.基于深度卷积神经网络集成的皮肤损伤分类。
J Biomed Inform. 2018 Oct;86:25-32. doi: 10.1016/j.jbi.2018.08.006. Epub 2018 Aug 10.
9
A failure to learn object shape geometry: Implications for convolutional neural networks as plausible models of biological vision.未能学习物体形状几何:对卷积神经网络作为生物视觉合理模型的影响。
Vision Res. 2021 Dec;189:81-92. doi: 10.1016/j.visres.2021.09.004. Epub 2021 Oct 8.
10
Classification of multi-feature fusion ultrasound images of breast tumor within category 4 using convolutional neural networks.使用卷积神经网络对 4 类乳腺肿瘤多特征融合超声图像进行分类。
Med Phys. 2024 Jun;51(6):4243-4257. doi: 10.1002/mp.16946. Epub 2024 Mar 4.

引用本文的文献

1
Automatic fused multimodal deep learning for plant identification.用于植物识别的自动融合多模态深度学习
Front Plant Sci. 2025 Aug 5;16:1616020. doi: 10.3389/fpls.2025.1616020. eCollection 2025.
2
Successes and limitations of pretrained YOLO detectors applied to unseen time-lapse images for automated pollinator monitoring.预训练的YOLO检测器应用于未见过的延时图像进行自动传粉者监测的成功与局限
Sci Rep. 2025 Aug 21;15(1):30671. doi: 10.1038/s41598-025-16140-z.
3
A deep-learning model for predicting post-stroke cognitive impairment based on brain network damage.

本文引用的文献

1
Heterogeneous Network Representation Learning: A Unified Framework with Survey and Benchmark.异构网络表示学习:一个包含综述与基准测试的统一框架
IEEE Trans Knowl Data Eng. 2022 Oct;34(10):4854-4873. doi: 10.1109/tkde.2020.3045924. Epub 2020 Dec 21.
2
Flora Capture: a citizen science application for collecting structured plant observations.植物记录:一款用于收集结构化植物观测数据的公民科学应用程序。
BMC Bioinformatics. 2020 Dec 14;21(1):576. doi: 10.1186/s12859-020-03920-9.
3
CSID: A Novel Multimodal Image Fusion Algorithm for Enhanced Clinical Diagnosis.
一种基于脑网络损伤预测中风后认知障碍的深度学习模型。
Quant Imaging Med Surg. 2025 May 1;15(5):3964-3981. doi: 10.21037/qims-24-2010. Epub 2025 Apr 21.
4
Enhancing Apple Cultivar Classification Using Multiview Images.利用多视图图像增强苹果品种分类
J Imaging. 2024 Apr 17;10(4):94. doi: 10.3390/jimaging10040094.
5
FACES: A Deep-Learning-Based Parametric Model to Improve Rosacea Diagnoses.FACES:一种基于深度学习的参数模型,用于改善酒渣鼻诊断。
Appl Sci (Basel). 2023 Jan 2;13(2). doi: 10.3390/app13020970. Epub 2023 Jan 11.
6
DeePSC: A Deep Learning Model for Automated Diagnosis of Primary Sclerosing Cholangitis at Two-dimensional MR Cholangiopancreatography.DeePSC:一种用于二维磁共振胰胆管造影术自动诊断原发性硬化性胆管炎的深度学习模型。
Radiol Artif Intell. 2023 Apr 19;5(3):e220160. doi: 10.1148/ryai.220160. eCollection 2023 May.
7
Multi-view image-based behavior classification of wet-dog shake in Kainate rat model.基于多视图图像的红藻氨酸盐大鼠模型中湿狗摇行为分类
Front Behav Neurosci. 2023 May 2;17:1148549. doi: 10.3389/fnbeh.2023.1148549. eCollection 2023.
8
MODILM: towards better complex diseases classification using a novel multi-omics data integration learning model.MODILM:一种利用新型多组学数据整合学习模型改善复杂疾病分类的方法。
BMC Med Inform Decis Mak. 2023 May 5;23(1):82. doi: 10.1186/s12911-023-02173-9.
9
Predicting Daily Aerobiological Risk Level of Potato Late Blight Using C5.0 and Random Forest Algorithms under Field Conditions.基于现场条件下 C5.0 和随机森林算法预测马铃薯晚疫病的每日气传发病风险等级
Sensors (Basel). 2023 Apr 8;23(8):3818. doi: 10.3390/s23083818.
10
Benchmarking of analytical combinations for COVID-19 outcome prediction using single-cell RNA sequencing data.使用单细胞 RNA 测序数据对 COVID-19 结果预测的分析组合进行基准测试。
Brief Bioinform. 2023 May 19;24(3). doi: 10.1093/bib/bbad159.
CSID:一种用于增强临床诊断的新型多模态图像融合算法。
Diagnostics (Basel). 2020 Nov 5;10(11):904. doi: 10.3390/diagnostics10110904.
4
Image-based classification of plant genus and family for trained and untrained plant species.基于图像的植物属和科分类,适用于训练和未训练的植物物种。
BMC Bioinformatics. 2019 Jan 3;20(1):4. doi: 10.1186/s12859-018-2474-x.
5
Multi-Organ Plant Classification Based on Convolutional and Recurrent Neural Networks.基于卷积和循环神经网络的多器官植物分类。
IEEE Trans Image Process. 2018 Sep;27(9):4287-4301. doi: 10.1109/TIP.2018.2836321.
6
Automated plant species identification-Trends and future directions.自动化植物物种鉴定——趋势与未来方向。
PLoS Comput Biol. 2018 Apr 5;14(4):e1005993. doi: 10.1371/journal.pcbi.1005993. eCollection 2018 Apr.
7
Ant genera identification using an ensemble of convolutional neural networks.使用卷积神经网络集成进行蚁属鉴定。
PLoS One. 2018 Jan 31;13(1):e0192011. doi: 10.1371/journal.pone.0192011. eCollection 2018.
8
Pulmonary Nodule Detection in CT Images: False Positive Reduction Using Multi-View Convolutional Networks.CT 图像中的肺结节检测:使用多视图卷积网络减少假阳性。
IEEE Trans Med Imaging. 2016 May;35(5):1160-1169. doi: 10.1109/TMI.2016.2536809. Epub 2016 Mar 1.
9
Deep learning.深度学习。
Nature. 2015 May 28;521(7553):436-44. doi: 10.1038/nature14539.