• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种高通量筛选方法,用于发现具有良好生物学启发的视觉表示形式。

A high-throughput screening approach to discovering good forms of biologically inspired visual representation.

机构信息

McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, Massachussetts, USA.

出版信息

PLoS Comput Biol. 2009 Nov;5(11):e1000579. doi: 10.1371/journal.pcbi.1000579. Epub 2009 Nov 26.

DOI:10.1371/journal.pcbi.1000579
PMID:19956750
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2775908/
Abstract

While many models of biological object recognition share a common set of "broad-stroke" properties, the performance of any one model depends strongly on the choice of parameters in a particular instantiation of that model--e.g., the number of units per layer, the size of pooling kernels, exponents in normalization operations, etc. Since the number of such parameters (explicit or implicit) is typically large and the computational cost of evaluating one particular parameter set is high, the space of possible model instantiations goes largely unexplored. Thus, when a model fails to approach the abilities of biological visual systems, we are left uncertain whether this failure is because we are missing a fundamental idea or because the correct "parts" have not been tuned correctly, assembled at sufficient scale, or provided with enough training. Here, we present a high-throughput approach to the exploration of such parameter sets, leveraging recent advances in stream processing hardware (high-end NVIDIA graphic cards and the PlayStation 3's IBM Cell Processor). In analogy to high-throughput screening approaches in molecular biology and genetics, we explored thousands of potential network architectures and parameter instantiations, screening those that show promising object recognition performance for further analysis. We show that this approach can yield significant, reproducible gains in performance across an array of basic object recognition tasks, consistently outperforming a variety of state-of-the-art purpose-built vision systems from the literature. As the scale of available computational power continues to expand, we argue that this approach has the potential to greatly accelerate progress in both artificial vision and our understanding of the computational underpinning of biological vision.

摘要

虽然许多生物目标识别模型都具有一组共同的“粗线条”特性,但任何一个模型的性能都强烈依赖于该模型特定实例中参数的选择——例如,每层的单元数量、池化核的大小、归一化操作中的指数等。由于此类参数(显式或隐式)的数量通常很大,并且评估一个特定参数集的计算成本很高,因此可能的模型实例空间在很大程度上未被探索。因此,当模型未能达到生物视觉系统的能力时,我们不确定这种失败是因为我们缺少一个基本思想,还是因为正确的“部分”没有被正确调整、以足够的规模组装,或者没有得到足够的训练。在这里,我们提出了一种利用流处理硬件(高端 NVIDIA 显卡和 PlayStation 3 的 IBM Cell 处理器)探索此类参数集的高通量方法。类似于分子生物学和遗传学中的高通量筛选方法,我们探索了数千种潜在的网络架构和参数实例,筛选出那些在对象识别性能方面表现出前景的架构和参数实例进行进一步分析。我们表明,这种方法可以在一系列基本对象识别任务中产生显著的、可重复的性能提升,始终优于各种来自文献的最先进的专用视觉系统。随着可用计算能力的规模继续扩大,我们认为这种方法有可能大大加快人工视觉和我们对生物视觉计算基础的理解的进展。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15f9/2775908/b8d235334d7a/pcbi.1000579.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15f9/2775908/5a7cbfa056a3/pcbi.1000579.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15f9/2775908/4133b92a7a39/pcbi.1000579.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15f9/2775908/090917cb19bf/pcbi.1000579.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15f9/2775908/5d0f65cb3c17/pcbi.1000579.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15f9/2775908/fa921b4ec523/pcbi.1000579.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15f9/2775908/63d979b0db90/pcbi.1000579.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15f9/2775908/00ca5d60ae3e/pcbi.1000579.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15f9/2775908/b8d235334d7a/pcbi.1000579.g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15f9/2775908/5a7cbfa056a3/pcbi.1000579.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15f9/2775908/4133b92a7a39/pcbi.1000579.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15f9/2775908/090917cb19bf/pcbi.1000579.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15f9/2775908/5d0f65cb3c17/pcbi.1000579.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15f9/2775908/fa921b4ec523/pcbi.1000579.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15f9/2775908/63d979b0db90/pcbi.1000579.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15f9/2775908/00ca5d60ae3e/pcbi.1000579.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/15f9/2775908/b8d235334d7a/pcbi.1000579.g008.jpg

相似文献

1
A high-throughput screening approach to discovering good forms of biologically inspired visual representation.一种高通量筛选方法,用于发现具有良好生物学启发的视觉表示形式。
PLoS Comput Biol. 2009 Nov;5(11):e1000579. doi: 10.1371/journal.pcbi.1000579. Epub 2009 Nov 26.
2
Robust object recognition with cortex-like mechanisms.具有类皮质机制的稳健目标识别
IEEE Trans Pattern Anal Mach Intell. 2007 Mar;29(3):411-26. doi: 10.1109/TPAMI.2007.56.
3
Object detection through search with a foveated visual system.通过具有中央凹视觉系统的搜索进行目标检测。
PLoS Comput Biol. 2017 Oct 9;13(10):e1005743. doi: 10.1371/journal.pcbi.1005743. eCollection 2017 Oct.
4
Real-time unconstrained object recognition: a processing pipeline based on the mammalian visual system.实时无约束目标识别:一种基于哺乳动物视觉系统的处理流程
IEEE Pulse. 2012 Mar;3(2):53-6. doi: 10.1109/MPUL.2011.2181025.
5
Why is real-world visual object recognition hard?为什么现实世界中的视觉物体识别很难?
PLoS Comput Biol. 2008 Jan;4(1):e27. doi: 10.1371/journal.pcbi.0040027.
6
Computational object recognition: a biologically motivated approach.计算目标识别:一种受生物启发的方法。
Biol Cybern. 2009 Jan;100(1):59-79. doi: 10.1007/s00422-008-0281-6. Epub 2008 Dec 17.
7
Improved object recognition using neural networks trained to mimic the brain's statistical properties.利用模仿大脑统计特性的神经网络来提高物体识别能力。
Neural Netw. 2020 Nov;131:103-114. doi: 10.1016/j.neunet.2020.07.013. Epub 2020 Jul 29.
8
Assessment of bioinspired models for pattern recognition in biomimetic systems.用于仿生系统中模式识别的仿生模型评估。
Bioinspir Biomim. 2008 Mar;3:016004. doi: 10.1088/1748-3182/3/1/016004. Epub 2008 Mar 10.
9
A visual-attention model using Earth Mover's Distance-based saliency measurement and nonlinear feature combination.基于 Earth Mover's Distance 的显著度测量和非线性特征组合的视觉注意模型。
IEEE Trans Pattern Anal Mach Intell. 2013 Feb;35(2):314-28. doi: 10.1109/TPAMI.2012.119.
10
The effect of nonlinear human visual system components on performance of a channelized Hotelling observer in structured backgrounds.非线性人类视觉系统组件对结构化背景下通道化霍特林观察者性能的影响。
IEEE Trans Med Imaging. 2006 Oct;25(10):1348-62. doi: 10.1109/tmi.2006.880681.

引用本文的文献

1
A tale of two lexica: Investigating computational pressures on word representation with neural networks.两部词典的故事:用神经网络研究单词表征的计算压力
Front Artif Intell. 2023 Mar 27;6:1062230. doi: 10.3389/frai.2023.1062230. eCollection 2023.
2
Methodology for Neural Network-Based Material Card Calibration Using LS-DYNA Considering Failure with GISSMO.基于神经网络的材料卡片校准方法,使用LS-DYNA并结合GISSMO考虑失效情况。
Materials (Basel). 2022 Jan 15;15(2):643. doi: 10.3390/ma15020643.
3
Natural Image Reconstruction From fMRI Using Deep Learning: A Survey.

本文引用的文献

1
Learning Invariance from Transformation Sequences.从变换序列中学习不变性。
Neural Comput. 1991 Summer;3(2):194-200. doi: 10.1162/neco.1991.3.2.194.
2
Invariant object recognition and pose estimation with slow feature analysis.基于慢特征分析的不变目标识别与位姿估计。
Neural Comput. 2011 Sep;23(9):2289-323. doi: 10.1162/NECO_a_00171. Epub 2011 Jun 14.
3
Multi-PIE.多姿态、光照和表情数据库
使用深度学习从功能磁共振成像进行自然图像重建:一项综述。
Front Neurosci. 2021 Dec 20;15:795488. doi: 10.3389/fnins.2021.795488. eCollection 2021.
4
Face detection in untrained deep neural networks.未训练的深度神经网络中的人脸检测。
Nat Commun. 2021 Dec 16;12(1):7328. doi: 10.1038/s41467-021-27606-9.
5
Transfer learning in medical image segmentation: New insights from analysis of the dynamics of model parameters and learned representations.迁移学习在医学图像分割中的应用:基于模型参数和学习表示动态分析的新见解。
Artif Intell Med. 2021 Jun;116:102078. doi: 10.1016/j.artmed.2021.102078. Epub 2021 Apr 23.
6
Survey of Image Processing Techniques for Brain Pathology Diagnosis: Challenges and Opportunities.脑病理学诊断的图像处理技术综述:挑战与机遇
Front Robot AI. 2018 Nov 2;5:120. doi: 10.3389/frobt.2018.00120. eCollection 2018.
7
Artificial Neural Networks-Based Material Parameter Identification for Numerical Simulations of Additively Manufactured Parts by Material Extrusion.基于人工神经网络的材料挤压增材制造零件数值模拟材料参数识别
Polymers (Basel). 2020 Dec 10;12(12):2949. doi: 10.3390/polym12122949.
8
Estimating and interpreting nonlinear receptive field of sensory neural responses with deep neural network models.用深度神经网络模型估计和解释感觉神经反应的非线性感受野。
Elife. 2020 Jun 26;9:e53445. doi: 10.7554/eLife.53445.
9
Geometrical structure of perceptual color space: Mental representations and adaptation invariance.感知颜色空间的几何结构:心理表征与适应性不变性。
J Vis. 2019 Oct 1;19(12):1. doi: 10.1167/19.12.1.
10
Automatic diagnostics of tuberculosis using convolutional neural networks analysis of MODS digital images.利用卷积神经网络分析 MODS 数字图像对结核病进行自动诊断。
PLoS One. 2019 Feb 27;14(2):e0212094. doi: 10.1371/journal.pone.0212094. eCollection 2019.
Proc Int Conf Autom Face Gesture Recognit. 2010 May 1;28(5):807-813. doi: 10.1016/j.imavis.2009.08.002.
4
Unsupervised natural experience rapidly alters invariant object representation in visual cortex.无监督的自然体验会迅速改变视觉皮层中不变物体的表征。
Science. 2008 Sep 12;321(5895):1502-7. doi: 10.1126/science.1160028.
5
Evaluation of Face Datasets as Tools for Assessing the Performance of Face Recognition Methods.将面部数据集作为评估人脸识别方法性能工具的评估
Int J Comput Vis. 2008;79(3):225-230. doi: 10.1007/s11263-008-0143-7.
6
Why is real-world visual object recognition hard?为什么现实世界中的视觉物体识别很难?
PLoS Comput Biol. 2008 Jan;4(1):e27. doi: 10.1371/journal.pcbi.0040027.
7
Untangling invariant object recognition.解开不变物体识别之谜。
Trends Cogn Sci. 2007 Aug;11(8):333-41. doi: 10.1016/j.tics.2007.06.010. Epub 2007 Jul 16.
8
Robust object recognition with cortex-like mechanisms.具有类皮质机制的稳健目标识别
IEEE Trans Pattern Anal Mach Intell. 2007 Mar;29(3):411-26. doi: 10.1109/TPAMI.2007.56.
9
'Breaking' position-invariant object recognition.“突破性”位置不变目标识别。
Nat Neurosci. 2005 Sep;8(9):1145-7. doi: 10.1038/nn1519. Epub 2005 Aug 7.
10
Learning viewpoint invariant object representations using a temporal coherence principle.利用时间相干原理学习视角不变的物体表示。
Biol Cybern. 2005 Jul;93(1):79-90. doi: 10.1007/s00422-005-0585-8. Epub 2005 Jul 13.