McGovern Institute for Brain Research, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA.
PLoS Comput Biol. 2009 Nov;5(11):e1000579. doi: 10.1371/journal.pcbi.1000579. Epub 2009 Nov 26.
While many models of biological object recognition share a common set of "broad-stroke" properties, the performance of any one model depends strongly on the choice of parameters in a particular instantiation of that model--e.g., the number of units per layer, the size of pooling kernels, exponents in normalization operations, etc. Since the number of such parameters (explicit or implicit) is typically large and the computational cost of evaluating one particular parameter set is high, the space of possible model instantiations goes largely unexplored. Thus, when a model fails to approach the abilities of biological visual systems, we are left uncertain whether this failure is because we are missing a fundamental idea or because the correct "parts" have not been tuned correctly, assembled at sufficient scale, or provided with enough training. Here, we present a high-throughput approach to the exploration of such parameter sets, leveraging recent advances in stream processing hardware (high-end NVIDIA graphics cards and the PlayStation 3's IBM Cell Processor). In analogy to high-throughput screening approaches in molecular biology and genetics, we explored thousands of potential network architectures and parameter instantiations, screening those that show promising object recognition performance for further analysis. We show that this approach can yield significant, reproducible gains in performance across an array of basic object recognition tasks, consistently outperforming a variety of state-of-the-art purpose-built vision systems from the literature. As the scale of available computational power continues to expand, we argue that this approach has the potential to greatly accelerate progress in both artificial vision and our understanding of the computational underpinnings of biological vision.
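To make the screening idea concrete, the following is a minimal Python sketch of the kind of loop the abstract describes: draw many random instantiations from a space of model parameters, score each on an object recognition screening task, and keep the top performers for further analysis. The parameter names (n_filters_per_layer, pool_kernel_size, normalization_exponent, n_layers), the candidate counts, and the evaluate_model stand-in are illustrative assumptions, not the paper's actual search space or its GPU/Cell-accelerated evaluation code.

import random

# Hypothetical space of free parameters for a multi-layer visual model
# (names and ranges are placeholders, not the ones used in the paper).
PARAM_SPACE = {
    "n_filters_per_layer": [16, 32, 64, 128, 256],
    "pool_kernel_size": [3, 5, 7, 9],
    "normalization_exponent": [1.0, 1.5, 2.0],
    "n_layers": [2, 3],
}

def sample_model_params(space):
    # Draw one random instantiation of the model's free parameters.
    return {name: random.choice(values) for name, values in space.items()}

def evaluate_model(params):
    # Stand-in for the expensive step: build a model with these parameters,
    # run it (e.g., on stream processing hardware) against a screening
    # object recognition task, and return a performance score.
    # A random score is used here only to keep the sketch runnable.
    return random.random()

def screen(n_candidates=5000, n_keep=50):
    # Evaluate many random instantiations and keep the top performers
    # for further analysis on independent validation tasks.
    scored = []
    for _ in range(n_candidates):
        params = sample_model_params(PARAM_SPACE)
        scored.append((evaluate_model(params), params))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:n_keep]

if __name__ == "__main__":
    for score, params in screen(n_candidates=100, n_keep=5):
        print(f"{score:.3f}  {params}")

In this framing, the contribution of the hardware (GPUs, Cell processors) is simply to make evaluate_model cheap enough that thousands of candidates can be screened, in the same way that automation makes high-throughput screening practical in molecular biology.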