• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

拓扑学方法解决目标分割和跟踪问题。

A topological solution to object segmentation and tracking.

机构信息

OpticArray Technologies, Rockville, MD 20850.

Department of Molecular & Cell Biology, Helen Wills Neuroscience Institute, Berkeley, CA 94704.

出版信息

Proc Natl Acad Sci U S A. 2022 Oct 11;119(41):e2204248119. doi: 10.1073/pnas.2204248119. Epub 2022 Oct 6.

DOI:10.1073/pnas.2204248119
PMID:36201537
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9564096/
Abstract

The world is composed of objects, the ground, and the sky. Visual perception of objects requires solving two fundamental challenges: 1) segmenting visual input into discrete units and 2) tracking identities of these units despite appearance changes due to object deformation, changing perspective, and dynamic occlusion. Current computer vision approaches to segmentation and tracking that approach human performance all require learning, raising the question, Can objects be segmented and tracked without learning? Here, we show that the mathematical structure of light rays reflected from environment surfaces yields a natural representation of persistent surfaces, and this surface representation provides a solution to both the segmentation and tracking problems. We describe how to generate this surface representation from continuous visual input and demonstrate that our approach can segment and invariantly track objects in cluttered synthetic video despite severe appearance changes, without requiring learning.

摘要

世界由物体、地面和天空组成。对物体的视觉感知需要解决两个基本挑战:1)将视觉输入分割成离散的单元;2)即使由于物体变形、视角变化和动态遮挡导致外观发生变化,也要跟踪这些单元的身份。目前,接近人类表现的计算机视觉分割和跟踪方法都需要学习,这就提出了一个问题:是否可以不通过学习来分割和跟踪物体?在这里,我们表明,从环境表面反射的光线的数学结构产生了持久表面的自然表示,并且该表面表示为分割和跟踪问题提供了一个解决方案。我们描述了如何从连续的视觉输入中生成这种表面表示,并证明了我们的方法可以在杂乱的合成视频中分割和不变地跟踪对象,即使外观发生严重变化,也无需学习。

相似文献

1
A topological solution to object segmentation and tracking.拓扑学方法解决目标分割和跟踪问题。
Proc Natl Acad Sci U S A. 2022 Oct 11;119(41):e2204248119. doi: 10.1073/pnas.2204248119. Epub 2022 Oct 6.
2
Visual Object Tracking Using Structured Sparse PCA-Based Appearance Representation and Online Learning.基于结构化稀疏 PCA 的表观表示和在线学习的视觉目标跟踪。
Sensors (Basel). 2018 Oct 18;18(10):3513. doi: 10.3390/s18103513.
3
Object segmentation and reconstruction via an oscillatory neural network: interaction among learning, memory, topological organization and gamma-band synchronization.通过振荡神经网络进行目标分割与重建:学习、记忆、拓扑组织和伽马波段同步之间的相互作用。
Conf Proc IEEE Eng Med Biol Soc. 2006;2006:4953-6. doi: 10.1109/IEMBS.2006.260435.
4
Object segmentation and recovery via neural oscillators implementing the similarity and prior knowledge gestalt rules.通过实现相似性和先验知识格式塔规则的神经振荡器进行目标分割与恢复
Biosystems. 2006 Sep;85(3):201-18. doi: 10.1016/j.biosystems.2006.01.005. Epub 2006 Apr 25.
5
Learning of perceptual grouping for object segmentation on RGB-D data.基于RGB-D数据的目标分割感知分组学习。
J Vis Commun Image Represent. 2014 Jan;25(1):64-73. doi: 10.1016/j.jvcir.2013.04.006.
6
Efficient joint model learning, segmentation and model updating for visual tracking.高效联合模型学习、分割和模型更新的视觉跟踪。
Neural Netw. 2022 Mar;147:175-185. doi: 10.1016/j.neunet.2021.12.018. Epub 2022 Jan 1.
7
Representation in natural and artificial agents: an embodied cognitive science perspective.自然与人工智能体中的表征:一种具身认知科学视角
Z Naturforsch C J Biosci. 1998 Jul-Aug;53(7-8):480-503. doi: 10.1515/znc-1998-7-804.
8
Gamifying Video Object Segmentation.视频对象分割的游戏化。
IEEE Trans Pattern Anal Mach Intell. 2017 Oct;39(10):1942-1958. doi: 10.1109/TPAMI.2016.2610973. Epub 2016 Sep 19.
9
The Neural Dynamics of Attentional Selection in Natural Scenes.自然场景中注意选择的神经动力学
J Neurosci. 2016 Oct 12;36(41):10522-10528. doi: 10.1523/JNEUROSCI.1385-16.2016.
10
Segmentation and tracking multiple objects under occlusion from multiview video.基于多视角视频的遮挡情况下的多个目标的分割与跟踪。
IEEE Trans Image Process. 2011 Nov;20(11):3308-13. doi: 10.1109/TIP.2011.2159228. Epub 2011 Jun 9.

引用本文的文献

1
Ecological Resonance Is Reflected in Human Brain Activity.生态共振反映在人类大脑活动中。
Psychophysiology. 2025 Sep;62(9):e70136. doi: 10.1111/psyp.70136.
2
Orthogonal neural representations support perceptual judgments of natural stimuli.正交神经表征支持对自然刺激的感知判断。
Sci Rep. 2025 Feb 13;15(1):5316. doi: 10.1038/s41598-025-88910-8.
3
Is the impact of spontaneous movements on early visual cortex species specific?自发运动对早期视觉皮层的影响具有物种特异性吗?

本文引用的文献

1
The Mouse Action Recognition System (MARS) software pipeline for automated analysis of social behaviors in mice.鼠标动作识别系统(MARS)软件流水线,用于自动分析小鼠的社交行为。
Elife. 2021 Nov 30;10:e63720. doi: 10.7554/eLife.63720.
2
The macaque face patch system: a turtle's underbelly for the brain.猕猴面区系统:大脑的龟腹。
Nat Rev Neurosci. 2020 Dec;21(12):695-716. doi: 10.1038/s41583-020-00393-w. Epub 2020 Nov 3.
3
Wide-field retinotopy reveals a new visuotopic cluster in macaque posterior parietal cortex.宽场视网膜定位揭示了猕猴后顶叶皮层中的一个新视知觉簇。
Trends Neurosci. 2025 Jan;48(1):7-21. doi: 10.1016/j.tins.2024.11.006. Epub 2024 Dec 18.
4
Figure-ground segmentation based on motion in the archerfish.基于喷水鱼运动的图底分割
Anim Cogn. 2024 Apr 15;27(1):33. doi: 10.1007/s10071-024-01873-7.
5
Mice and primates use distinct strategies for visual segmentation.老鼠和灵长类动物使用不同的策略进行视觉分割。
Elife. 2023 Feb 15;12:e74394. doi: 10.7554/eLife.74394.
6
New Approaches to 3D Vision.三维视觉的新方法。
Philos Trans R Soc Lond B Biol Sci. 2023 Jan 30;378(1869):20210443. doi: 10.1098/rstb.2021.0443. Epub 2022 Dec 13.
7
The reasonable effectiveness of contours in vision.轮廓在视觉中的合理有效性。
Proc Natl Acad Sci U S A. 2022 Nov;119(44):e2215097119. doi: 10.1073/pnas.2215097119. Epub 2022 Oct 20.
Brain Struct Funct. 2020 Nov;225(8):2447-2461. doi: 10.1007/s00429-020-02134-2. Epub 2020 Sep 2.
4
Event-Based Vision: A Survey.基于事件的视觉:综述。
IEEE Trans Pattern Anal Mach Intell. 2022 Jan;44(1):154-180. doi: 10.1109/TPAMI.2020.3008413. Epub 2021 Dec 7.
5
Self-Supervised Visual Feature Learning With Deep Neural Networks: A Survey.基于深度神经网络的自监督视觉特征学习:综述
IEEE Trans Pattern Anal Mach Intell. 2021 Nov;43(11):4037-4058. doi: 10.1109/TPAMI.2020.2992393. Epub 2021 Oct 1.
6
Image-Based 3D Object Reconstruction: State-of-the-Art and Trends in the Deep Learning Era.基于图像的 3D 目标重建:深度学习时代的现状与趋势。
IEEE Trans Pattern Anal Mach Intell. 2021 May;43(5):1578-1604. doi: 10.1109/TPAMI.2019.2954885. Epub 2021 Apr 1.
7
Perceptual straightening of natural videos.自然视频的感知校正。
Nat Neurosci. 2019 Jun;22(6):984-991. doi: 10.1038/s41593-019-0377-4. Epub 2019 Apr 29.
8
Detachable object detection: segmentation and depth ordering from short-baseline video.可分离物体检测:基于短基线视频的分割和深度排序。
IEEE Trans Pattern Anal Mach Intell. 2012 Oct;34(10):1942-51. doi: 10.1109/TPAMI.2011.271.
9
Tracking-Learning-Detection.跟踪-学习-检测。
IEEE Trans Pattern Anal Mach Intell. 2012 Jul;34(7):1409-22. doi: 10.1109/TPAMI.2011.239. Epub 2011 Dec 13.
10
A computational approach to edge detection.一种基于计算的边缘检测方法。
IEEE Trans Pattern Anal Mach Intell. 1986 Jun;8(6):679-98.