• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从视频和新兴属性中学习以对象为中心的动态模式。

Learning Object-Centric Dynamic Modes from Video and Emerging Properties.

作者信息

Massague Armand Comas, Fernandez-Lopez Christian, Ghimire Sandesh, Li Haolin, Sznaier Mario, Camps Octavia

机构信息

Northeastern University.

出版信息

Proc Mach Learn Res. 2023;5:745-769.

PMID:40893485
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12395393/
Abstract

One of the long-term objectives of Machine Learning is to endow machines with the capacity of structuring and interpreting the world as we do. This is particularly challenging in scenes involving time series, such as video sequences, since seemingly different data can correspond to the same underlying dynamics. Recent approaches seek to decompose video sequences into their composing objects, attributes and dynamics in a self-supervised fashion, thus simplifying the task of learning suitable features that can be used to analyze each component. While existing methods can successfully disentangle dynamics from other components, there have been relatively few efforts in learning parsimonious representations of these underlying dynamics. In this paper, motivated by recent advances in non-linear identification, we propose a method to decompose a video into moving objects, their attributes and the dynamic modes of their trajectories. We model video dynamics as the output of a Koopman operator to be learned from the available data. In this context, the dynamic information contained in the scene is encapsulated in the eigenvalues and eigenvectors of the Koopman operator, providing an interpretable and parsimonious representation. We show that such decomposition can be used for instance to perform video analytics, predict future frames or generate synthetic video. We test our framework in a variety of datasets that encompass different dynamic scenarios, while illustrating the novel features that emerge from our dynamic modes decomposition: Video dynamics interpretation and user manipulation at test-time. We successfully forecast challenging object trajectories from pixels, achieving competitive performance while drawing useful insights.

摘要

机器学习的长期目标之一是赋予机器像我们人类一样构建和解释世界的能力。这在涉及时间序列的场景中,如视频序列,尤其具有挑战性,因为看似不同的数据可能对应相同的潜在动态。最近的方法试图以自监督的方式将视频序列分解为其组成对象、属性和动态,从而简化学习可用于分析每个组件的合适特征的任务。虽然现有方法可以成功地将动态与其他组件分离,但在学习这些潜在动态的简洁表示方面相对较少有人努力。在本文中,受非线性识别领域最近进展的启发,我们提出了一种将视频分解为移动对象、其属性及其轨迹动态模式的方法。我们将视频动态建模为一个要从可用数据中学习的柯普曼算子的输出。在这种情况下,场景中包含的动态信息被封装在柯普曼算子的特征值和特征向量中,提供了一种可解释且简洁的表示。我们表明,这种分解例如可用于执行视频分析、预测未来帧或生成合成视频。我们在包含不同动态场景的各种数据集中测试了我们的框架,同时展示了从我们的动态模式分解中出现的新颖特征:测试时的视频动态解释和用户操作。我们成功地从像素预测具有挑战性的对象轨迹,在获得有用见解的同时实现了有竞争力的性能。

相似文献

1
Learning Object-Centric Dynamic Modes from Video and Emerging Properties.从视频和新兴属性中学习以对象为中心的动态模式。
Proc Mach Learn Res. 2023;5:745-769.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Short-Term Memory Impairment短期记忆障碍
4
The Black Book of Psychotropic Dosing and Monitoring.《精神药物剂量与监测黑皮书》
Psychopharmacol Bull. 2024 Jul 8;54(3):8-59.
5
Assessing the comparative effects of interventions in COPD: a tutorial on network meta-analysis for clinicians.评估慢性阻塞性肺疾病干预措施的比较效果:面向临床医生的网状Meta分析教程
Respir Res. 2024 Dec 21;25(1):438. doi: 10.1186/s12931-024-03056-x.
6
Healthcare workers' informal uses of mobile phones and other mobile devices to support their work: a qualitative evidence synthesis.医护人员非正规使用手机和其他移动设备来支持工作:定性证据综合评价。
Cochrane Database Syst Rev. 2024 Aug 27;8(8):CD015705. doi: 10.1002/14651858.CD015705.pub2.
7
A Comprehensive and Modality Diverse Cervical Spine and Back Musculoskeletal Physical Exam Curriculum for Medical Students.面向医学生的全面且多模态的颈椎和背部肌肉骨骼物理检查课程
J Educ Teach Emerg Med. 2025 Jul 31;10(3):SG1-SG8. doi: 10.21980/J8RQ0N. eCollection 2025 Jul.
8
Immunogenicity and seroefficacy of pneumococcal conjugate vaccines: a systematic review and network meta-analysis.肺炎球菌结合疫苗的免疫原性和血清效力:系统评价和网络荟萃分析。
Health Technol Assess. 2024 Jul;28(34):1-109. doi: 10.3310/YWHA3079.
9
Sexual Harassment and Prevention Training性骚扰与预防培训
10
Survivor, family and professional experiences of psychosocial interventions for sexual abuse and violence: a qualitative evidence synthesis.性虐待和暴力的心理社会干预的幸存者、家庭和专业人员的经验:定性证据综合。
Cochrane Database Syst Rev. 2022 Oct 4;10(10):CD013648. doi: 10.1002/14651858.CD013648.pub2.

本文引用的文献

1
Generalizing Koopman Theory to Allow for Inputs and Control.将库普曼理论进行推广以纳入输入和控制因素。
SIAM J Appl Dyn Syst. 2018;17(1):909-930. doi: 10.1137/16M1062296. Epub 2018 Mar 27.
2
Graph dynamical networks for unsupervised learning of atomic scale dynamics in materials.用于材料中原子尺度动力学无监督学习的图动态网络。
Nat Commun. 2019 Jun 17;10(1):2667. doi: 10.1038/s41467-019-10663-6.
3
Deep learning for universal linear embeddings of nonlinear dynamics.深度学习用于非线性动力学的通用线性嵌入。
Nat Commun. 2018 Nov 23;9(1):4950. doi: 10.1038/s41467-018-07210-0.
4
VAMPnets for deep learning of molecular kinetics.用于分子动力学深度学习的VAMP网络。
Nat Commun. 2018 Jan 2;9(1):5. doi: 10.1038/s41467-017-02388-1.
5
Chaos as an intermittently forced linear system.作为间歇强迫线性系统的混沌
Nat Commun. 2017 May 30;8(1):19. doi: 10.1038/s41467-017-00030-8.
6
Hamiltonian Systems and Transformation in Hilbert Space.哈密顿系统与希尔伯特空间中的变换
Proc Natl Acad Sci U S A. 1931 May;17(5):315-8. doi: 10.1073/pnas.17.5.315.