标记图核的行为分析。

Labeled Graph Kernel for Behavior Analysis.

出版信息

IEEE Trans Pattern Anal Mach Intell. 2016 Aug;38(8):1640-50. doi: 10.1109/TPAMI.2015.2481404. Epub 2015 Sep 23.

DOI:10.1109/TPAMI.2015.2481404

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4846576/

Abstract

Automatic behavior analysis from video is a major topic in many areas of research, including computer vision, multimedia, robotics, biology, cognitive science, social psychology, psychiatry, and linguistics. Two major problems are of interest when analyzing behavior. First, we wish to automatically categorize observed behaviors into a discrete set of classes (i.e., classification). For example, to determine word production from video sequences in sign language. Second, we wish to understand the relevance of each behavioral feature in achieving this classification (i.e., decoding). For instance, to know which behavior variables are used to discriminate between the words apple and onion in American Sign Language (ASL). The present paper proposes to model behavior using a labeled graph, where the nodes define behavioral features and the edges are labels specifying their order (e.g., before, overlaps, start). In this approach, classification reduces to a simple labeled graph matching. Unfortunately, the complexity of labeled graph matching grows exponentially with the number of categories we wish to represent. Here, we derive a graph kernel to quickly and accurately compute this graph similarity. This approach is very general and can be plugged into any kernel-based classifier. Specifically, we derive a Labeled Graph Support Vector Machine (LGSVM) and a Labeled Graph Logistic Regressor (LGLR) that can be readily employed to discriminate between many actions (e.g., sign language concepts). The derived approach can be readily used for decoding too, yielding invaluable information for the understanding of a problem (e.g., to know how to teach a sign language). The derived algorithms allow us to achieve higher accuracy results than those of state-of-the-art algorithms in a fraction of the time. We show experimental results on a variety of problems and datasets, including multimodal data.

摘要

自动行为分析是计算机视觉、多媒体、机器人、生物、认知科学、社会心理学、精神病学和语言学等许多研究领域的一个主要课题。在分析行为时，有两个主要问题引起了人们的兴趣。首先，我们希望能够自动将观察到的行为分类到离散的类别中（即分类）。例如，从手语视频序列中确定单词的生成。其次，我们希望了解在实现这种分类中每个行为特征的相关性（即解码）。例如，了解在美式手语 (ASL) 中，哪些行为变量被用于区分苹果和洋葱这两个词。本文提出了一种使用标记图来建模行为的方法，其中节点定义行为特征，边则指定其顺序的标签（例如，之前、重叠、开始）。在这种方法中，分类简化为简单的标记图匹配。不幸的是，标记图匹配的复杂度随着我们希望表示的类别数量呈指数增长。在这里，我们推导出一种图核以快速准确地计算这种图相似性。这种方法非常通用，可以插入到任何基于核的分类器中。具体来说，我们推导出一个标记图支持向量机 (LGSVM) 和一个标记图逻辑回归 (LGLR)，它们可以很容易地用于区分许多动作（例如，手语概念）。所得到的方法也可以很容易地用于解码，为理解问题提供宝贵的信息（例如，知道如何教授手语）。所得到的算法允许我们在一小部分时间内获得比最先进算法更高的准确性结果。我们在各种问题和数据集上展示了实验结果，包括多模态数据。

相似文献

Labeled Graph Kernel for Behavior Analysis.标记图核的行为分析。

IEEE Trans Pattern Anal Mach Intell. 2016 Aug;38(8):1640-50. doi: 10.1109/TPAMI.2015.2481404. Epub 2015 Sep 23.

A comparison of graph- and kernel-based -omics data integration algorithms for classifying complex traits.用于复杂性状分类的基于图和核的组学数据整合算法比较。

BMC Bioinformatics. 2017 Dec 6;18(1):539. doi: 10.1186/s12859-017-1982-4.

Classification approach for automatic laparoscopic video database organization.用于自动腹腔镜视频数据库组织的分类方法。

Int J Comput Assist Radiol Surg. 2015 Sep;10(9):1449-60. doi: 10.1007/s11548-015-1183-4. Epub 2015 Apr 7.

GPD: a graph pattern diffusion kernel for accurate graph classification with applications in cheminformatics.GPD：一种图模式扩散核，用于实现化学信息学中具有应用的精确图分类。

IEEE/ACM Trans Comput Biol Bioinform. 2010 Apr-Jun;7(2):197-207. doi: 10.1109/TCBB.2009.80.

A generalized pyramid matching kernel for human action recognition in realistic videos.用于现实视频中人体动作识别的广义金字塔匹配核。

Sensors (Basel). 2013 Oct 24;13(11):14398-416. doi: 10.3390/s131114398.

Learning graph matching.学习图匹配。

IEEE Trans Pattern Anal Mach Intell. 2009 Jun;31(6):1048-58. doi: 10.1109/TPAMI.2009.28.

Approximate Graph Edit Distance in Quadratic Time.二次时间内的近似图编辑距离。

IEEE/ACM Trans Comput Biol Bioinform. 2020 Mar-Apr;17(2):483-494. doi: 10.1109/TCBB.2015.2478463. Epub 2015 Sep 14.

Surgical gesture classification from video and kinematic data.基于视频和运动学数据的外科手势分类。

Med Image Anal. 2013 Oct;17(7):732-45. doi: 10.1016/j.media.2013.04.007. Epub 2013 Apr 28.

A Directed Acyclic Graph-Large Margin Distribution Machine Model for Music Symbol Classification.一种用于音乐符号分类的有向无环图-大间隔分布机模型。

PLoS One. 2016 Mar 17;11(3):e0149688. doi: 10.1371/journal.pone.0149688. eCollection 2016.

Automatic plankton image classification combining multiple view features via multiple kernel learning.基于多核学习的多视角特征融合浮游生物图像自动分类

BMC Bioinformatics. 2017 Dec 28;18(Suppl 16):570. doi: 10.1186/s12859-017-1954-8.

引用本文的文献

Review of Three-Dimensional Human-Computer Interaction with Focus on the Leap Motion Controller.三维人机交互综述，重点关注 Leap Motion 控制器。

Sensors (Basel). 2018 Jul 7;18(7):2194. doi: 10.3390/s18072194.

Computational Models of Face Perception.面部感知的计算模型

Curr Dir Psychol Sci. 2017 Jun;26(3):263-269. doi: 10.1177/0963721417698535. Epub 2017 Jun 14.

本文引用的文献

Spatio-temporal Event Classification using Time-series Kernel based Structured Sparsity.基于时间序列核的结构化稀疏性的时空事件分类

Comput Vis ECCV. 2014;2014:135-140. doi: 10.1007/978-3-319-10593-2_10.

Spontaneous facial expression in unscripted social interactions can be measured automatically.在无脚本的社交互动中，自发的面部表情可以被自动测量。

Behav Res Methods. 2015 Dec;47(4):1136-1147. doi: 10.3758/s13428-014-0536-1.

Compound facial expressions of emotion.复合情绪表情。

Proc Natl Acad Sci U S A. 2014 Apr 15;111(15):E1454-62. doi: 10.1073/pnas.1322355111. Epub 2014 Mar 31.

Discriminant features and temporal structure of nonmanuals in American Sign Language.美国手语中身势语的判别特征和时间结构。

PLoS One. 2014 Feb 6;9(2):e86268. doi: 10.1371/journal.pone.0086268. eCollection 2014.

Modelling and Recognition of the Linguistic Components in American Sign Language.美国手语中语言成分的建模与识别

Image Vis Comput. 2009 Nov 1;27(12):1826-1844. doi: 10.1016/j.imavis.2009.02.005.

The humanID gait challenge problem: data sets, performance, and analysis.人类身份识别步态挑战问题：数据集、性能与分析。

IEEE Trans Pattern Anal Mach Intell. 2005 Feb;27(2):162-77. doi: 10.1109/tpami.2005.39.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验