• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于多媒体事件检测的双层语义表示分析。

Bi-Level Semantic Representation Analysis for Multimedia Event Detection.

出版信息

IEEE Trans Cybern. 2017 May;47(5):1180-1197. doi: 10.1109/TCYB.2016.2539546. Epub 2016 Mar 28.

DOI:10.1109/TCYB.2016.2539546
PMID:28113831
Abstract

Multimedia event detection has been one of the major endeavors in video event analysis. A variety of approaches have been proposed recently to tackle this problem. Among others, using semantic representation has been accredited for its promising performance and desirable ability for human-understandable reasoning. To generate semantic representation, we usually utilize several external image/video archives and apply the concept detectors trained on them to the event videos. Due to the intrinsic difference of these archives, the resulted representation is presumable to have different predicting capabilities for a certain event. Notwithstanding, not much work is available for assessing the efficacy of semantic representation from the source-level. On the other hand, it is plausible to perceive that some concepts are noisy for detecting a specific event. Motivated by these two shortcomings, we propose a bi-level semantic representation analyzing method. Regarding source-level, our method learns weights of semantic representation attained from different multimedia archives. Meanwhile, it restrains the negative influence of noisy or irrelevant concepts in the overall concept-level. In addition, we particularly focus on efficient multimedia event detection with few positive examples, which is highly appreciated in the real-world scenario. We perform extensive experiments on the challenging TRECVID MED 2013 and 2014 datasets with encouraging results that validate the efficacy of our proposed approach.

摘要

多媒体事件检测一直是视频事件分析的主要工作之一。最近已经提出了各种方法来解决这个问题。其中,使用语义表示因其有希望的性能和可理解的人类推理能力而受到赞誉。为了生成语义表示,我们通常使用几个外部的图像/视频档案,并将在这些档案上训练的概念检测器应用于事件视频。由于这些档案的内在差异,所得到的表示对于某个事件可能具有不同的预测能力。然而,很少有工作可用于从源级别评估语义表示的效果。另一方面,可以合理地认为某些概念对于检测特定事件来说是嘈杂的。受这两个缺点的启发,我们提出了一种两级语义表示分析方法。在源级别,我们的方法学习从不同多媒体档案获得的语义表示的权重。同时,它限制了整体概念级别中噪声或不相关概念的负面影响。此外,我们特别关注具有少量正例的高效多媒体事件检测,这在实际场景中是非常需要的。我们在具有挑战性的 TRECVID MED 2013 和 2014 数据集上进行了广泛的实验,结果令人鼓舞,验证了我们提出的方法的有效性。

相似文献

1
Bi-Level Semantic Representation Analysis for Multimedia Event Detection.用于多媒体事件检测的双层语义表示分析。
IEEE Trans Cybern. 2017 May;47(5):1180-1197. doi: 10.1109/TCYB.2016.2539546. Epub 2016 Mar 28.
2
Joint Attributes and Event Analysis for Multimedia Event Detection.用于多媒体事件检测的联合属性与事件分析
IEEE Trans Neural Netw Learn Syst. 2018 Jul;29(7):2921-2930. doi: 10.1109/TNNLS.2017.2709308. Epub 2017 Jun 15.
3
Event oriented dictionary learning for complex event detection.面向事件的字典学习用于复杂事件检测。
IEEE Trans Image Process. 2015 Jun;24(6):1867-78. doi: 10.1109/TIP.2015.2413294. Epub 2015 Mar 16.
4
Video2vec Embeddings Recognize Events When Examples Are Scarce.Video2vec 嵌入识别在例子稀缺时的事件。
IEEE Trans Pattern Anal Mach Intell. 2017 Oct;39(10):2089-2103. doi: 10.1109/TPAMI.2016.2627563. Epub 2016 Nov 10.
5
Knowledge Adaptation with Partially Shared Features for Event Detection Using Few Exemplars.基于部分共享特征的知识自适应,使用少量样本进行事件检测。
IEEE Trans Pattern Anal Mach Intell. 2014 Sep;36(9):1789-802. doi: 10.1109/TPAMI.2014.2306419.
6
The semantic pathfinder: using an authoring metaphor for generic multimedia indexing.语义路径查找器:使用创作隐喻进行通用多媒体索引编制。
IEEE Trans Pattern Anal Mach Intell. 2006 Oct;28(10):1678-89. doi: 10.1109/TPAMI.2006.212.
7
Hierarchical Latent Concept Discovery for Video Event Detection.分层潜在概念发现的视频事件检测。
IEEE Trans Image Process. 2017 May;26(5):2149-2162. doi: 10.1109/TIP.2017.2670782. Epub 2017 Feb 17.
8
Secure access control and large scale robust representation for online multimedia event detection.用于在线多媒体事件检测的安全访问控制和大规模鲁棒表示
ScientificWorldJournal. 2014;2014:219732. doi: 10.1155/2014/219732. Epub 2014 Jul 22.
9
Semantic Pooling for Complex Event Analysis in Untrimmed Videos.非修剪视频中复杂事件分析的语义池化。
IEEE Trans Pattern Anal Mach Intell. 2017 Aug;39(8):1617-1632. doi: 10.1109/TPAMI.2016.2608901. Epub 2016 Sep 13.
10
Web Video Event Recognition by Semantic Analysis From Ubiquitous Documents.基于无处不在的文档的语义分析的网络视频事件识别。
IEEE Trans Image Process. 2016 Dec;25(12):5689-5701. doi: 10.1109/TIP.2016.2614136. Epub 2016 Sep 27.

引用本文的文献

1
Color Image Retrieval Method Using Low Dimensional Salient Visual Feature Descriptors for IoT Applications.基于低维显著视觉特征描述子的物联网应用彩色图像检索方法。
Comput Intell Neurosci. 2023 Feb 23;2023:6257573. doi: 10.1155/2023/6257573. eCollection 2023.
2
A Dark Target Detection Method Based on the Adjacency Effect: A Case Study on Crack Detection.一种基于邻接效应的暗目标检测方法:以裂缝检测为例
Sensors (Basel). 2019 Jun 25;19(12):2829. doi: 10.3390/s19122829.
3
Low-Rank Graph-Regularized Structured Sparse Regression for Identifying Genetic Biomarkers.
用于识别遗传生物标志物的低秩图正则化结构化稀疏回归
IEEE Trans Big Data. 2017 Oct-Dec;3(4):405-414. doi: 10.1109/TBDATA.2017.2735991. Epub 2017 Aug 4.
4
Predicting Alzheimer's Disease Cognitive Assessment via Robust Low-Rank Structured Sparse Model.通过稳健低秩结构化稀疏模型预测阿尔茨海默病认知评估
IJCAI (U S). 2017 Aug;2017:3880-3886. doi: 10.24963/ijcai.2017/542.