• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于视频的塑料袋抓取动作识别:一个新的视频数据集及基线模型的比较研究

Video-Based Plastic Bag Grabbing Action Recognition: A New Video Dataset and a Comparative Study of Baseline Models.

作者信息

Low Pei Jing, Ng Bo Yan, Mahzan Nur Insyirah, Tian Jing, Leung Cheung-Chi

机构信息

NUS-ISS, National University of Singapore, Singapore 119615, Singapore.

出版信息

Sensors (Basel). 2025 Jan 4;25(1):255. doi: 10.3390/s25010255.

DOI:10.3390/s25010255
PMID:39797046
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11723439/
Abstract

Recognizing the action of plastic bag taking from CCTV video footage represents a highly specialized and niche challenge within the broader domain of action video classification. To address this challenge, our paper introduces a novel benchmark video dataset specifically curated for the task of identifying the action of grabbing a plastic bag. Additionally, we propose and evaluate three distinct baseline approaches. The first approach employs a combination of handcrafted feature extraction techniques and a sequential classification model to analyze motion and object-related features. The second approach leverages a multiple-frame (CNN) to exploit temporal and spatial patterns in the video data. The third approach explores a 3D CNN-based deep learning model, which is capable of processing video data as volumetric inputs. To assess the performance of these methods, we conduct a comprehensive comparative study, demonstrating the strengths and limitations of each approach within this specialized domain.

摘要

从央视视频片段中识别拿取塑料袋的动作,在更广泛的动作视频分类领域中是一项高度专业化且细分的挑战。为应对这一挑战,我们的论文引入了一个专门为识别抓取塑料袋动作任务精心策划的新型基准视频数据集。此外,我们提出并评估了三种不同的基线方法。第一种方法采用手工特征提取技术和序列分类模型的组合来分析运动和与物体相关的特征。第二种方法利用多帧卷积神经网络(CNN)来挖掘视频数据中的时空模式。第三种方法探索基于3D CNN的深度学习模型,该模型能够将视频数据作为体数据输入进行处理。为评估这些方法的性能,我们进行了全面的比较研究,展示了每种方法在这个专业领域的优势和局限性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f9e9/11723439/0cfa886ff32b/sensors-25-00255-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f9e9/11723439/5d27b4595cba/sensors-25-00255-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f9e9/11723439/d4ed4f13dd53/sensors-25-00255-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f9e9/11723439/9a83568b89ce/sensors-25-00255-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f9e9/11723439/a3a3c81fa4d5/sensors-25-00255-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f9e9/11723439/60bdb555894a/sensors-25-00255-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f9e9/11723439/1a1d0ef9552c/sensors-25-00255-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f9e9/11723439/0cfa886ff32b/sensors-25-00255-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f9e9/11723439/5d27b4595cba/sensors-25-00255-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f9e9/11723439/d4ed4f13dd53/sensors-25-00255-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f9e9/11723439/9a83568b89ce/sensors-25-00255-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f9e9/11723439/a3a3c81fa4d5/sensors-25-00255-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f9e9/11723439/60bdb555894a/sensors-25-00255-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f9e9/11723439/1a1d0ef9552c/sensors-25-00255-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f9e9/11723439/0cfa886ff32b/sensors-25-00255-g007.jpg

相似文献

1
Video-Based Plastic Bag Grabbing Action Recognition: A New Video Dataset and a Comparative Study of Baseline Models.基于视频的塑料袋抓取动作识别:一个新的视频数据集及基线模型的比较研究
Sensors (Basel). 2025 Jan 4;25(1):255. doi: 10.3390/s25010255.
2
Introducing a novel dataset for facial emotion recognition and demonstrating significant enhancements in deep learning performance through pre-processing techniques.介绍一个用于面部表情识别的新数据集,并通过预处理技术展示深度学习性能的显著提升。
Heliyon. 2024 Oct 4;10(20):e38913. doi: 10.1016/j.heliyon.2024.e38913. eCollection 2024 Oct 30.
3
Automated classification of nasal polyps in endoscopy video-frames using handcrafted and CNN features.使用手工制作和 CNN 特征对内镜视频中的鼻息肉进行自动分类。
Comput Biol Med. 2022 Aug;147:105725. doi: 10.1016/j.compbiomed.2022.105725. Epub 2022 Jun 13.
4
Body and Hand-Object ROI-Based Behavior Recognition Using Deep Learning.基于身体和手部-物体 ROI 的深度学习行为识别。
Sensors (Basel). 2021 Mar 6;21(5):1838. doi: 10.3390/s21051838.
5
CNN-LSTM Model for Recognizing Video-Recorded Actions Performed in a Traditional Chinese Exercise.用于识别传统中国功法中视频记录动作的 CNN-LSTM 模型。
IEEE J Transl Eng Health Med. 2023 Jun 2;11:351-359. doi: 10.1109/JTEHM.2023.3282245. eCollection 2023.
6
A Deep Sequence Learning Framework for Action Recognition in Small-Scale Depth Video Dataset.用于小规模深度视频数据集动作识别的深度序列学习框架。
Sensors (Basel). 2022 Sep 9;22(18):6841. doi: 10.3390/s22186841.
7
Fusion of Appearance and Motion Features for Daily Activity Recognition from Egocentric Perspective.从自我中心视角融合外观与运动特征进行日常活动识别
Sensors (Basel). 2023 Jul 30;23(15):6804. doi: 10.3390/s23156804.
8
Surgical gesture classification from video and kinematic data.基于视频和运动学数据的外科手势分类。
Med Image Anal. 2013 Oct;17(7):732-45. doi: 10.1016/j.media.2013.04.007. Epub 2013 Apr 28.
9
Analysis of Movement and Activities of Handball Players Using Deep Neural Networks.使用深度神经网络对手球运动员的运动和活动进行分析。
J Imaging. 2023 Apr 13;9(4):80. doi: 10.3390/jimaging9040080.
10
SVM directed machine learning classifier for human action recognition network.用于人体动作识别网络的支持向量机导向的机器学习分类器。
Sci Rep. 2025 Jan 3;15(1):672. doi: 10.1038/s41598-024-83529-7.

本文引用的文献

1
OoD-Control: Generalizing Control in Unseen Environments.未知环境控制:在未见环境中泛化控制
IEEE Trans Pattern Anal Mach Intell. 2024 Nov;46(11):7421-7433. doi: 10.1109/TPAMI.2024.3395484. Epub 2024 Oct 3.
2
Video Transformers: A Survey.视频Transformer综述
IEEE Trans Pattern Anal Mach Intell. 2023 Nov;45(11):12922-12943. doi: 10.1109/TPAMI.2023.3243465. Epub 2023 Oct 3.
3
A Computer Vision-Based Yoga Pose Grading Approach Using Contrastive Skeleton Feature Representations.一种基于计算机视觉的瑜伽体式分级方法,使用对比骨骼特征表示
Healthcare (Basel). 2021 Dec 25;10(1):36. doi: 10.3390/healthcare10010036.
4
Using Social Signals to Predict Shoplifting: A Transparent Approach to a Sensitive Activity Analysis Problem.利用社交信号预测商店行窃:一种敏感活动分析问题的透明方法。
Sensors (Basel). 2021 Oct 13;21(20):6812. doi: 10.3390/s21206812.
5
Analysis of the Hands in Egocentric Vision: A Survey.自我中心视觉中的手分析:调查。
IEEE Trans Pattern Anal Mach Intell. 2023 Jun;45(6):6846-6866. doi: 10.1109/TPAMI.2020.2986648. Epub 2023 May 5.
6
Need a bag? A review of public policies on plastic carrier bags - Where, how and to what effect?需要袋子吗?对塑料袋公共政策的回顾——在哪里、如何以及有何影响?
Waste Manag. 2019 Mar 15;87:428-440. doi: 10.1016/j.wasman.2019.02.025. Epub 2019 Feb 19.