• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一项融合超分辨率和目标检测的课堂行为识别研究。

A Study of Classroom Behavior Recognition Incorporating Super-Resolution and Target Detection.

作者信息

Zhang Xiaoli, Nie Jialei, Wei Shoulin, Zhu Guifu, Dai Wei, Yang Can

机构信息

Key Laboratory of Computer Science, Kunming University of Science and Technology, Kunming 650500, China.

School of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China.

出版信息

Sensors (Basel). 2024 Aug 30;24(17):5640. doi: 10.3390/s24175640.

DOI:10.3390/s24175640
PMID:39275552
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11487426/
Abstract

With the development of educational technology, machine learning and deep learning provide technical support for traditional classroom observation assessment. However, in real classroom scenarios, the technique faces challenges such as lack of clarity of raw images, complexity of datasets, multi-target detection errors, and complexity of character interactions. Based on the above problems, a student classroom behavior recognition network incorporating super-resolution and target detection is proposed. To cope with the problem of unclear original images in the classroom scenario, SRGAN (Super Resolution Generative Adversarial Network for Images) is used to improve the image resolution and thus the recognition accuracy. To address the dataset complexity and multi-targeting problems, feature extraction is optimized, and multi-scale feature recognition is enhanced by introducing AKConv and LASK attention mechanisms into the Backbone module of the YOLOv8s algorithm. To improve the character interaction complexity problem, the CBAM attention mechanism is integrated to enhance the recognition of important feature channels and spatial regions. Experiments show that it can detect six behaviors of students-raising their hands, reading, writing, playing on their cell phones, looking down, and leaning on the table-in high-definition images. And the accuracy and robustness of this network is verified. Compared with small-object detection algorithms such as Faster R-CNN, YOLOv5, and YOLOv8s, this network demonstrates good detection performance on low-resolution small objects, complex datasets with numerous targets, occlusion, and overlapping students.

摘要

随着教育技术的发展,机器学习和深度学习为传统课堂观察评估提供了技术支持。然而,在真实的课堂场景中,该技术面临着原始图像清晰度不足、数据集复杂、多目标检测错误以及人物交互复杂等挑战。基于上述问题,提出了一种融合超分辨率和目标检测的学生课堂行为识别网络。为解决课堂场景中原始图像不清晰的问题,使用SRGAN(图像超分辨率生成对抗网络)来提高图像分辨率,进而提高识别准确率。为应对数据集复杂和多目标问题,对特征提取进行了优化,并通过在YOLOv8s算法的骨干模块中引入AKConv和LASK注意力机制来增强多尺度特征识别。为改善人物交互复杂问题,集成了CBAM注意力机制以增强对重要特征通道和空间区域的识别。实验表明,它能够在高清图像中检测出学生举手、阅读、书写、玩手机、低头和趴在桌子上这六种行为。并且验证了该网络的准确性和鲁棒性。与Faster R-CNN、YOLOv5和YOLOv8s等小目标检测算法相比,该网络在低分辨率小目标、具有大量目标的复杂数据集、遮挡以及学生重叠的情况下展现出良好的检测性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c23/11487426/37487e5de05c/sensors-24-05640-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c23/11487426/b1541378c552/sensors-24-05640-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c23/11487426/4c1615b7ea8d/sensors-24-05640-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c23/11487426/ec2ec60fc21a/sensors-24-05640-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c23/11487426/0b966fe15ce4/sensors-24-05640-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c23/11487426/df3fd3ed676e/sensors-24-05640-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c23/11487426/38b3d1031745/sensors-24-05640-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c23/11487426/04f124211180/sensors-24-05640-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c23/11487426/22e912697d57/sensors-24-05640-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c23/11487426/82bef2591f6c/sensors-24-05640-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c23/11487426/0d801a83bbb3/sensors-24-05640-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c23/11487426/8138500fe184/sensors-24-05640-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c23/11487426/32aad77c8984/sensors-24-05640-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c23/11487426/158cff9f38f4/sensors-24-05640-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c23/11487426/37487e5de05c/sensors-24-05640-g014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c23/11487426/b1541378c552/sensors-24-05640-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c23/11487426/4c1615b7ea8d/sensors-24-05640-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c23/11487426/ec2ec60fc21a/sensors-24-05640-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c23/11487426/0b966fe15ce4/sensors-24-05640-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c23/11487426/df3fd3ed676e/sensors-24-05640-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c23/11487426/38b3d1031745/sensors-24-05640-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c23/11487426/04f124211180/sensors-24-05640-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c23/11487426/22e912697d57/sensors-24-05640-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c23/11487426/82bef2591f6c/sensors-24-05640-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c23/11487426/0d801a83bbb3/sensors-24-05640-g010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c23/11487426/8138500fe184/sensors-24-05640-g011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c23/11487426/32aad77c8984/sensors-24-05640-g012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c23/11487426/158cff9f38f4/sensors-24-05640-g013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1c23/11487426/37487e5de05c/sensors-24-05640-g014.jpg

相似文献

1
A Study of Classroom Behavior Recognition Incorporating Super-Resolution and Target Detection.一项融合超分辨率和目标检测的课堂行为识别研究。
Sensors (Basel). 2024 Aug 30;24(17):5640. doi: 10.3390/s24175640.
2
Super-resolution fusion optimization for poultry detection: a multi-object chicken detection method.家禽检测的超分辨率融合优化:一种多目标鸡检测方法。
J Anim Sci. 2023 Jan 3;101. doi: 10.1093/jas/skad249.
3
YOLOv8-RMDA: Lightweight YOLOv8 Network for Early Detection of Small Target Diseases in Tea.YOLOv8-RMDA:用于茶中早期检测小目标疾病的轻量级 YOLOv8 网络。
Sensors (Basel). 2024 May 1;24(9):2896. doi: 10.3390/s24092896.
4
A Contraband Detection Scheme in X-ray Security Images Based on Improved YOLOv8s Network Model.基于改进YOLOv8s网络模型的X射线安全图像违禁品检测方案
Sensors (Basel). 2024 Feb 9;24(4):1158. doi: 10.3390/s24041158.
5
R-YOLO: A YOLO-Based Method for Arbitrary-Oriented Target Detection in High-Resolution Remote Sensing Images.R-YOLO:一种用于高分辨率遥感图像中任意方向目标检测的 YOLO 方法。
Sensors (Basel). 2022 Jul 30;22(15):5716. doi: 10.3390/s22155716.
6
Deep Visual Computing of Behavioral Characteristics in Complex Scenarios and Embedded Object Recognition Applications.复杂场景下行为特征的深度视觉计算与嵌入式目标识别应用。
Sensors (Basel). 2024 Jul 15;24(14):4582. doi: 10.3390/s24144582.
7
ssFPN: Scale Sequence () Feature-Based Feature Pyramid Network for Object Detection.ssFPN:基于尺度序列(Scale Sequence)特征的目标检测特征金字塔网络。
Sensors (Basel). 2023 Apr 30;23(9):4432. doi: 10.3390/s23094432.
8
Small Target-YOLOv5: Enhancing the Algorithm for Small Object Detection in Drone Aerial Imagery Based on YOLOv5.小型目标-YOLOv5:基于YOLOv5增强无人机航空影像中小目标检测算法
Sensors (Basel). 2023 Dec 26;24(1):134. doi: 10.3390/s24010134.
9
CCW-YOLOv5: A forward-looking sonar target method based on coordinate convolution and modified boundary frame loss.CCW-YOLOv5:一种基于坐标卷积和改进边界框损失的前瞻性声纳目标检测方法。
PLoS One. 2024 Jun 3;19(6):e0300976. doi: 10.1371/journal.pone.0300976. eCollection 2024.
10
Improved Small Object Detection Algorithm CRL-YOLOv5.改进的小目标检测算法CRL-YOLOv5
Sensors (Basel). 2024 Oct 4;24(19):6437. doi: 10.3390/s24196437.

引用本文的文献

1
YOLO-CBD: Classroom Behavior Detection Method Based on Behavior Feature Extraction and Aggregation.YOLO-CBD:基于行为特征提取与聚合的课堂行为检测方法
Sensors (Basel). 2025 May 13;25(10):3073. doi: 10.3390/s25103073.
2
Classroom Behavior Recognition Using Computer Vision: A Systematic Review.基于计算机视觉的课堂行为识别:系统综述
Sensors (Basel). 2025 Jan 10;25(2):373. doi: 10.3390/s25020373.

本文引用的文献

1
Reliability and validity of OpenPose for measuring hip-knee-ankle angle in patients with knee osteoarthritis.用于测量膝骨关节炎患者髋膝踝角度的 OpenPose 的可靠性和有效性。
Sci Rep. 2023 Feb 25;13(1):3297. doi: 10.1038/s41598-023-30352-1.
2
High Precision Feature Fast Extraction Strategy for Aircraft Attitude Sensor Fault Based on RepVGG and SENet Attention Mechanism.基于 RepVGG 和 SENet 注意力机制的飞机姿态传感器故障高精度特征快速提取策略。
Sensors (Basel). 2022 Dec 9;22(24):9662. doi: 10.3390/s22249662.
3
ME R-CNN: Multi-Expert R-CNN for Object Detection.
ME R-CNN:用于目标检测的多专家区域卷积神经网络
IEEE Trans Image Process. 2019 Sep 9. doi: 10.1109/TIP.2019.2938879.