Knowledge Distillation in Video-Based Human Action Recognition: An Intuitive Approach to Efficient and Flexible Model Training.

Authors

Camarena Fernando, Gonzalez-Mendoza Miguel, Chang Leonardo

Affiliations

School of Engineering and Science, Tecnologico de Monterrey, Nuevo León 64700, Mexico.

KODS.ai, Mexico City 11510, Mexico.

Publication information

J Imaging. 2024 Mar 30;10(4):85. doi: 10.3390/jimaging10040085.

Abstract

Training a model to recognize human actions in videos is computationally intensive. Modern strategies use transfer learning to make the process more efficient, but they still face challenges in flexibility and efficiency. Existing solutions are limited in functionality and rely heavily on pretrained architectures, which can restrict their applicability to diverse scenarios. Our work explores knowledge distillation (KD) as a way to enhance the training of self-supervised video models in three respects: improving classification accuracy, accelerating model convergence, and increasing model flexibility under regular and limited-data scenarios. We tested our method on the UCF101 dataset using balanced subsets containing 100%, 50%, 25%, and 2% of the data. We found that guiding the model's training with knowledge distillation outperforms traditional training: it does not degrade classification accuracy, and it shortens the time to convergence in both standard settings and a data-scarce environment. Additionally, knowledge distillation enables cross-architecture flexibility, allowing models to be customized for applications ranging from resource-limited to high-performance scenarios.
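As a rough illustration of what "using knowledge distillation to guide training" can mean, the sketch below shows a common response-based distillation loss: a weighted sum of hard-label cross-entropy and a temperature-softened KL term between teacher and student outputs. This is an assumption for illustration only; the abstract does not state which distillation objective the authors used, and the function names, `temperature`, and `alpha` are hypothetical.

```python
import math


def log_softmax(logits, temperature=1.0):
    """Numerically stable log-softmax of logits scaled by 1/temperature."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)
    log_sum_exp = m + math.log(sum(math.exp(s - m) for s in scaled))
    return [s - log_sum_exp for s in scaled]


def distillation_loss(student_logits, teacher_logits, label,
                      temperature=4.0, alpha=0.5):
    """Hypothetical KD objective (not taken from the paper):
    alpha * CE(student, label) + (1 - alpha) * T^2 * KL(teacher || student).
    """
    # Hard-label cross-entropy on the student's unsoftened prediction.
    ce = -log_softmax(student_logits)[label]

    # KL divergence between temperature-softened teacher and student outputs.
    log_p_teacher = log_softmax(teacher_logits, temperature)
    log_p_student = log_softmax(student_logits, temperature)
    kl = sum(math.exp(lt) * (lt - ls)
             for lt, ls in zip(log_p_teacher, log_p_student))

    # T^2 keeps the soft-target term's gradient scale comparable across T.
    return alpha * ce + (1.0 - alpha) * temperature ** 2 * kl


if __name__ == "__main__":
    teacher = [0.0, 1.0, 3.0]   # teacher favours class 2
    student = [3.0, 1.0, 0.0]   # student currently favours class 0
    print(distillation_loss(student, teacher, label=2))
```

Because the soft-target term depends only on the two models' output distributions, the teacher and student need not share an architecture, which is one way to read the cross-architecture flexibility the abstract describes.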
