Saini Parul, Kumar Krishan, Kashid Shamal, Saini Ashray, Negi Alok
Department of Computer Science and Engineering, National Institute of Technology Uttarakhand, Srinagar Garhwal, Uttarakhand 246174 India.
Artif Intell Rev. 2023 Mar 15:1-39. doi: 10.1007/s10462-023-10444-0.
One of the critical multimedia analysis problems in today's digital world is video summarization (VS). Many VS methods have been suggested based on deep learning methods. Nevertheless, These are inefficient in processing, extracting, and deriving information in the minimum amount of time from long-duration videos. Detailed analysis and investigation of numerous deep learning approach accomplished to determine root of problems connected with different deep learning methods in identifying and summarizing the essential activities in such videos. Various deep learning techniques have been investigated and examined to detect the event and summarization capability for detecting and summarizing multiple activities. Keyframe selection Event detection, categorization, and the activity feature summarization correspond to each activity. The limitations related to each category are also discussed in depth. Concerns about detecting low activity using the deep network on various types of public datasets are also discussed. Viable strategies are suggested to evaluate and improve the generated video summaries on such datasets. Moreover, Potential recommended applications based on literature are listed out. Various deep learning tools for experimental analysis have also been discussed in the paper. Future directions are presented for further exploration of research in VS using deep learning strategies.
视频摘要(VS)是当今数字世界中关键的多媒体分析问题之一。基于深度学习方法已经提出了许多视频摘要方法。然而,这些方法在从长时间视频中以最少时间处理、提取和推导信息方面效率低下。对众多深度学习方法进行了详细分析和研究,以确定与不同深度学习方法相关的问题根源,这些问题涉及识别和总结此类视频中的关键活动。已经研究和检验了各种深度学习技术,以检测事件以及检测和总结多个活动的摘要能力。关键帧选择、事件检测、分类以及活动特征摘要对应于每个活动。还深入讨论了与每个类别相关的局限性。也讨论了在各种类型的公共数据集上使用深度网络检测低活跃度的相关问题。提出了可行的策略来评估和改进在此类数据集上生成的视频摘要。此外,列出了基于文献的潜在推荐应用。本文还讨论了用于实验分析的各种深度学习工具。提出了未来的研究方向,以便使用深度学习策略进一步探索视频摘要研究。