特征融合与聚类用于关键帧提取。

Feature fusion and clustering for key frame extraction.

机构信息

School of Internet of Things, Nanjing University of Posts and Telecommunications, Nanjing, 210023, China.

School of Computer Science and Technology, Nanjing University of Posts and Telecommunications, Nanjing, 210023, China.

出版信息

Math Biosci Eng. 2021 Oct 27;18(6):9294-9311. doi: 10.3934/mbe.2021457.

DOI:10.3934/mbe.2021457

PMID:34814346

Abstract

Numerous limitations of Shot-based and Content-based key-frame extraction approaches have encouraged the development of Cluster-based algorithms. This paper proposes an Optimal Threshold and Maximum Weight (OTMW) clustering approach that allows accurate and automatic extraction of video summarization. Firstly, the video content is analyzed using the image color, texture and information complexity, and video feature dataset is constructed. Then a Golden Section method is proposed to determine the threshold function optimal solution. The initial cluster center and the cluster number k are automatically obtained by employing the improved clustering algorithm. k-clusters video frames are produced with the help of K-MEANS algorithm. The representative frame of each cluster is extracted using the Maximum Weight method and an accurate video summarization is obtained. The proposed approach is tested on 16 multi-type videos, and the obtained key-frame quality evaluation index, and the average of Fidelity and Ratio are 96.11925 and 97.128, respectively. Fortunately, the key-frames extracted by the proposed approach are consistent with artificial visual judgement. The performance of the proposed approach is compared with several state-of-the-art cluster-based algorithms, and the Fidelity are increased by 12.49721, 10.86455, 10.62984 and 10.4984375, respectively. In addition, the Ratio is increased by 1.958 on average with small fluctuations. The obtained experimental results demonstrate the advantage of the proposed solution over several related baselines on sixteen diverse datasets and validated that proposed approach can accurately extract video summarization from multi-type videos.

摘要

基于镜头和基于内容的关键帧提取方法存在诸多局限性，这促使了基于聚类的算法的发展。本文提出了一种最优阈值和最大权重（OTMW）聚类方法，该方法允许准确和自动地提取视频摘要。首先，使用图像颜色、纹理和信息复杂度分析视频内容，并构建视频特征数据集。然后，提出了一种黄金分割法来确定最优的阈值函数解。通过采用改进的聚类算法，自动获得初始聚类中心和聚类数量 k。在 K-MEANS 算法的帮助下，生成 k 个聚类的视频帧。使用最大权重法提取每个聚类的代表性帧，从而获得准确的视频摘要。在 16 个多类型视频上对所提出的方法进行了测试，得到的关键帧质量评估指标，以及保真度和比率的平均值分别为 96.11925 和 97.128。幸运的是，所提出方法提取的关键帧与人工视觉判断一致。将所提出的方法与几种基于聚类的最先进算法进行了性能比较，保真度分别提高了 12.49721、10.86455、10.62984 和 10.4984375，比率平均提高了 1.958，且波动较小。实验结果表明，与十六个不同数据集的几种相关基线相比，所提出的方法在提取视频摘要方面具有优势，验证了所提出的方法可以从多种类型的视频中准确地提取视频摘要。

相似文献

Feature fusion and clustering for key frame extraction.特征融合与聚类用于关键帧提取。

Math Biosci Eng. 2021 Oct 27;18(6):9294-9311. doi: 10.3934/mbe.2021457.

News Video Summarization Combining SURF and Color Histogram Features.结合加速鲁棒特征和颜色直方图特征的新闻视频摘要

Entropy (Basel). 2021 Jul 30;23(8):982. doi: 10.3390/e23080982.

Intelligent Sports Video Classification Based on Deep Neural Network (DNN) Algorithm and Transfer Learning.基于深度神经网络（DNN）算法和迁移学习的智能体育视频分类。

Comput Intell Neurosci. 2021 Nov 24;2021:1825273. doi: 10.1155/2021/1825273. eCollection 2021.

Scalable gastroscopic video summarization via similar-inhibition dictionary selection.通过相似抑制字典选择实现可扩展的胃镜视频摘要

Artif Intell Med. 2016 Jan;66:1-13. doi: 10.1016/j.artmed.2015.08.006. Epub 2015 Aug 18.

Keyframe extraction from laparoscopic videos based on visual saliency detection.基于视觉显著性检测的腹腔镜视频关键帧提取。

Comput Methods Programs Biomed. 2018 Oct;165:13-23. doi: 10.1016/j.cmpb.2018.07.004. Epub 2018 Jul 18.

Video Summarization Based on Mutual Information and Entropy Sliding Window Method.基于互信息和熵滑动窗口法的视频摘要

Entropy (Basel). 2020 Nov 12;22(11):1285. doi: 10.3390/e22111285.

Multimodal Stereoscopic Movie Summarization Conforming to Narrative Characteristics.符合叙事特征的多模态立体电影摘要

IEEE Trans Image Process. 2016 Dec;25(12):5828-5840. doi: 10.1109/TIP.2016.2615289. Epub 2016 Oct 5.

Heterogeneity image patch index and its application to consumer video summarization.异质图像块索引及其在消费级视频摘要中的应用。

IEEE Trans Image Process. 2014 Jun;23(6):2704-18. doi: 10.1109/TIP.2014.2320814.

RPCA-KFE: Key Frame Extraction for Video Using Robust Principal Component Analysis.RPCA-KFE：基于鲁棒主成分分析的视频关键帧提取。

IEEE Trans Image Process. 2015 Nov;24(11):3742-53. doi: 10.1109/TIP.2015.2445572. Epub 2015 Jun 15.

Key frame extraction method for lecture videos based on spatio-temporal subtitles.基于时空字幕的讲座视频关键帧提取方法

Multimed Tools Appl. 2023 Jun 2:1-14. doi: 10.1007/s11042-023-15829-5.

特征融合与聚类用于关键帧提取。 - Suppr | 超能文献

特征融合与聚类用于关键帧提取。

Feature fusion and clustering for key frame extraction.

机构信息

School of Internet of Things, Nanjing University of Posts and Telecommunications, Nanjing, 210023, China.

School of Computer Science and Technology, Nanjing University of Posts and Telecommunications, Nanjing, 210023, China.

出版信息

Math Biosci Eng. 2021 Oct 27;18(6):9294-9311. doi: 10.3934/mbe.2021457.

DOI:10.3934/mbe.2021457

PMID:34814346

Abstract

摘要

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

特征融合与聚类用于关键帧提取。

Feature fusion and clustering for key frame extraction.

机构信息

出版信息

相似文献

特征融合与聚类用于关键帧提取。

Feature fusion and clustering for key frame extraction.

机构信息

出版信息

相似文献