School of Internet of Things, Nanjing University of Posts and Telecommunications, Nanjing, 210023, China.
School of Computer Science and Technology, Nanjing University of Posts and Telecommunications, Nanjing, 210023, China.
Math Biosci Eng. 2021 Oct 27;18(6):9294-9311. doi: 10.3934/mbe.2021457.
Numerous limitations of Shot-based and Content-based key-frame extraction approaches have encouraged the development of Cluster-based algorithms. This paper proposes an Optimal Threshold and Maximum Weight (OTMW) clustering approach that allows accurate and automatic extraction of video summarization. Firstly, the video content is analyzed using the image color, texture and information complexity, and video feature dataset is constructed. Then a Golden Section method is proposed to determine the threshold function optimal solution. The initial cluster center and the cluster number k are automatically obtained by employing the improved clustering algorithm. k-clusters video frames are produced with the help of K-MEANS algorithm. The representative frame of each cluster is extracted using the Maximum Weight method and an accurate video summarization is obtained. The proposed approach is tested on 16 multi-type videos, and the obtained key-frame quality evaluation index, and the average of Fidelity and Ratio are 96.11925 and 97.128, respectively. Fortunately, the key-frames extracted by the proposed approach are consistent with artificial visual judgement. The performance of the proposed approach is compared with several state-of-the-art cluster-based algorithms, and the Fidelity are increased by 12.49721, 10.86455, 10.62984 and 10.4984375, respectively. In addition, the Ratio is increased by 1.958 on average with small fluctuations. The obtained experimental results demonstrate the advantage of the proposed solution over several related baselines on sixteen diverse datasets and validated that proposed approach can accurately extract video summarization from multi-type videos.
基于镜头和基于内容的关键帧提取方法存在诸多局限性,这促使了基于聚类的算法的发展。本文提出了一种最优阈值和最大权重(OTMW)聚类方法,该方法允许准确和自动地提取视频摘要。首先,使用图像颜色、纹理和信息复杂度分析视频内容,并构建视频特征数据集。然后,提出了一种黄金分割法来确定最优的阈值函数解。通过采用改进的聚类算法,自动获得初始聚类中心和聚类数量 k。在 K-MEANS 算法的帮助下,生成 k 个聚类的视频帧。使用最大权重法提取每个聚类的代表性帧,从而获得准确的视频摘要。在 16 个多类型视频上对所提出的方法进行了测试,得到的关键帧质量评估指标,以及保真度和比率的平均值分别为 96.11925 和 97.128。幸运的是,所提出方法提取的关键帧与人工视觉判断一致。将所提出的方法与几种基于聚类的最先进算法进行了性能比较,保真度分别提高了 12.49721、10.86455、10.62984 和 10.4984375,比率平均提高了 1.958,且波动较小。实验结果表明,与十六个不同数据集的几种相关基线相比,所提出的方法在提取视频摘要方面具有优势,验证了所提出的方法可以从多种类型的视频中准确地提取视频摘要。