• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于视觉显著性检测的腹腔镜视频关键帧提取。

Keyframe extraction from laparoscopic videos based on visual saliency detection.

机构信息

Laboratory of Medical Physics, Medical School, National and Kapodistrian University of Athens, Mikras Asias 75 str., Athens 11527, Greece.

School of Electrical and Computer Engineering, National and Technical University of Athens, Athens, Greece.

出版信息

Comput Methods Programs Biomed. 2018 Oct;165:13-23. doi: 10.1016/j.cmpb.2018.07.004. Epub 2018 Jul 18.

DOI:10.1016/j.cmpb.2018.07.004
PMID:30337068
Abstract

BACKGROUND AND OBJECTIVE

Laparoscopic surgery offers the potential for video recording of the operation, which is important for technique evaluation, cognitive training, patient briefing and documentation. An effective way for video content representation is to extract a limited number of keyframes with semantic information. In this paper we present a novel method for keyframe extraction from individual shots of the operational video.

METHODS

The laparoscopic video was first segmented into video shots using an objectness model, which was trained to capture significant changes in the endoscope field of view. Each frame of a shot was then decomposed into three saliency maps in order to model the preference of human vision to regions with higher differentiation with respect to color, motion and texture. The accumulated responses from each map provided a 3D time series of saliency variation across the shot. The time series was modeled as a multivariate autoregressive process with hidden Markov states (HMMAR model). This approach allowed the temporal segmentation of the shot into a predefined number of states. A representative keyframe was extracted from each state based on the highest state-conditional probability of the corresponding saliency vector.

RESULTS

Our method was tested on 168 video shots extracted from various laparoscopic cholecystectomy operations from the publicly available Cholec80 dataset. Four state-of-the-art methodologies were used for comparison. The evaluation was based on two assessment metrics: Color Consistency Score (CCS), which measures the color distance between the ground truth (GT) and the closest keyframe, and Temporal Consistency Score (TCS), which considers the temporal proximity between GT and extracted keyframes. About 81% of the extracted keyframes matched the color content of the GT keyframes, compared to 77% yielded by the second-best method. The TCS of the proposed and the second-best method was close to 1.9 and 1.4 respectively.

CONCLUSIONS

Our results demonstrated that the proposed method yields superior performance in terms of content and temporal consistency to the ground truth. The extracted keyframes provided highly semantic information that may be used for various applications related to surgical video content representation, such as workflow analysis, video summarization and retrieval.

摘要

背景与目的

腹腔镜手术提供了操作视频录像的潜力,这对于技术评估、认知训练、患者介绍和记录非常重要。一种有效的视频内容表示方法是提取具有语义信息的有限数量的关键帧。本文提出了一种从手术视频的单个镜头中提取关键帧的新方法。

方法

首先使用目标模型将腹腔镜视频分割成视频镜头,该模型经过训练可捕获内窥镜视野中显著变化的情况。然后,将每一帧分解成三个显著度图,以便对人类视觉对颜色、运动和纹理差异较大的区域的偏好进行建模。从每个地图上的累积响应提供了整个镜头中显著度变化的 3D 时序列。该时序列被建模为具有隐藏马尔可夫状态(HMMAR 模型)的多元自回归过程。这种方法允许根据对应显著向量的最高状态条件概率将镜头的时间序列分割成预定义数量的状态。从每个状态中提取一个代表关键帧。

结果

我们的方法在从公共 Cholec80 数据集的各种腹腔镜胆囊切除术操作中提取的 168 个视频镜头上进行了测试。使用了四种最先进的方法进行比较。评估基于两个评估指标:颜色一致性得分(CCS),用于测量地面实况(GT)和最近关键帧之间的颜色距离,以及时间一致性得分(TCS),用于考虑 GT 和提取关键帧之间的时间接近度。与第二种方法相比,大约 81%的提取关键帧与 GT 关键帧的颜色内容匹配,而第二种方法的匹配率为 77%。所提出的方法和第二种方法的 TCS 分别接近 1.9 和 1.4。

结论

我们的结果表明,与地面实况相比,所提出的方法在内容和时间一致性方面表现出更好的性能。提取的关键帧提供了高度语义信息,可用于与手术视频内容表示相关的各种应用,例如工作流程分析、视频摘要和检索。

相似文献

1
Keyframe extraction from laparoscopic videos based on visual saliency detection.基于视觉显著性检测的腹腔镜视频关键帧提取。
Comput Methods Programs Biomed. 2018 Oct;165:13-23. doi: 10.1016/j.cmpb.2018.07.004. Epub 2018 Jul 18.
2
Video saliency incorporating spatiotemporal cues and uncertainty weighting.视频显著度融合时空线索和不确定性加权。
IEEE Trans Image Process. 2014 Sep;23(9):3910-21. doi: 10.1109/TIP.2014.2336549. Epub 2014 Jul 16.
3
Regularized feature reconstruction for spatio-temporal saliency detection.正则化特征重构的时空显著检测。
IEEE Trans Image Process. 2013 Aug;22(8):3120-32. doi: 10.1109/TIP.2013.2259837.
4
Scalable gastroscopic video summarization via similar-inhibition dictionary selection.通过相似抑制字典选择实现可扩展的胃镜视频摘要
Artif Intell Med. 2016 Jan;66:1-13. doi: 10.1016/j.artmed.2015.08.006. Epub 2015 Aug 18.
5
Selecting salient frames for spatiotemporal video modeling and segmentation.为时空视频建模与分割选择显著帧。
IEEE Trans Image Process. 2007 Dec;16(12):3035-46. doi: 10.1109/tip.2007.908283.
6
Classification approach for automatic laparoscopic video database organization.用于自动腹腔镜视频数据库组织的分类方法。
Int J Comput Assist Radiol Surg. 2015 Sep;10(9):1449-60. doi: 10.1007/s11548-015-1183-4. Epub 2015 Apr 7.
7
Keyframe Extraction From Laparoscopic Videos via Diverse and Weighted Dictionary Selection.基于多样且加权词典选择的腹腔镜视频关键帧提取。
IEEE J Biomed Health Inform. 2021 May;25(5):1686-1698. doi: 10.1109/JBHI.2020.3019198. Epub 2021 May 11.
8
Interactive exploration of surveillance video through action shot summarization and trajectory visualization.通过动作镜头摘要和轨迹可视化进行监控视频的交互式探索。
IEEE Trans Vis Comput Graph. 2013 Dec;19(12):2119-28. doi: 10.1109/TVCG.2013.168.
9
MRT letter: visual attention driven framework for hysteroscopy video abstraction.MRT 信:用于宫腔镜视频抽象的视觉注意驱动框架。
Microsc Res Tech. 2013 Jun;76(6):559-63. doi: 10.1002/jemt.22205. Epub 2013 Mar 30.
10
Spatiotemporal saliency in dynamic scenes.动态场景中的时空显著特征。
IEEE Trans Pattern Anal Mach Intell. 2010 Jan;32(1):171-7. doi: 10.1109/TPAMI.2009.112.

引用本文的文献

1
Kinematic Adaptive Frame Recognition (KAFR): A Novel Framework for Video Segmentation via Frame Similarity and Surgical Tool Tracking.运动自适应帧识别(KAFR):一种通过帧相似度和手术工具跟踪进行视频分割的新型框架。
IEEE Access. 2025;13:101681-101697. doi: 10.1109/access.2025.3573264. Epub 2025 May 23.
2
Use of artificial intelligence in the analysis of digital videos of invasive surgical procedures: scoping review.人工智能在侵入性外科手术数字视频分析中的应用:范围综述。
BJS Open. 2025 Jul 1;9(4). doi: 10.1093/bjsopen/zraf073.
3
Artificial Intelligence in the Management of Women with Endometriosis and Adenomyosis: Can Machines Ever Be Worse Than Humans?
人工智能在子宫内膜异位症和子宫腺肌病女性管理中的应用:机器会比人类更差吗?
J Clin Med. 2024 May 16;13(10):2950. doi: 10.3390/jcm13102950.
4
Telemedicine and Artificial Intelligence in the Management of Endometriosis: Future Forecast Considering Current Progress.远程医疗与人工智能在子宫内膜异位症管理中的应用:基于当前进展的未来预测
Geburtshilfe Frauenheilkd. 2022 Nov 29;83(1):116-117. doi: 10.1055/a-1950-6634. eCollection 2023 Jan.
5
It's time for a minimum synoptic operation template in patients undergoing laparoscopic cholecystectomy: a systematic review.行腹腔镜胆囊切除术患者最小手术模板的时机:系统评价。
World J Emerg Surg. 2022 Mar 17;17(1):15. doi: 10.1186/s13017-022-00411-5.