• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于时间卷积网络的结肠镜检查视频时间分割方法及基准数据集。

A temporal convolutional network-based approach and a benchmark dataset for colonoscopy video temporal segmentation.

作者信息

Biffi Carlo, Roffo Giorgio, Salvagnini Pietro, Cherubini Andrea

机构信息

Cosmo Intelligent Medical Devices, Dublin, Ireland.

Cosmo Intelligent Medical Devices, Dublin, Ireland.

出版信息

Comput Methods Programs Biomed. 2025 Oct;270:108782. doi: 10.1016/j.cmpb.2025.108782. Epub 2025 Jul 3.

DOI:10.1016/j.cmpb.2025.108782
PMID:40633401
Abstract

BACKGROUND AND OBJECTIVE

Following recent advancements in computer-aided detection and diagnosis systems for colonoscopy, the automated reporting of colonoscopy procedures is set to further revolutionize clinical practice. A crucial yet underexplored aspect in the development of these systems is the creation of computer vision models capable of autonomously segmenting full-procedure colonoscopy videos into anatomical sections and procedural phases. In this work, we aim to create the first open-access dataset for this task and propose a state-of-the-art approach, benchmarked against competitive models.

METHODS

We annotated the publicly available REAL-Colon dataset, consisting of 2.7 million frames from 60 complete colonoscopy videos, with frame-level labels for anatomical locations and colonoscopy phases across nine categories. We then present ColonTCN, a learning-based architecture that employs custom temporal convolutional blocks designed to efficiently capture long temporal dependencies for the temporal segmentation of colonoscopy videos. We also propose a dual k-fold cross-validation evaluation protocol for this benchmark, which includes model assessment on unseen, multi-center data.

RESULTS

ColonTCN achieves state-of-the-art performance in classification accuracy while maintaining a low parameter count when evaluated using the two proposed k-fold cross-validation settings, outperforming competitive models. We report ablation studies to provide insights into the challenges of this task and highlight the benefits of the custom temporal convolutional blocks, which enhance learning and improve model efficiency.

CONCLUSIONS

We believe that the proposed open-access benchmark and the ColonTCN approach represent a significant advancement in the temporal segmentation of colonoscopy procedures, fostering further open-access research to address this clinical need. Code and data are available at: https://github.com/cosmoimd/temporal_segmentation.

摘要

背景与目的

随着结肠镜检查的计算机辅助检测与诊断系统近期取得进展,结肠镜检查程序的自动报告将进一步彻底改变临床实践。这些系统开发中一个关键但未得到充分探索的方面是创建能够将全流程结肠镜检查视频自动分割为解剖部分和操作阶段的计算机视觉模型。在这项工作中,我们旨在为此任务创建首个开放获取数据集,并提出一种先进方法,与竞争模型进行基准测试。

方法

我们对公开可用的REAL - Colon数据集进行注释,该数据集由来自60个完整结肠镜检查视频的270万帧组成,带有针对九个类别的解剖位置和结肠镜检查阶段的帧级标签。然后我们提出了ColonTCN,这是一种基于学习的架构,采用定制的时间卷积块,旨在有效捕捉长时时间依赖性以用于结肠镜检查视频的时间分割。我们还为此基准测试提出了一种双重k折交叉验证评估协议,其中包括对未见的多中心数据进行模型评估。

结果

当使用所提出的两种k折交叉验证设置进行评估时,ColonTCN在分类准确率方面达到了先进水平,同时保持了较低的参数数量,优于竞争模型。我们报告了消融研究,以深入了解此任务的挑战,并突出定制时间卷积块的优势,这些块增强了学习并提高了模型效率。

结论

我们相信所提出的开放获取基准和ColonTCN方法代表了结肠镜检查程序时间分割方面的重大进展,促进了进一步的开放获取研究以满足这一临床需求。代码和数据可在以下网址获取:https://github.com/cosmoimd/temporal_segmentation 。

相似文献

1
A temporal convolutional network-based approach and a benchmark dataset for colonoscopy video temporal segmentation.一种基于时间卷积网络的结肠镜检查视频时间分割方法及基准数据集。
Comput Methods Programs Biomed. 2025 Oct;270:108782. doi: 10.1016/j.cmpb.2025.108782. Epub 2025 Jul 3.
2
Point-cloud segmentation with in-silico data augmentation for prostate cancer treatment.用于前列腺癌治疗的基于计算机模拟数据增强的点云分割
Med Phys. 2025 Apr 3. doi: 10.1002/mp.17815.
3
Short-Term Memory Impairment短期记忆障碍
4
SymTC: A symbiotic Transformer-CNN net for instance segmentation of lumbar spine MRI.SymTC:一种用于腰椎 MRI 实例分割的共生 Transformer-CNN 网络。
Comput Biol Med. 2024 Sep;179:108795. doi: 10.1016/j.compbiomed.2024.108795. Epub 2024 Jul 1.
5
EDT-MCFEF: a multi-channel feature fusion model for emergency department triage of medical texts.EDT-MCFEF:一种用于医学文本急诊科分诊的多通道特征融合模型。
Front Public Health. 2025 Jun 18;13:1591491. doi: 10.3389/fpubh.2025.1591491. eCollection 2025.
6
Influence of early through late fusion on pancreas segmentation from imperfectly registered multimodal magnetic resonance imaging.早期至晚期融合对来自配准不完善的多模态磁共振成像的胰腺分割的影响。
J Med Imaging (Bellingham). 2025 Mar;12(2):024008. doi: 10.1117/1.JMI.12.2.024008. Epub 2025 Apr 26.
7
Integrating multi-source data for skin burn classification using deep learning.利用深度学习整合多源数据进行皮肤烧伤分类
Comput Biol Med. 2025 Sep;195:110556. doi: 10.1016/j.compbiomed.2025.110556. Epub 2025 Jun 24.
8
..
Int Ophthalmol. 2025 Jun 27;45(1):266. doi: 10.1007/s10792-025-03602-6.
9
A segment anything model-guided and match-based semi-supervised segmentation framework for medical imaging.一种用于医学成像的基于段式分割模型引导和匹配的半监督分割框架。
Med Phys. 2025 Mar 29. doi: 10.1002/mp.17785.
10
Designing a Computer-Aided Detection system for Barrett 's neoplasia: Insights in architectural choices, training strategies and inference approaches.设计用于巴雷特肿瘤的计算机辅助检测系统:架构选择、训练策略和推理方法的见解
Comput Methods Programs Biomed. 2025 Sep;269:108891. doi: 10.1016/j.cmpb.2025.108891. Epub 2025 Jun 18.

引用本文的文献

1
CAS-Colon: A Comprehensive Colonoscopy Anatomical Segmentation Dataset for Artificial Intelligence Development.CAS-结肠:一个用于人工智能开发的综合结肠镜解剖分割数据集。
Sci Data. 2025 Aug 7;12(1):1382. doi: 10.1038/s41597-025-05588-3.