ICube, University of Strasbourg, CNRS, France; IHU Strasbourg, France.
ICube, University of Strasbourg, CNRS, France; IHU Strasbourg, France; Fondazione Policlinico Universitario Agostino Gemelli IRCCS, Rome, Italy.
Med Image Anal. 2023 Aug;88:102866. doi: 10.1016/j.media.2023.102866. Epub 2023 Jun 15.
Searching through large volumes of medical data to retrieve relevant information is a challenging yet crucial task for clinical care. However the primitive and most common approach to retrieval, involving text in the form of keywords, is severely limited when dealing with complex media formats. Content-based retrieval offers a way to overcome this limitation, by using rich media as the query itself. Surgical video-to-video retrieval in particular is a new and largely unexplored research problem with high clinical value, especially in the real-time case: using real-time video hashing, search can be achieved directly inside of the operating room. Indeed, the process of hashing converts large data entries into compact binary arrays or hashes, enabling large-scale search operations at a very fast rate. However, due to fluctuations over the course of a video, not all bits in a given hash are equally reliable. In this work, we propose a method capable of mitigating this uncertainty while maintaining a light computational footprint. We present superior retrieval results (3%-4% top 10 mean average precision) on a multi-task evaluation protocol for surgery, using cholecystectomy phases, bypass phases, and coming from an entirely new dataset introduced here, surgical events across six different surgery types. Success on this multi-task benchmark shows the generalizability of our approach for surgical video retrieval.
从大量医学数据中检索相关信息是临床护理中一项具有挑战性但至关重要的任务。然而,在处理复杂媒体格式时,涉及关键字形式的文本的原始且最常见的检索方法受到严重限制。基于内容的检索提供了一种克服此限制的方法,即用丰富的媒体作为查询本身。特别是在实时情况下,手术视频到视频的检索是一个具有高临床价值但尚未得到充分探索的新研究问题:使用实时视频哈希,可直接在手术室中进行搜索。实际上,哈希处理过程将大数据条目转换为紧凑的二进制数组或哈希值,从而能够以非常快的速度进行大规模搜索操作。然而,由于视频过程中的波动,给定哈希中的并非所有位都是同样可靠的。在这项工作中,我们提出了一种能够在保持轻量级计算足迹的同时减轻这种不确定性的方法。我们使用这里引入的全新数据集,在胆囊切除术阶段、旁路阶段和来自六个不同手术类型的手术事件的多任务评估协议上,提出了优越的检索结果(3%-4%的前 10 个平均精度)。在这个多任务基准上的成功表明了我们的方法在手术视频检索中的通用性。