Department of Surgery, Massachusetts General Hospital, 15 Parkman Street, WAC460, Boston, MA, 02114, USA.
Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, USA.
Surg Endosc. 2021 Sep;35(9):4918-4929. doi: 10.1007/s00464-021-08578-9. Epub 2021 Jul 6.
The growing interest in analysis of surgical video through machine learning has led to increased research efforts; however, common methods of annotating video data are lacking. There is a need to establish recommendations on the annotation of surgical video data to enable assessment of algorithms and multi-institutional collaboration.
Four working groups were formed from a pool of participants that included clinicians, engineers, and data scientists. The working groups were focused on four themes: (1) temporal models, (2) actions and tasks, (3) tissue characteristics and general anatomy, and (4) software and data structure. A modified Delphi process was utilized to create a consensus survey based on suggested recommendations from each of the working groups.
After three Delphi rounds, consensus was reached on recommendations for annotation within each of these domains. A hierarchy for annotation of temporal events in surgery was established.
While additional work remains to achieve accepted standards for video annotation in surgery, the consensus recommendations on a general framework for annotation presented here lay the foundation for standardization. This type of framework is critical to enabling diverse datasets, performance benchmarks, and collaboration.
通过机器学习对手术视频进行分析的兴趣日益浓厚,这促使研究人员加大了研究力度;但是,目前缺乏对视频数据进行注释的通用方法。有必要制定有关手术视频数据注释的建议,以评估算法和多机构协作。
从包括临床医生、工程师和数据科学家在内的参与者中组建了四个工作组。这些工作组专注于四个主题:(1)时间模型,(2)动作和任务,(3)组织特征和一般解剖结构,以及(4)软件和数据结构。利用改良的 Delphi 流程,根据每个工作组的建议制定了一项共识调查。
经过三轮 Delphi 调查,在这些领域中的每个领域内达成了关于注释建议的共识。建立了手术中时间事件注释的层次结构。
尽管在实现手术视频注释的公认标准方面仍有工作要做,但此处提出的有关注释通用框架的共识建议为标准化奠定了基础。这种类型的框架对于实现多样化数据集、性能基准和协作至关重要。