Howard James P, Tan Jeremy, Shun-Shin Matthew J, Mahdi Dina, Nowbar Alexandra N, Arnold Ahran D, Ahmad Yousif, McCartney Peter, Zolgharni Massoud, Linton Nick W F, Sutaria Nilesh, Rana Bushra, Mayet Jamil, Rueckert Daniel, Cole Graham D, Francis Darrel P
National Heart and Lung Institute, Imperial College London, Hammersmith Hospital, London, UK.
Department of Computing, Imperial College London, Hammersmith Hospital, London, UK.
J Med Artif Intell. 2020 Mar 25;3. doi: 10.21037/jmai.2019.10.03.
Echocardiography is the commonest medical ultrasound examination, but automated interpretation is challenging and hinges on correct recognition of the 'view' (imaging plane and orientation). Current state-of-the-art methods for identifying the view computationally involve 2-dimensional convolutional neural networks (CNNs), but these merely classify individual frames of a video in isolation, and ignore information describing the movement of structures throughout the cardiac cycle. Here we explore the efficacy of novel CNN architectures, including time-distributed networks and two-stream networks, which are inspired by advances in human action recognition. We demonstrate that these new architectures more than halve the error rate of traditional CNNs from 8.1% to 3.9%. These advances in accuracy may be due to these networks' ability to track the movement of specific structures such as heart valves throughout the cardiac cycle. Finally, we show the accuracies of these new state-of-the-art networks are approaching expert agreement (3.6% discordance), with a similar pattern of discordance between views.
超声心动图是最常见的医学超声检查,但自动解读具有挑战性,关键在于正确识别“视图”(成像平面和方向)。目前用于通过计算识别视图的最先进方法涉及二维卷积神经网络(CNN),但这些方法只是孤立地对视频的各个帧进行分类,而忽略了描述整个心动周期中结构运动的信息。在此,我们探索了新型CNN架构的有效性,包括受人类动作识别进展启发的时间分布网络和双流网络。我们证明,这些新架构将传统CNN的错误率从8.1%降低了一半以上,降至3.9%。准确性的这些提高可能归因于这些网络跟踪特定结构(如心脏瓣膜)在整个心动周期中运动的能力。最后,我们表明这些新的最先进网络的准确性接近专家共识(不一致率为3.6%),不同视图之间的不一致模式相似。