School of Electrical Engineering and Computer Science, University of Ottawa, 800 King-Edward Avenue, Ottawa, Ontario K1N-6N5, Canada.
School of Electrical Engineering and Computer Science, University of Ottawa, 800 King-Edward Avenue, Ottawa, Ontario K1N-6N5, Canada.
Methods. 2020 Jul 1;179:26-36. doi: 10.1016/j.ymeth.2020.05.011. Epub 2020 May 22.
One application of medical ultrasound imaging is to visualize and characterize human tongue shape and motion in real-time to study healthy or impaired speech production. Due to the low-contrast characteristic and noisy nature of ultrasound images, it requires knowledge about the tongue structure and ultrasound data interpretation for users to recognize tongue locations and gestures easily. Moreover, quantitative analysis of tongue motion needs the tongue contour to be extracted, tracked and visualized instead of the whole tongue region. Manual tongue contour extraction is a cumbersome, subjective, and error-prone task. Furthermore, it is not a feasible solution for real-time applications where the tongue contour moves rapidly with nuance gestures. This paper presents two new deep neural networks (named BowNet models) that benefit from the ability of global prediction of encoding-decoding fully convolutional neural networks and the capability of full-resolution extraction of dilated convolutions. Both qualitatively and quantitatively studies over datasets from two ultrasound machines disclosed the outstanding performances of the proposed deep learning models in terms of performance speed and robustness. Experimental results also revealed a significant improvement in the accuracy of prediction maps due to the better exploration and exploitation ability of the proposed network models.
医学超声成像是一种实时可视化和描述人体舌形和运动的方法,用于研究健康或受损的言语产生。由于超声图像的低对比度和噪声特性,用户需要了解舌结构和超声数据解释,以便轻松识别舌位和舌动。此外,舌运动的定量分析需要提取、跟踪和可视化舌轮廓,而不是整个舌区域。手动舌轮廓提取是一项繁琐、主观且容易出错的任务。此外,对于需要快速运动并带有细微手势的实时应用程序,手动提取不是一种可行的解决方案。本文提出了两种新的深度神经网络(称为 BowNet 模型),它们受益于编码-解码全卷积神经网络的全局预测能力和扩张卷积的全分辨率提取能力。通过来自两台超声机的数据集进行定性和定量研究,揭示了所提出的深度学习模型在性能速度和鲁棒性方面的出色表现。实验结果还表明,由于所提出的网络模型具有更好的探索和利用能力,预测图的准确性得到了显著提高。