Suppr超能文献

基于编解码器卷积神经网络的实时超声舌轮廓自动跟踪模型。

Encoder-decoder CNN models for automatic tracking of tongue contours in real-time ultrasound data.

机构信息

School of Electrical Engineering and Computer Science, University of Ottawa, 800 King-Edward Avenue, Ottawa, Ontario K1N-6N5, Canada.

School of Electrical Engineering and Computer Science, University of Ottawa, 800 King-Edward Avenue, Ottawa, Ontario K1N-6N5, Canada.

出版信息

Methods. 2020 Jul 1;179:26-36. doi: 10.1016/j.ymeth.2020.05.011. Epub 2020 May 22.

Abstract

One application of medical ultrasound imaging is to visualize and characterize human tongue shape and motion in real-time to study healthy or impaired speech production. Due to the low-contrast characteristic and noisy nature of ultrasound images, it requires knowledge about the tongue structure and ultrasound data interpretation for users to recognize tongue locations and gestures easily. Moreover, quantitative analysis of tongue motion needs the tongue contour to be extracted, tracked and visualized instead of the whole tongue region. Manual tongue contour extraction is a cumbersome, subjective, and error-prone task. Furthermore, it is not a feasible solution for real-time applications where the tongue contour moves rapidly with nuance gestures. This paper presents two new deep neural networks (named BowNet models) that benefit from the ability of global prediction of encoding-decoding fully convolutional neural networks and the capability of full-resolution extraction of dilated convolutions. Both qualitatively and quantitatively studies over datasets from two ultrasound machines disclosed the outstanding performances of the proposed deep learning models in terms of performance speed and robustness. Experimental results also revealed a significant improvement in the accuracy of prediction maps due to the better exploration and exploitation ability of the proposed network models.

摘要

医学超声成像是一种实时可视化和描述人体舌形和运动的方法,用于研究健康或受损的言语产生。由于超声图像的低对比度和噪声特性,用户需要了解舌结构和超声数据解释,以便轻松识别舌位和舌动。此外,舌运动的定量分析需要提取、跟踪和可视化舌轮廓,而不是整个舌区域。手动舌轮廓提取是一项繁琐、主观且容易出错的任务。此外,对于需要快速运动并带有细微手势的实时应用程序,手动提取不是一种可行的解决方案。本文提出了两种新的深度神经网络(称为 BowNet 模型),它们受益于编码-解码全卷积神经网络的全局预测能力和扩张卷积的全分辨率提取能力。通过来自两台超声机的数据集进行定性和定量研究,揭示了所提出的深度学习模型在性能速度和鲁棒性方面的出色表现。实验结果还表明,由于所提出的网络模型具有更好的探索和利用能力,预测图的准确性得到了显著提高。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验