Program in Speech-Language-Hearing Sciences, CUNY Graduate Center, New York, New York, USA.
Haskins Laboratories, New Haven, Connecticut, USA.
Clin Linguist Phon. 2022 Dec 2;36(12):1112-1131. doi: 10.1080/02699206.2021.1998633. Epub 2022 Jan 3.
Contours traced by trained phoneticians have been considered to be the most accurate way to identify the midsagittal tongue surface from ultrasound video frames. In this study, inter-measurer reliability was evaluated using measures that quantified both how closely human-placed contours approximated each other as well as how consistent measurers were in defining the start and end points of contours. High reliability across three measurers was found for all measures, consistent with treating contours placed by trained phoneticians as the 'gold standard.' However, due to the labour-intensive nature of hand-placing contours, automatic algorithms that detect the tongue surface are increasingly being used to extract tongue-surface data from ultrasound videos. Contours placed by six automatic algorithms (SLURP, EdgeTrak, EPCS, and three different configurations of the algorithm provided in Articulate Assistant Advanced) were compared to human-placed contours, with the same measures used to evaluate the consistency of the trained phoneticians. We found that contours defined by SLURP, EdgeTrak, and two of the AAA configurations closely matched the hand-placed contours along sections of the image where the algorithms and humans agreed that there was a discernible contour. All of the algorithms were much less reliable than humans in determining the anterior (tongue-tip) edge of tongue contours. Overall, the contours produced by SLURP, EdgeTrak, and AAA should be useable in a variety of clinical applications, subject to spot-checking. Additional practical considerations of these algorithms are also discussed.
经过训练的语音学家所描绘的轮廓被认为是从超声视频帧中识别正中矢状舌面的最准确方法。在这项研究中,使用了量化人类放置的轮廓彼此接近程度以及测量者在定义轮廓起点和终点方面一致性的度量标准来评估测量者之间的可靠性。对于所有度量标准,三个测量者的可靠性都很高,这与将经过训练的语音学家放置的轮廓视为“金标准”一致。然而,由于手动放置轮廓的劳动密集性质,越来越多的自动算法被用于从超声视频中提取舌面数据。将六个自动算法(SLURP、EdgeTrak、EPCS 和 Articulate Assistant Advanced 中提供的三种不同配置的算法)放置的轮廓与人工放置的轮廓进行了比较,使用相同的度量标准来评估经过训练的语音学家的一致性。我们发现,在算法和人类都认为有可识别轮廓的图像部分,SLURP、EdgeTrak 和两个 AAA 配置定义的轮廓与手工放置的轮廓非常吻合。所有算法在确定舌轮廓的前(舌尖)边缘方面都远不如人类可靠。总的来说,SLURP、EdgeTrak 和 AAA 生成的轮廓应该可以在各种临床应用中使用,但需要进行抽查。还讨论了这些算法的其他实际考虑因素。