Linder Roland, Albers Andreas E, Hess Markus, Pöppl Siegfried J, Schönweiler Rainer
Institute of Medical Informatics, University of Lübeck, D-23538 Lübeck, Germany.
J Voice. 2008 Mar;22(2):155-63. doi: 10.1016/j.jvoice.2006.09.003. Epub 2006 Oct 30.
For diagnosis and classification of dysphonia, voice specialists can choose from an array of diagnostic tools like perceptual tests or acoustic voice analysis. These methods have in common that they require a high level of specialized training and experience, and therefore are mostly reserved to specialized centers. We aimed at developing an acoustic voice analysis system that could be used as a screening device to monitor, document, and diagnose voice problems that are also encountered by non-voice specialists, such as anesthesiologists, head and neck surgeons, and general surgeons before surgery of the thyroid gland and the upper thoracic aperture. An acoustical feature extraction paradigm that focused on jitter, shimmer, standard deviation of fundamental frequency, and the glottal-to-noise excitation ratio was used to reanalyse 120 voice samples previously analyzed by Schönweiler et al (A Novel Approach to Acoustical Voice Analysis Using Artificial Neural Networks. JARO. 2000:1;270-282). An improved artificial neural network (ANN) was used for classification. Building on this preliminary work, we modified the mathematical algorithm to further improve classification accuracy. Eighty percent of all voice samples could be classified correctly as either healthy or hoarse (sensitivity: 63.0%; specificity: 93.9%; area under the curve: 0.854). The adaptation of the ANN-voice analysis system for mobile use may facilitate its use and acceptance by non-voice specialists for the discovery and documentation of preexisting voice disorders, and may thereby lead to a timely initiation of further diagnosis and therapy by voice specialists.
对于嗓音障碍的诊断和分类,嗓音专家可以从一系列诊断工具中进行选择,如感知测试或声学嗓音分析。这些方法的共同之处在于它们需要高水平的专业培训和经验,因此大多仅在专业中心使用。我们旨在开发一种声学嗓音分析系统,该系统可作为一种筛查设备,用于监测、记录和诊断非嗓音专家(如麻醉医生、头颈外科医生以及甲状腺和胸廓上口手术前的普通外科医生)也会遇到的嗓音问题。一种专注于抖动、闪烁、基频标准差和声门噪声激励比的声学特征提取范式,被用于重新分析120个先前由舍恩魏勒等人分析过的嗓音样本(《一种使用人工神经网络进行声学嗓音分析的新方法》。《语音通信》。2000年;1卷;270 - 282页)。一种改进的人工神经网络(ANN)被用于分类。基于这项初步工作,我们修改了数学算法以进一步提高分类准确率。所有嗓音样本中有80%能够被正确分类为健康或嘶哑(敏感性:63.0%;特异性:93.9%;曲线下面积:0.854)。将人工神经网络嗓音分析系统适配为可移动使用,可能会促进非嗓音专家对其的使用和接受,以便发现和记录已存在的嗓音障碍,从而可能促使嗓音专家及时启动进一步的诊断和治疗。