Başkent Deniz, Eiler Cheryl L, Edwards Brent
Starkey Hearing Research Center, Berkeley, California 94704, USA.
Ear Hear. 2007 Jun;28(3):370-80. doi: 10.1097/AUD.0b013e318047935e.
To present a comprehensive analysis of the feasibility of genetic algorithms (GA) for finding the best fit of hearing aids or cochlear implants for individual users in clinical or research settings, where the algorithm is solely driven by subjective human input.
Due to varying pathology, the best settings of an auditory device differ for each user. It is also likely that listening preferences vary at the same time. The settings of a device customized for a particular user can only be evaluated by the user. When optimization algorithms are used for fitting purposes, this situation poses a difficulty for a systematic and quantitative evaluation of the suitability of the fitting parameters produced by the algorithm. In the present study, an artificial listening environment was generated by distorting speech using a noiseband vocoder. The settings produced by the GA for this listening problem could objectively be evaluated by measuring speech recognition and comparing the performance to the best vocoder condition where speech was least distorted. Nine normal-hearing subjects participated in the study. The parameters to be optimized were the number of vocoder channels, the shift between the input frequency range and the synthesis frequency range, and the compression-expansion of the input frequency range over the synthesis frequency range. The subjects listened to pairs of sentences processed with the vocoder, and entered a preference for the sentence with better intelligibility. The GA modified the solutions iteratively according to the subject preferences. The program converged when the user ranked the same set of parameters as the best in three consecutive steps. The results produced by the GA were analyzed for quality by measuring speech intelligibility, for test-retest reliability by running the GA three times with each subject, and for convergence properties.
Speech recognition scores averaged across subjects were similar for the best vocoder solution and for the solutions produced by the GA. The average number of iterations was 8 and the average convergence time was 25.5 minutes. The settings produced by different GA runs for the same subject were slightly different; however, speech recognition scores measured with these settings were similar. Individual data from subjects showed that in each run, a small number of GA solutions produced poorer speech intelligibility than for the best setting. This was probably a result of the combination of the inherent randomness of the GA, the convergence criterion used in the present study, and possible errors that the users might have made during the paired comparisons. On the other hand, the effect of these errors was probably small compared to the other two factors, as a comparison between subjective preferences and objective measures showed that for many subjects the two were in good agreement.
The results showed that the GA was able to produce good solutions by using listener preferences in a relatively short time. For practical applications, the program can be made more robust by running the GA twice or by not using an automatic stopping criterion, and it can be made faster by optimizing the number of the paired comparisons completed in each iteration.
全面分析遗传算法(GA)在临床或研究环境中为个体用户寻找最适合的助听器或人工耳蜗的可行性,其中该算法完全由主观人为输入驱动。
由于病理情况各异,每个用户的听觉设备最佳设置也不同。同时,听力偏好也可能存在差异。为特定用户定制的设备设置只能由用户进行评估。当使用优化算法进行适配时,这种情况给系统定量评估算法产生的适配参数的适用性带来了困难。在本研究中,使用噪声带声码器对语音进行失真处理,生成了一个人工听力环境。通过测量语音识别并将性能与语音失真最小的最佳声码器条件进行比较,可以客观地评估GA针对此听力问题产生的设置。九名听力正常的受试者参与了该研究。要优化的参数有声码器通道数量、输入频率范围与合成频率范围之间的偏移,以及输入频率范围相对于合成频率范围的压缩扩展。受试者听取用声码器处理的句子对,并对可懂度更高的句子表达偏好。GA根据受试者的偏好迭代修改解决方案。当用户在连续三个步骤中将同一组参数列为最佳时,程序收敛。通过测量语音可懂度分析GA产生的结果的质量,通过对每个受试者运行GA三次分析重测可靠性,并分析收敛特性。
最佳声码器解决方案和GA产生的解决方案的受试者平均语音识别分数相似。平均迭代次数为8次,平均收敛时间为25.5分钟。同一受试者不同GA运行产生的设置略有不同;然而,用这些设置测量的语音识别分数相似。受试者的个体数据表明,在每次运行中,少数GA解决方案产生的语音可懂度比最佳设置差。这可能是GA固有随机性、本研究中使用的收敛标准以及用户在配对比较过程中可能出现的错误共同作用的结果。另一方面,与其他两个因素相比,这些错误的影响可能较小,因为主观偏好与客观测量之间的比较表明,对于许多受试者来说,两者吻合度良好。
结果表明,GA能够通过使用听众偏好,在相对较短的时间内产生良好的解决方案。对于实际应用,可以通过运行GA两次或不使用自动停止标准使程序更稳健,并且可以通过优化每次迭代中完成的配对比较数量来加快速度。