Hernández-García Estefanía, Moro-Velázquez Laureano, González-Herranz Ramón, Godino-Llorente Juan Ignacio, Plaza Guillermo
Department of Otorhinolaryngology, Hospital Universitario de Fuenlabrada, Universidad Rey Juan Carlos, Madrid, Spain.
Universidad Politécnica de Madrid, Madrid, Spain; Center for Language and Speech Processing, Johns Hopkins University, Baltimore, Maryland.
J Voice. 2020 Jul;34(4):650.e1-650.e6. doi: 10.1016/j.jvoice.2019.02.012. Epub 2019 Mar 8.
Functional Endoscopic Sinus Surgery (FESS) is the surgery of choice for nasal polyposis and chronic rhinosinusitis. The aim of our study is to assess the influence of this surgery in the acoustic parameters of voice, and their implications in the systems of identification or verification of the speaker through the speech.
A prospective study was performed between January 2017 and June 2017 including two groups of patients: those undergoing FESS, and a control group. Demographic data and GRBAS assessment were statistically analyzed. In addition, a recording of patients' voices was made with a subsequent acoustic analysis and automatic identification of the speaker through machine learning systems, establishing the equal error rate. Samples were taken before surgery, 2 weeks after surgery and 3 months later.
After FESS, a significant difference was observed in Grade, Roughness, Breathiness, Asthenia, Strain (GRBAS). Besides, acoustic analysis showed a significance decrease in fundamental frequency (F0), when compared with the control group. For the automatic identification of the speaker through computer systems, we found that the equal error rate is higher in the FESS group.
Results suggest that FESS produce a decrease of F0 and changes in the vocal tract that derive in an increase in the error of recognition of the speaker in FESS patients.
功能性鼻内镜鼻窦手术(FESS)是鼻息肉和慢性鼻窦炎的首选手术。我们研究的目的是评估该手术对嗓音声学参数的影响,以及其对通过语音进行说话者识别或验证系统的影响。
2017年1月至2017年6月进行了一项前瞻性研究,包括两组患者:接受FESS手术的患者和一个对照组。对人口统计学数据和GRBAS评估进行了统计分析。此外,对患者的嗓音进行了录音,随后进行声学分析,并通过机器学习系统自动识别说话者,确定等错误率。在手术前、手术后2周和3个月后采集样本。
FESS手术后,在分级、粗糙度、气息声、无力、紧张(GRBAS)方面观察到显著差异。此外,声学分析显示,与对照组相比,基频(F0)显著降低。对于通过计算机系统自动识别说话者,我们发现FESS组的等错误率更高。
结果表明,FESS导致F0降低以及声道变化,这导致FESS患者说话者识别错误增加。