Özbay Erdal, Özbay Feyza Altunbey, Khodadadi Nima, Gharehchopogh Farhad Soleimanian, Mirjalili Seyedali
Department of Computer Engineering, Firat University, Elazig, Turkey.
Department of Software Engineering, Firat University, Elazig, Turkey.
J Voice. 2024 Sep 7. doi: 10.1016/j.jvoice.2024.08.018.
Voice pathologies occur due to various factors, such as malfunction of the vocal cords. Computerized acoustic examination-based vocal pathology detection is crucial for early diagnosis, efficient follow-up, and improving problematic speech. Different acoustic measurements provide it. Executing this process requires expert monitoring and is not preferred by patients because it is time-consuming and costly. This paper is aimed at detecting metaheuristic-based automatic voice pathology. First, feature maps of 10 common diseases, including cordectomy, dysphonia, front lateral partial resection, contact pachyderma, laryngitis, lukoplakia, pure breath, recurrent laryngeal paralysis, vocal fold polyp, and vox senilis, were obtained from the Zero-Crossing Rate, Root-Mean-Square Energy, and Mel-frequency Cepstral Coefficients using a thousand voice signals from the Saarbruecken Voice Database dataset. Hybridizations of different features obtained from the voices of the same diseases using these three methods were used to increase the model's performance. The Grey Wolf Optimizer (MELGWO) algorithm based on local search, evolutionary operator, and concatenated feature maps derived from various approaches was employed to minimize the number of features, implement the models faster, and produce the best result. The fitness values of the metaheuristic algorithms were then determined using supervised machine learning techniques such as Support Vector Machine (SVM) and K-nearest neighbors. The F1 score, sensitivity, specificity, accuracy, and other assessment criteria were compared with the experimental data. The best accuracy result was achieved with 99.50% from the SVM classifier using the feature maps optimized by the improved MELGWO algorithms.
语音病理学是由多种因素引起的,比如声带功能失调。基于计算机声学检查的语音病理学检测对于早期诊断、有效随访以及改善有问题的语音至关重要。不同的声学测量方法可以提供相关检测。执行这个过程需要专家监测,而且患者不太喜欢,因为它既耗时又昂贵。本文旨在检测基于元启发式算法的自动语音病理学。首先,使用来自萨尔布吕肯语音数据库数据集中的一千个语音信号,从过零率、均方根能量和梅尔频率倒谱系数中获取了10种常见疾病的特征图,这些疾病包括声带切除术、发音障碍、前外侧部分切除术、接触性厚皮病、喉炎、白斑病、单纯呼吸音、喉返神经麻痹、声带息肉和老年嗓音。使用这三种方法从相同疾病的语音中获得的不同特征的杂交用于提高模型的性能。采用基于局部搜索、进化算子以及从各种方法导出的级联特征图的灰狼优化器(MELGWO)算法来减少特征数量,更快地实现模型,并产生最佳结果。然后使用支持向量机(SVM)和K近邻等监督机器学习技术确定元启发式算法的适应度值。将F1分数、灵敏度、特异性、准确率和其他评估标准与实验数据进行比较。使用改进的MELGWO算法优化的特征图,通过SVM分类器获得了99.50%的最佳准确率结果。