Lu Jie, Lu Xinhao, Wang Yixiao, Zhang Hengdong, Han Lei, Zhu Baoli, Wang Boshen
Key Laboratory of Environmental Medicine Engineering of Ministry of Education, Southeast University, Nanjing, 210009, China.
School of Cyber Science and Engineering, Southeast University, Nanjing, 211189, China.
Sci Rep. 2025 May 2;15(1):15361. doi: 10.1038/s41598-025-00050-1.
To compare the comprehensive performance of conventional logistic regression (LR) and seven machine learning (ML) algorithms in Noise-Induced Hearing Loss (NIHL) prediction, and to investigate the single nucleotide polymorphism (SNP) loci significantly associated with the occurrence and progression of NIHL. A total of 1,338 noise-exposed workers from 52 enterprises in Jiangsu Province were included in this study. 88 SNP loci involving multiple genes related to noise exposure and hearing loss were detected. LR and multiple ML algorithms were employed to establish the NIHL prediction model with accuracy, recall, precision, F-score, R and AUC as performance indicators. Compared to conventional LR, the evaluated ML models Generalized Regression Neural Network (GRNN), Probabilistic Neural Network (PNN), Genetic Algorithm-Random Forests (GA-RF) demonstrate superior performance and were considered to be the optimal models for processing large-scale SNP loci dataset. The SNP loci screened by these models are pivotal in the process of NIHL prediction, which further improves the prediction accuracy of the model. These findings open new possibilities for accurate prediction of NIHL based on SNP locus screening in the future, and provide a more scientific basis for decision-making in occupational health management.
为比较传统逻辑回归(LR)和七种机器学习(ML)算法在噪声性听力损失(NIHL)预测中的综合性能,并研究与NIHL发生和进展显著相关的单核苷酸多态性(SNP)位点。本研究纳入了江苏省52家企业的1338名噪声暴露工人。检测了涉及多个与噪声暴露和听力损失相关基因的88个SNP位点。采用LR和多种ML算法建立以准确性、召回率、精确率、F分数、R和AUC作为性能指标的NIHL预测模型。与传统LR相比,评估的ML模型广义回归神经网络(GRNN)、概率神经网络(PNN)、遗传算法-随机森林(GA-RF)表现出优越的性能,被认为是处理大规模SNP位点数据集的最佳模型。这些模型筛选出的SNP位点在NIHL预测过程中至关重要,进一步提高了模型的预测准确性。这些发现为未来基于SNP位点筛选准确预测NIHL开辟了新的可能性,并为职业健康管理决策提供了更科学的依据。