Aziz Kamran, Chen Feng, Ahmad Mahmood, Khan Muhammad Salman, Sabri Sabri Mohanad Muayad, Almujibah Hamad
The Key Laboratory of Road and Traffic Engineering, Ministry of Education, Tongji University, 4800 Cao'an Road, Jiading, Shanghai, 201804, China.
Institute of Energy Infrastructure, Universiti Tenaga Nasional, Kajang, 43000, Malaysia.
Sci Rep. 2025 Jul 9;15(1):24666. doi: 10.1038/s41598-025-08935-x.
Accurate prediction of crash injury severity and understanding the seriousness of multi-classification injuries is vital for informing authorities and the public. This Knowledge is crucial for enhancing road safety and reducing congestion, as different levels of injury necessitate distinct interventions, policies and responses to support sustainable transportation. Existing ML techniques often face class imbalance issues, resulting in suboptimal performance. Multi-class imbalance, more challenging than two-class imbalance, is frequently overlooked in traffic risk assessments. To accurately estimate the multi-class accident injuries and comprehend their severity we proposed a novel method called Bayesian Optimized Dynamic Ensemble Selection for Multi-Class Imbalance (DES-MI) with Ensemble Imbalance Learning (EIL), which involves; generating a pool of base classifiers with EIL methods and utilizing DES-MI to choose suitable classifiers. Utilizing homogeneous and heterogeneous pools of EIL classifiers, our findings demonstrate that DES-MI with EIL considerably enhances classification performance for datasets with multi-class imbalances. DES-MI with Heterogeneous EIL outperformed in performances while DES-MI with BRF achieved notable results in homogeneous ensembles. Important variables including road user gender, occupant age, month, airbag deployment, and road profile are also identified using SHAP for interpretability. Our DES-MI model with EIL classifiers and SHAP, by addressing multi-class imbalance, offers insightful information to stakeholders in road traffic safety by supporting the development of safer, efficient and sustainable urban road transport systems.
准确预测碰撞伤害严重程度并了解多分类伤害的严重性对于告知当局和公众至关重要。这些知识对于提高道路安全和减少拥堵至关重要,因为不同程度的伤害需要不同的干预措施、政策和应对措施来支持可持续交通。现有的机器学习技术经常面临类别不平衡问题,导致性能次优。多类别不平衡比二类别不平衡更具挑战性,在交通风险评估中经常被忽视。为了准确估计多类别事故伤害并了解其严重程度,我们提出了一种名为贝叶斯优化动态集成选择多类别不平衡(DES-MI)与集成不平衡学习(EIL)的新方法,该方法包括:使用EIL方法生成一组基础分类器,并利用DES-MI选择合适的分类器。利用EIL分类器的同构和异构池,我们的研究结果表明,带有EIL的DES-MI显著提高了多类别不平衡数据集的分类性能。带有异构EIL的DES-MI在性能上表现更优,而带有BRF的DES-MI在同构集成中取得了显著成果。还使用SHAP识别了包括道路使用者性别、乘客年龄、月份、安全气囊展开和道路轮廓等重要变量,以实现可解释性。我们带有EIL分类器和SHAP的DES-MI模型通过解决多类别不平衡问题,为道路交通安全的利益相关者提供了有价值的信息,支持更安全、高效和可持续的城市道路运输系统的发展。