Soladoye Afeez A, Olawade David B, Adeyanju Ibrahim A, Akpa Onoja M, Aderinto Nicholas, Owolabi Mayowa O
Department of Computer Engineering, Federal University, Oye, Ekiti, Nigeria.
Department of Allied and Public Health, School of Health, Sport and Bioscience, University of East London, London, United Kingdom; Department of Research and Innovation, Medway NHS Foundation Trust, Gillingham ME7 5NY, United Kingdom; Department of Public Health, York St John University, London, United Kingdom; School of Health and Care Management, Arden University, Arden House, Middlemarch Park, Coventry, CV3 4FJ.
Clin Neurol Neurosurg. 2025 Feb;249:108761. doi: 10.1016/j.clineuro.2025.108761. Epub 2025 Jan 27.
Stroke remains a leading cause of death and disability worldwide, with African populations bearing a disproportionately high burden due to limited healthcare infrastructure. Early prediction and intervention are critical to reducing stroke outcomes. This study developed and evaluated a stroke prediction system using Gated Recurrent Units (GRU), a variant of Recurrent Neural Networks (RNN), leveraging the Afrocentric Stroke Investigative Research and Education Network (SIREN) dataset.
The study utilized secondary data from the SIREN dataset, comprising 4236 records with 29 phenotypes. Feature selection reduced these to 15 optimal phenotypes based on their significance to stroke occurrence. The GRU model, designed with 128 input neurons and four hidden layers (64, 32, 16, and 8 neurons), was trained and evaluated using 150 epochs, a batch size of 8, and metrics such as accuracy, AUC, and prediction time. Comparisons were made with traditional machine learning algorithms (Logistic Regression, SVM, KNN) and Long Short-Term Memory (LSTM) networks.
The GRU-based system achieved a performance accuracy of 77.48 %, an AUC of 0.84, and a prediction time of 0.43 seconds, outperforming all other models. Logistic Regression achieved 73.58 %, while LSTM reached 74.88 % but with a longer prediction time of 2.23 seconds. Feature selection significantly improved the model's performance compared to using all 29 phenotypes.
The GRU-based system demonstrated superior performance in stroke prediction, offering an efficient and scalable tool for healthcare. Future research should focus on integrating unstructured data, validating the model on diverse populations, and exploring hybrid architectures to enhance predictive accuracy.
中风仍然是全球死亡和残疾的主要原因,由于医疗保健基础设施有限,非洲人口承受着不成比例的高负担。早期预测和干预对于改善中风预后至关重要。本研究利用非裔中心中风调查研究与教育网络(SIREN)数据集,开发并评估了一种使用门控循环单元(GRU)(循环神经网络(RNN)的一种变体)的中风预测系统。
该研究使用了SIREN数据集中的二次数据,包括4236条记录和29种表型。基于对中风发生的重要性,特征选择将这些表型减少到15种最优表型。设计了具有128个输入神经元和四个隐藏层(64、32、16和8个神经元)的GRU模型,并使用150个轮次、8的批量大小以及诸如准确率、AUC和预测时间等指标进行训练和评估。与传统机器学习算法(逻辑回归、支持向量机、K近邻)和长短期记忆(LSTM)网络进行了比较。
基于GRU的系统实现了77.48%的性能准确率、0.84的AUC和0.43秒的预测时间,优于所有其他模型。逻辑回归达到73.58%,而LSTM达到74.88%,但预测时间更长,为2.23秒。与使用所有29种表型相比,特征选择显著提高了模型的性能。
基于GRU的系统在中风预测中表现出卓越性能,为医疗保健提供了一种高效且可扩展的工具。未来的研究应专注于整合非结构化数据、在不同人群中验证模型以及探索混合架构以提高预测准确性。