Cao Maocheng, Jin Haochang, Wang Yuxi, Wang Li, Ji Junkai
Shenzhen Fuyong People's Hospital, Shenzhen, China.
National Engineering Laboratory for Big Data System Computing Technology, Shenzhen University, Shenzhen, China.
PLoS One. 2025 Aug 26;20(8):e0330530. doi: 10.1371/journal.pone.0330530. eCollection 2025.
Stroke, a common neurological disorder, is considered one of the leading causes of death and disability worldwide. Stroke prognosis issues involve using clinical characteristics collected from patients presented in tabular form to determine whether they are suitable for thrombolytic therapy. Transformer-based deep learning methods have achieved state-of-the-art performance in various classification tasks, but flaws still exist in dealing with tabular data. These models and algorithms largely tend to overfit and exhibit performance degeneration on small-scale, class-imbalanced datasets. Medical datasets are typically small and imbalanced due to the scarcity of labelled medical data samples. Therefore, this study proposes a novel stroke prognosis prediction model called Strokeformer to address these issues. Specifically, novel intra- and interfeature interaction modules are designed to capture internal and mutual information among individual features for more effective latent representations. In addition, we explore the possibility of performing the training process by pretraining on large-scale, class-balanced datasets and then fine-tuning on small-scale, class-imbalanced downstream datasets. This pretraining and fine-tuning paradigm is dramatically feasible for preventing overfitting. To verify the effectiveness of the proposed model and training method, experiments are conducted on 20 public datasets from OpenML and two private stroke prognosis datasets provided by Shenzhen Fuyong People's Hospital and The Affiliated Taizhou People's Hospital of Nanjing Medical University, China, respectively. The results show that Strokeformer performance significantly outperforms that of other comparison models on the introduced datasets. The principal limitation of the model lies in its lack of interpretability from the clinicians' perspective. Nevertheless, given that the interpretability of deep learning remains an open challenge, the promising empirical results achieved by Strokeformer on real-world stroke prognosis datasets highlight its potential to assist in clinical decision-making.
中风是一种常见的神经系统疾病,被认为是全球死亡和残疾的主要原因之一。中风预后问题涉及使用以表格形式收集的患者临床特征来确定他们是否适合溶栓治疗。基于Transformer的深度学习方法在各种分类任务中取得了领先的性能,但在处理表格数据时仍然存在缺陷。这些模型和算法在很大程度上容易过拟合,并且在小规模、类不平衡的数据集上表现出性能退化。由于标记的医学数据样本稀缺,医学数据集通常较小且不平衡。因此,本研究提出了一种名为Strokeformer的新型中风预后预测模型来解决这些问题。具体来说,设计了新颖的特征内和特征间交互模块,以捕获各个特征之间的内部和相互信息,从而获得更有效的潜在表示。此外,我们探索了通过在大规模、类平衡数据集上进行预训练,然后在小规模、类不平衡的下游数据集上进行微调来执行训练过程的可能性。这种预训练和微调范式对于防止过拟合非常可行。为了验证所提出模型和训练方法的有效性,分别在中国深圳福永人民医院和南京医科大学附属泰州人民医院提供的20个来自OpenML的公共数据集和两个私人中风预后数据集上进行了实验。结果表明,Strokeformer在引入的数据集上的性能明显优于其他比较模型。该模型的主要局限性在于从临床医生的角度来看缺乏可解释性。然而,鉴于深度学习的可解释性仍然是一个开放的挑战,Strokeformer在真实世界中风预后数据集上取得的有希望的实证结果突出了其协助临床决策的潜力。