IEEE Trans Biomed Eng. 2021 Dec;68(12):3725-3736. doi: 10.1109/TBME.2021.3085576. Epub 2021 Nov 19.
In a few patients with mild COVID-19, there is a possibility of the infection becoming severe or critical in the future. This work aims to identify high-risk patients who have a high probability of changing from mild to critical COVID-19 (only account for 5% of cases).
Using traditional convolutional neural networks for classification may not be suitable to identify this 5% of high risk patients from an entire dataset due to the highly imbalanced label distribution. To address this problem, we propose a Mix Contrast model, which matches original features with mixed features for contrastive learning. Three modules are proposed for training the model: 1) a cumulative learning strategy for synthesizing the mixed feature; 2) a commutative feature combination module for learning the commutative law of feature concatenation; 3) a united pairwise loss assigning adaptive weights for sample pairs with different class anchors based on their current optimization status.
We collect a multi-center computed tomography dataset including 918 confirmed COVID-19 patients from four hospitals and evaluate the proposed method on both the COVID-19 mild-to-critical prediction and COVID-19 diagnosis tasks. For mild-to-critical prediction, the experimental results show a recall of 0.80 and a specificity of 0.815. For diagnosis, the model shows comparable results with deep neural networks using a large dataset. Our method demonstrates improvements when the amount of training data is small or imbalanced.
Identifying mild-to-critical COVID-19 patients is important for early prevention and personalized treatment planning.
在少数轻症 COVID-19 患者中,未来感染有发展为重症或危重症的可能。本研究旨在识别出有较高概率从轻症 COVID-19 转变为重症 COVID-19 的高危患者(仅占病例的 5%)。
由于标签分布高度不平衡,使用传统的卷积神经网络进行分类可能无法从整个数据集中识别出这 5%的高危患者。为了解决这个问题,我们提出了一种混合对比模型,该模型使用原始特征和混合特征进行对比学习。该模型提出了三个模块进行训练:1)累积学习策略,用于合成混合特征;2)可交换特征组合模块,用于学习特征连接的可交换律;3)联合对损失分配,根据当前优化状态,为具有不同类别的锚点的样本对分配自适应权重。
我们收集了来自四家医院的包含 918 名确诊 COVID-19 患者的多中心 CT 数据集,并在 COVID-19 轻症至重症预测和 COVID-19 诊断任务上对所提出的方法进行了评估。对于轻症至重症预测,实验结果显示召回率为 0.80,特异性为 0.815。对于诊断,该模型在使用大型数据集时与深度神经网络表现相当。当训练数据量较少或不平衡时,我们的方法显示出了改进。
识别轻症至重症 COVID-19 患者对于早期预防和个性化治疗计划非常重要。