Institute of Health and Medical Technology, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei, Anhui, China.
University of Science and Technology of China, Hefei, Anhui, China.
J Transl Med. 2024 Nov 22;22(1):1054. doi: 10.1186/s12967-024-05872-7.
The global outbreak of the coronavirus disease 2019 (COVID-19) has been enormously damaging, in which prolonged shedding of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2, previously 2019-nCoV) infection is a challenge in the prevention and treatment of COVID-19. However, there is still incomplete research on the risk factors that affect delayed shedding of SARS-CoV-2.
In a retrospective analysis of 56,878 hospitalized patients in the Fangcang Shelter Hospital (National Convention and Exhibition Center) in Shanghai, China, we compared patients with the duration of SARS-CoV-2 viral shedding > 12 days with those days < 12 days. The results of real-time polymerase chain reaction (RT-PCR) tests determined the duration of viral shedding from the first day of SARS-CoV-2 positivity to the day of SARS-CoV-2 negativity. The extreme gradient boosting (XGBoost) machine learning method was employed to establish a prediction model for prolonged SARS-CoV-2 shedding and analyze significant risk factors. Filtering features retraining and Shapley Additive Explanations (SHAP) techniques were followed to demonstrate and further explain the risk factors for long-term SARS-CoV-2 infection.
We conducted an assessment of ten different features, including vaccination, hypertension, diabetes, admission cycle threshold (Ct) value, cardio-cerebrovascular disease, gender, age, occupation, symptom, and family accompaniment, to determine their impact on the prolonged SARS-CoV-2 shedding. This study involved a large cohort of 56,878 hospitalized patients, and we leveraged the XGBoost algorithm to establish a predictive model based on these features. Upon analysis, six of these ten features were significantly associated with the prolonged SARS-CoV-2 shedding, as determined by both the importance order of the model and our results obtained through model reconstruction. Specifically, vaccination, hypertension, admission Ct value, gender, age, and family accompaniment were identified as the key features associated with prolonged viral shedding.
We developed a predictive model and identified six risk factors associated with prolonged SARS-CoV-2 viral shedding. Our study contributes to identifying and screening individuals with potential long-term SARS-CoV-2 infections. Moreover, our research also provides a reference for future preventive control, optimizing medical resource allocation and guiding epidemiological prevention, and guidelines for personal protection against SARS-CoV-2.
2019 年冠状病毒病(COVID-19)的全球爆发造成了极大的破坏,其中严重急性呼吸综合征冠状病毒 2(SARS-CoV-2,以前称为 2019-nCoV)感染的长时间排毒是 COVID-19 预防和治疗的挑战。然而,关于影响 SARS-CoV-2 延迟排毒的危险因素的研究仍不完整。
我们对中国上海方仓庇护医院(国家会展中心)的 56878 例住院患者进行了回顾性分析,将 SARS-CoV-2 病毒排毒时间超过 12 天的患者与排毒时间<12 天的患者进行比较。实时聚合酶链反应(RT-PCR)检测结果确定了从 SARS-CoV-2 阳性第一天到 SARS-CoV-2 阴性的排毒时间。采用极端梯度增强(XGBoost)机器学习方法建立了 SARS-CoV-2 延长排毒的预测模型,并分析了显著的危险因素。采用特征筛选重训练和 Shapley 加性解释(SHAP)技术,对长期 SARS-CoV-2 感染的危险因素进行了验证和进一步解释。
我们评估了包括接种疫苗、高血压、糖尿病、入院循环阈值(Ct)值、心脑血管疾病、性别、年龄、职业、症状和家庭陪伴在内的 10 种不同特征对 SARS-CoV-2 延长排毒的影响。本研究涉及 56878 例住院患者的大样本量,我们利用 XGBoost 算法基于这些特征建立了预测模型。分析结果表明,在模型的重要性顺序和通过模型重建获得的结果中,这 10 种特征中有 6 种与 SARS-CoV-2 延长排毒显著相关。具体而言,接种疫苗、高血压、入院 Ct 值、性别、年龄和家庭陪伴被确定为与病毒延长排毒相关的关键特征。
我们开发了一个预测模型,并确定了与 SARS-CoV-2 病毒延长排毒相关的 6 个危险因素。我们的研究有助于识别和筛选可能存在 SARS-CoV-2 长期感染的个体。此外,我们的研究还为未来的预防控制、优化医疗资源配置和指导流行病学预防以及个人 SARS-CoV-2 防护提供了参考。