Risk Management Unit, Fondazione Policlinico Universitario Agostino Gemelli IRCCS, Rome, Italy.
Eur Rev Med Pharmacol Sci. 2021 Mar;25(6):2785-2794. doi: 10.26355/eurrev_202103_25440.
To develop a deep learning-based decision tree for the primary care setting, to stratify adult patients with confirmed and unconfirmed coronavirus disease 2019 (COVID-19), and to predict the need for hospitalization or home monitoring.
We performed a retrospective cohort study on data from patients admitted to a COVID hospital in Rome, Italy, between 5 March 2020 and 5 June 2020. A confirmed case was defined as a patient with a positive nasopharyngeal RT-PCR test result, while an unconfirmed case had negative results on repeated swabs. Patients' medical history and clinical, laboratory and radiological findings were collected, and the dataset was used to train a predictive model for COVID-19 severity.
Data of 198 patients were included in the study. Twenty-eight (14.14%) had mild disease, 62 (31.31%) had moderate disease, 64 (32.32%) had severe disease, and 44 (22.22%) had critical disease. The G2 value assessed the contribution of each collected value to decision tree building. On this basis, SpO2 (%) with a cut point at 92 was chosen for the optimal first split. Therefore, the decision tree was built using values maximizing G2 and LogWorth. After the tree was built, the correspondence between inputs and outcomes was validated.
We developed a machine learning-based tool that is easy to understand and apply. It provides good discrimination in stratifying confirmed and unconfirmed COVID-19 patients with different prognoses in every context. Our tool might allow general practitioners visiting patients at home to decide whether the patient needs to be hospitalized.
开发一种基于深度学习的决策树,用于初级保健环境,对确诊和未确诊的 2019 冠状病毒病(COVID-19)成年患者进行分层,并预测住院或家庭监测的需求。
我们对 2020 年 3 月 5 日至 6 月 5 日期间在意大利罗马一家 COVID 医院住院的患者进行了回顾性队列研究。确诊病例定义为鼻咽 RT-PCR 检测结果阳性的患者,而未确诊病例则为重复拭子检测结果阴性的患者。收集了患者的病史和临床、实验室及影像学发现,并使用数据集训练了 COVID-19 严重程度的预测模型。
本研究纳入了 198 例患者的数据。28 例(14.14%)为轻症,62 例(31.31%)为中度,64 例(32.32%)为重症,44 例(22.22%)为危重症。G2 值评估了每个收集值对决策树构建的贡献。在此基础上,选择 SpO2(%)作为最佳的第一次分割的截断点为 92。因此,决策树是使用最大化 G2 和 LogWorth 的值构建的。构建树后,验证了输入与结果之间的对应关系。
我们开发了一种易于理解和应用的基于机器学习的工具。它在对不同预后的确诊和未确诊 COVID-19 患者进行分层方面具有良好的区分度。我们的工具可能使在家访视患者的全科医生能够决定患者是否需要住院。