Cui Wanting, Robins Daniel, Finkelstein Joseph
Icahn School of Medicine at Mount Sinai, New York, NY, USA.
Stud Health Technol Inform. 2020 Jun 26;272:1-4. doi: 10.3233/SHTI200478.
The goal of this paper was to apply unsupervised machine learning techniques towards the discovery of latent clusters in COVID-19 patients. Over 6,000 adult patients tested positive for the SARS-CoV-2 infection at the Mount Sinai Health System in New York, USA met the inclusion criteria for analysis. Patients' diagnoses were mapped onto chronicity and one of the 18 body systems, and the optimal number of clusters was determined using K-means algorithm and the elbow method. 4 clusters were identified; the most frequently associated comorbidities involved infectious, respiratory, cardiovascular, endocrine, and genitourinary disorders, as well as socioeconomic factors that influence health status and contact with health services. These results offer a strong direction for future research and more granular analysis.
本文的目的是应用无监督机器学习技术来发现新冠肺炎患者中的潜在聚类。在美国纽约西奈山医疗系统中,超过6000名成年患者的新冠病毒检测呈阳性,符合纳入分析的标准。患者的诊断被映射到慢性病和18个身体系统之一,并使用K均值算法和肘部方法确定聚类的最佳数量。确定了4个聚类;最常相关的合并症包括感染性、呼吸、心血管、内分泌和泌尿生殖系统疾病,以及影响健康状况和与医疗服务接触的社会经济因素。这些结果为未来的研究和更细致的分析提供了有力的方向。