Center of Data and Knowledge Integration for Health (CIDACS), Instituto Gonçalo Moniz, Fundação Oswaldo Cruz, Salvador, Bahia, Brazil.
Instituto de Ciências da Saúde, Universidade Federal da Bahia, Salvador, Bahia, Brazil.
Sci Rep. 2021 Mar 24;11(1):6770. doi: 10.1038/s41598-021-86361-5.
Zika virus was responsible for the microcephaly epidemic in Brazil which began in October 2015 and brought great challenges to the scientific community and health professionals in terms of diagnosis and classification. Due to the difficulties in correctly identifying Zika cases, it is necessary to develop an automatic procedure to classify the probability of a CZS case from the clinical data. This work presents a machine learning algorithm capable of achieving this from structured and unstructured available data. The proposed algorithm reached 83% accuracy with textual information in medical records and image reports and 76% accuracy in classifying data without textual information. Therefore, the proposed algorithm has the potential to classify CZS cases in order to clarify the real effects of this epidemic, as well as to contribute to health surveillance in monitoring possible future epidemics.
寨卡病毒是导致巴西 2015 年 10 月开始的小头畸形疫情的罪魁祸首,这给科学界和卫生专业人员在诊断和分类方面带来了巨大挑战。由于正确识别寨卡病例存在困难,因此有必要开发一种自动程序,根据临床数据对寨卡综合征病例的概率进行分类。这项工作提出了一种机器学习算法,能够从结构化和非结构化的可用数据中实现这一目标。该算法在使用医疗记录和图像报告中的文本信息时达到了 83%的准确率,在没有文本信息的情况下分类数据时达到了 76%的准确率。因此,该算法有可能对寨卡综合征病例进行分类,以明确此次疫情的真实影响,并有助于在监测未来可能发生的疫情时进行卫生监测。