Microbiology and Infectology Research Group, Galicia sur Health Research Institute (IIS Galicia Sur), SERGAS-UVIGO, Vigo, Spain.
Microbiology Department, Complexo Hospitalario Universitario de Vigo (CHUVI), Sergas, Vigo, Spain.
Sci Rep. 2023 May 13;13(1):7786. doi: 10.1038/s41598-023-34882-6.
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has been and remains one of the major challenges modern society has faced thus far. Over the past few months, large amounts of information have been collected that are only now beginning to be assimilated. In the present work, the existence of residual information in the massive numbers of rRT-PCRs that tested positive out of the almost half a million tests that were performed during the pandemic is investigated. This residual information is believed to be highly related to a pattern in the number of cycles that are necessary to detect positive samples as such. Thus, a database of more than 20,000 positive samples was collected, and two supervised classification algorithms (a support vector machine and a neural network) were trained to temporally locate each sample based solely and exclusively on the number of cycles determined in the rRT-PCR of each individual. Overall, this study suggests that there is valuable residual information in the rRT-PCR positive samples that can be used to identify patterns in the development of the SARS-CoV-2 pandemic. The successful application of supervised classification algorithms to detect these patterns demonstrates the potential of machine learning techniques to aid in understanding the spread of the virus and its variants.
严重急性呼吸综合征冠状病毒 2(SARS-CoV-2)是迄今为止现代社会面临的主要挑战之一。在过去的几个月里,已经收集了大量的信息,现在才开始被吸收。在目前的工作中,研究了在大流行期间进行的近 50 万次检测中,大量 rRT-PCR 检测呈阳性的检测中存在的残留信息。据信,这种残留信息与检测阳性样本所需的循环次数模式高度相关。因此,收集了超过 20,000 个阳性样本的数据库,并训练了两个监督分类算法(支持向量机和神经网络),仅根据每个个体的 rRT-PCR 中确定的循环次数来临时定位每个样本。总的来说,这项研究表明,rRT-PCR 阳性样本中存在有价值的残留信息,可以用来识别 SARS-CoV-2 大流行发展的模式。监督分类算法成功地应用于检测这些模式,证明了机器学习技术在帮助理解病毒及其变体传播方面的潜力。