Ogata Yoshiyuki, Kitayama Ruri
Graduate School of Agriculture, Osaka Metropolitan University, Sakai, Osaka, 599-8531, Japan.
BMC Genom Data. 2022 May 4;23(1):34. doi: 10.1186/s12863-022-01052-y.
Over a million genomes and mutational analyses of SARS-CoV-2 are available in public databases, which reveal the phylogenetic tree of the virus. Although these data have enabled scientists to closely track the evolution and transmission dynamics of the virus at global and local scales, the Mu variant, recently identified in infections in South America, shows an unusual combination of mutations, and it is difficult to visualize these atypical characteristics in public databases based on a phylogenetic tree.
The Vcorn SARS-CoV-2 database was constructed to provide information on COVID-19 infections and mutations in the S protein of the virus based on correlation network analysis. A correlation network was constructed using the recall index of one mutation to another mutation. The network includes several network modules in which nodes represent mutations and are tightly connected to each other. Individual network modules contain mutations of single variants, such as the alpha and delta variants. In the network constructed to emphasize mutations of the Mu variant using the database, the mutations were found to be located in multiple network modules, indicating that the mutations of the variant may have originated from multiple variants or be located at a basal position with a high frequency of mutation.
Vcorn SARS-CoV-2 provides information on COVID-19 and S protein mutations of SARS-CoV-2 via correlation network analysis. The network based on the analysis illustrates the unusual S protein mutations of the Mu variant. The database is freely available at http://www.plant.osakafu-u.ac.jp/~kagiana/vcorn/sarscov2/ .
公共数据库中存有超过一百万个严重急性呼吸综合征冠状病毒2(SARS-CoV-2)的基因组和突变分析数据,这些数据揭示了该病毒的系统发育树。尽管这些数据使科学家能够在全球和地方尺度上密切追踪病毒的进化和传播动态,但最近在南美洲感染病例中发现的Mu变体显示出异常的突变组合,并且基于系统发育树在公共数据库中难以直观呈现这些非典型特征。
构建了Vcorn SARS-CoV-2数据库,以基于相关网络分析提供有关新冠病毒感染及该病毒刺突蛋白(S蛋白)突变的信息。利用一个突变与另一个突变的召回指数构建了相关网络。该网络包括几个网络模块,其中节点代表突变且彼此紧密相连。各个网络模块包含单个变体的突变,如α和δ变体。在利用该数据库构建的以强调Mu变体突变为重点的网络中,发现这些突变位于多个网络模块中,这表明该变体的突变可能源自多个变体,或者位于具有高突变频率的基础位置。
Vcorn SARS-CoV-2通过相关网络分析提供有关新冠病毒及SARS-CoV-2的S蛋白突变的信息。基于该分析的网络展示了Mu变体异常的S蛋白突变。该数据库可在http://www.plant.osakafu-u.ac.jp/~kagiana/vcorn/sarscov2/免费获取。