Nagahama Institute of Bio-Science and Technology, Nagahama, Shiga-ken, Japan.
Faculty of Engineering, Niigata University, Niigata-ken, Japan.
PLoS One. 2022 Aug 31;17(8):e0273860. doi: 10.1371/journal.pone.0273860. eCollection 2022.
Among mutations that occur in SARS-CoV-2, efficient identification of mutations advantageous for viral replication and transmission is important to characterize and defeat this rampant virus. Mutations rapidly expanding frequency in a viral population are candidates for advantageous mutations, but neutral mutations hitchhiking with advantageous mutations are also likely to be included. To distinguish these, we focus on mutations that appear to occur independently in different lineages and expand in frequency in a convergent evolutionary manner. Batch-learning SOM (BLSOM) can separate SARS-CoV-2 genome sequences according by lineage from only providing the oligonucleotide composition. Focusing on remarkably expanding 20-mers, each of which is only represented by one copy in the viral genome, allows us to correlate the expanding 20-mers to mutations. Using visualization functions in BLSOM, we can efficiently identify mutations that have expanded remarkably both in the Omicron lineage, which is phylogenetically distinct from other lineages, and in other lineages. Most of these mutations involved changes in amino acids, but there were a few that did not, such as an intergenic mutation.
在 SARS-CoV-2 发生的突变中,有效识别有利于病毒复制和传播的突变对于描述和击败这种猖獗的病毒非常重要。在病毒群体中快速扩大频率的突变是有利突变的候选者,但有利突变的“搭便车”中性突变也很可能被包括在内。为了区分这些突变,我们专注于那些似乎在不同谱系中独立出现并以趋同进化的方式扩大频率的突变。批量学习 SOM(BLSOM)可以根据谱系将 SARS-CoV-2 基因组序列分离,而只需提供寡核苷酸组成。关注显著扩张的 20 个核苷酸,每个核苷酸在病毒基因组中仅代表一个副本,使我们能够将扩张的 20 个核苷酸与突变联系起来。使用 BLSOM 中的可视化功能,我们可以有效地识别出在奥密克戎谱系中显著扩张的突变,该谱系在系统发育上与其他谱系不同,以及在其他谱系中显著扩张的突变。这些突变大多数涉及氨基酸的变化,但也有一些没有,例如基因间突变。