Xi Binbin, Chen Zixi, Li Shuhua, Liu Wei, Jiang Dawei, Bai Yunmeng, Qu Yimo, Lon Jerome Rumdon, Huang Lizhen, Du Hongli
School of Biology and Biological Engineering, South China University of Technology, Guangzhou 510006, China.
Comput Struct Biotechnol J. 2021;19:5029-5038. doi: 10.1016/j.csbj.2021.09.002. Epub 2021 Sep 4.
In our previous work, we developed an automated tool, AutoVEM, for real-time monitoring the candidate key mutations and epidemic trends of SARS-CoV-2. In this research, we further developed AutoVEM into AutoVEM2. AutoVEM2 is composed of three modules, including call module, analysis module, and plot module, which can be used modularly or as a whole for any virus, as long as the corresponding reference genome is provided. Therefore, it's much more flexible than AutoVEM. Here, we analyzed three existing viruses by AutoVEM2, including SARS-CoV-2, HBV and HPV-16, to show the functions, effectiveness and flexibility of AutoVEM2. We found that the N501Y locus was almost completely linked to the other 16 loci in SARS-CoV-2 genomes from the UK and Europe. Among the 17 loci, 5 loci were on the S protein and all of the five mutations cause amino acid changes, which may influence the epidemic traits of SARS-CoV-2. And some candidate key mutations of HBV and HPV-16, including T350G of HPV-16 and C659T of HBV, were detected. In brief, we developed a flexible automated tool to analyze candidate key mutations and epidemic trends for any virus, which would become a standard process for virus analysis based on genome sequences in the future.
在我们之前的工作中,我们开发了一种自动化工具AutoVEM,用于实时监测新型冠状病毒(SARS-CoV-2)的候选关键突变和流行趋势。在本研究中,我们将AutoVEM进一步开发为AutoVEM2。AutoVEM2由三个模块组成,包括调用模块、分析模块和绘图模块,只要提供相应的参考基因组,就可以模块化或整体用于任何病毒。因此,它比AutoVEM更加灵活。在这里,我们使用AutoVEM2分析了三种现有的病毒,包括新型冠状病毒、乙肝病毒(HBV)和人乳头瘤病毒16型(HPV-16),以展示AutoVEM2的功能、有效性和灵活性。我们发现,在来自英国和欧洲的新型冠状病毒基因组中,N501Y位点几乎与其他16个位点完全连锁。在这17个位点中,有5个位点位于S蛋白上,所有这五个突变都会导致氨基酸变化,这可能会影响新型冠状病毒的流行特征。并且检测到了乙肝病毒和人乳头瘤病毒16型的一些候选关键突变,包括人乳头瘤病毒16型的T350G和乙肝病毒的C659T。简而言之,我们开发了一种灵活的自动化工具,用于分析任何病毒的候选关键突变和流行趋势,这将在未来成为基于基因组序列的病毒分析的标准流程。