Mohamad Noordin Noorliza, Tan Joon Liang, Chong Chee Kheong, Chem Yu Kie, Tajudin Norazimah, Abu Bakar Rehan Shuhada, Sengol Selvanesan, Phoon Hannah Yik Phing, Che Azid Nurul Aina Murni, W Mohd Arifin W Nur Afiza, Aziz Zirwatul Adilah, Hussin Hani, Ibrahim Nurul Syahida, Omar Aziyati, Ravi Ushananthiny, Kamarul Zaman Kamal Hisham, Yamin Mohd Asri, Ngeow Yun Fong
National Public Health Laboratory, Ministry of Health Malaysia, Sungai Buloh, Selangor, Malaysia.
Faculty of Information Science and Technology, Multimedia University, Melaka, Melaka, Malaysia.
PeerJ. 2021 Nov 3;9:e12449. doi: 10.7717/peerj.12449. eCollection 2021.
More than a year after its first appearance in December 2019, the COVID-19 pandemic is still on a rampage in many parts of the world. Although several vaccines have been approved for emergency use, the emergence and rapid spread of new SARS-CoV-2 variants have sparked fears of vaccine failure due to immune evasion. Massive viral genome sequencing has been recommended to track the genetic changes that could lead to adverse consequences.
We sequenced SARS-CoV-2 respiratory isolates from the National Public Health Laboratory, Malaysia and examined them together with viral genomes deposited in GISAID by other Malaysian researchers, to understand the evolutionary trend of the virus circulating in the country. We studied the distribution of virus lineages and site-wise mutations, analysed genetic clustering with the goeBURST full Minimum Spanning Tree algorithm, examined the trend of viral nucleotide diversity over time and performed nucleotide substitution association analyses.
We identified 22 sub-lineages, 13 clonal complexes, 178 sequence types and seven sites of linkage disequilibrium in 277 SARS-CoV-2 genomes sequenced between January and December 2020. B.1.524 was the largest lineage group. The number of mutations per genome ranged from 0 to 19. The mean genomic diversity value over 12 months was 3.26 × 10. Of 359 mutations detected, 60.5% of which were non-synonymous, the most frequent were in the (P4715L), (D614G and A701V) and (S194L) genes.
The SARS-CoV-2 virus accumulated an abundance of mutations in the first year of the COVID-19 pandemic in Malaysia. Its overall genetic diversity, however, is relatively low compared to other Asian countries with larger populations. Continuous genomic and epidemiological surveillance will help to clarify the evolutionary processes determining viral diversity and impacting on human health.
自2019年12月首次出现一年多后,新型冠状病毒肺炎(COVID-19)大流行仍在世界许多地区肆虐。尽管几种疫苗已获批紧急使用,但新型严重急性呼吸综合征冠状病毒2(SARS-CoV-2)变体的出现和快速传播引发了对因免疫逃逸导致疫苗失效的担忧。已建议进行大规模病毒基因组测序,以追踪可能导致不良后果的基因变化。
我们对马来西亚国家公共卫生实验室的SARS-CoV-2呼吸道分离株进行了测序,并将其与其他马来西亚研究人员存入全球共享流感数据倡议组织(GISAID)的病毒基因组一起进行分析,以了解该国流行的病毒的进化趋势。我们研究了病毒谱系和位点特异性突变的分布,使用goeBURST完全最小生成树算法分析了基因聚类,研究了病毒核苷酸多样性随时间的变化趋势,并进行了核苷酸替代关联分析。
在2020年1月至12月测序的277个SARS-CoV-2基因组中,我们鉴定出22个亚谱系、13个克隆复合体、178个序列类型和7个连锁不平衡位点。B.1.524是最大的谱系组。每个基因组的突变数范围为0至19。12个月内的平均基因组多样性值为3.26×10。在检测到的359个突变中,60.5%为非同义突变,最常见的突变位于(P4715L)、(D614G和A701V)和(S194L)基因。
在马来西亚COVID-19大流行的第一年,SARS-CoV-2病毒积累了大量突变。然而,与其他人口较多的亚洲国家相比,其总体遗传多样性相对较低。持续的基因组和流行病学监测将有助于阐明决定病毒多样性并影响人类健康的进化过程。