Rahman Md Marufur, Kader Shirmin Bintay, Rizvi S M Shahriar
Centre for Medical Biotechnology, Management Information System, Directorate General of Health Services, Mohakhali, Dhaka, 1212, Bangladesh.
International Centre for Diarrhoeal Disease Research, Bangladesh.
Heliyon. 2021 Aug;7(8):e07866. doi: 10.1016/j.heliyon.2021.e07866. Epub 2021 Aug 21.
In a try to understand the pathogenesis, evolution and epidemiology of the SARS-CoV-2 virus, scientists from all over the world are tracking its genomic changes in real-time. Genomic studies can be helpful in understanding the disease dynamics. We have downloaded 324 complete and near complete SARS-CoV-2 genomes submitted in GISAID database from Bangladesh which were isolated between 30 March to 7 September, 2020. We then compared these genomes with Wuhan reference sequence and found 4160 mutation events including 2253 missense single nucleotide variations, 38 deletions and 10 insertions. The C>T nucleotide change was most prevalent (41% of all mutations) possibly due to selective mutation pressure to reduce CpG sites to evade CpG targeted host immune response. The most frequent mutation that occurred in 98% isolates was 3037C>T which is a synonymous change that usually accompanied 3 other mutations that include 241C>T, 14408C>T (P323L in RdRp) and 23403A>G (D614G in spike protein). The P323L was reported to increase mutation rate and D614G is associated with increased viral replication and currently most prevalent variant circulating all over the world. We identified multiple missense mutations in B-cell and T-cell predicted epitope regions and/or PCR target regions (including R203K and G204R that occurred in 86% of the isolates) that may impact immunogenicity and/or RT-PCR based diagnosis. Our analysis revealed 5 large deletion events in ORF7a and ORF8 gene products that may be associated with less severity of the disease and increased viral clearance. Our phylogeny analysis identified most of the isolates belonged to the Nextstrain clade 20B (86%) and GISAID clade GR (88%). Most of our isolates shared common ancestors either directly with European countries or jointly with middle eastern countries as well as Australia and India. Interestingly, the 19B clade (GISAID S clade) was unique to Chittagong, which was originally prevalent in China. This reveals possible multiple introductions of the virus in Bangladesh via different routes. Hence, more genome sequencing and analysis with related clinical data is needed to interpret functional significance and better predict the disease dynamics that may be helpful for policy makers to control the COVID-19 pandemic.
为了了解严重急性呼吸综合征冠状病毒2(SARS-CoV-2)病毒的发病机制、演变和流行病学,世界各地的科学家正在实时追踪其基因组变化。基因组研究有助于理解疾病动态。我们从GISAID数据库下载了2020年3月30日至9月7日期间在孟加拉国分离的324个完整和近乎完整的SARS-CoV-2基因组。然后我们将这些基因组与武汉参考序列进行比较,发现了4160个突变事件,包括2253个错义单核苷酸变异、38个缺失和10个插入。C>T核苷酸变化最为普遍(占所有突变的41%),这可能是由于选择性突变压力导致CpG位点减少,以逃避针对CpG的宿主免疫反应。在98%的分离株中出现的最常见突变是3037C>T,这是一个同义变化,通常伴随着其他3个突变,包括241C>T、14408C>T(RdRp中的P323L)和23403A>G(刺突蛋白中的D614G)。据报道,P323L会增加突变率,D614G与病毒复制增加有关,是目前在全球传播的最普遍变异株。我们在B细胞和T细胞预测表位区域和/或PCR靶区域(包括在86%的分离株中出现的R203K和G204R)中鉴定出多个错义突变,这些突变可能会影响免疫原性和/或基于RT-PCR的诊断。我们的分析揭示了开放阅读框7a和开放阅读框8基因产物中的5个大缺失事件,这些事件可能与疾病严重程度降低和病毒清除增加有关。我们的系统发育分析确定,大多数分离株属于Nextstrain进化枝20B(86%)和GISAID进化枝GR(88%)。我们的大多数分离株要么直接与欧洲国家有共同祖先,要么与中东国家以及澳大利亚和印度有共同祖先。有趣的是,19B进化枝(GISAID S进化枝)是吉大港特有的,最初在中国流行。这揭示了该病毒可能通过不同途径多次传入孟加拉国。因此,需要更多的基因组测序以及与相关临床数据的分析,以解释功能意义并更好地预测疾病动态,这可能有助于政策制定者控制新冠疫情。