Institute of Molecular Biology and Pathology, CNR c/o Dep. Chemistry, Sapienza University of Rome, Rome, Italy.
Institute of Genetics and Biophysics, CNR, Naples, Italy; Laboratory of Stem Cell Biology and Pharmacology of Neurodegenerative Diseases, Department of Biosciences, University of Milan, Milan, Italy; University of Naples Federico II, Naples, Italy.
Int J Biol Macromol. 2024 May;266(Pt 1):131054. doi: 10.1016/j.ijbiomac.2024.131054. Epub 2024 Mar 24.
The surveillance of COVID-19 pandemic has led to the determination of millions of genome sequences of the SARS-CoV-2 virus, with the accumulation of a wealth of information never collected before for an infectious disease. Exploring the information retrieved from the GISAID database reporting at that time >13 million genome sequences, we classified the 141,639 unique missense mutations detected in the first two-and-a-half years (up to October 2022) of the pandemic. Notably, our analysis indicates that 98.2 % of all possible conservative amino acid replacements occurred. Even non-conservative mutations were highly represented (73.9 %). For a significant number of residues (3 %), all possible replacements with the other nineteen amino acids have been observed. These observations strongly indicate that, in this time interval, the virus explored all possible alternatives in terms of missense mutations for all sites of its polypeptide chain and that those that are not observed severely affect SARS-CoV-2 integrity. The implications of the present findings go well beyond the structural biology of SARS-CoV-2 as the huge amount of information here collected and classified may be valuable for the elucidation of the sequence-structure-function relationships in proteins.
对 COVID-19 大流行的监测导致了数以百万计的 SARS-CoV-2 病毒基因组序列的确定,积累了以前从未为传染病收集过的大量信息。通过探索当时从 GISAID 数据库报告中检索到的信息,我们对大流行头两年半(截至 2022 年 10 月)检测到的 141639 个独特的错义突变进行了分类。值得注意的是,我们的分析表明,所有可能的保守氨基酸替换中有 98.2%发生了。即使是非保守突变也有很高的代表性(73.9%)。对于大量残基(3%),已经观察到所有可能的用其他十九种氨基酸进行替换。这些观察结果强烈表明,在这段时间内,病毒在其多肽链的所有位点上探索了所有可能的错义突变替代方案,而那些未被观察到的严重影响了 SARS-CoV-2 的完整性。目前这些发现的意义远远超出了 SARS-CoV-2 的结构生物学,因为这里收集和分类的大量信息可能对阐明蛋白质中的序列-结构-功能关系具有重要价值。