A novel statistical method predicts mutability of the genomic segments of the SARS-CoV-2 virus.

Author information

Darooneh Amir Hossein, Przedborski Michelle, Kohandel Mohammad

Affiliations

Department of Applied Mathematics, University of Waterloo, Waterloo, ON, Canada.

Publication information

QRB Discov. 2021 Dec 13;3:e1. doi: 10.1017/qrd.2021.13. eCollection 2022.

DOI:10.1017/qrd.2021.13

PMID:35106478

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8795775/

Abstract

The SARS-CoV-2 virus has caused the largest pandemic of the 21st century, with hundreds of millions of cases and millions of fatalities. Scientists all around the world are racing to develop vaccines and new pharmaceuticals to overcome the pandemic and offer effective treatments for COVID-19. Consequently, there is an essential need to better understand how the pathogenesis of SARS-CoV-2 is affected by viral mutations and to determine the conserved segments in the viral genome that can serve as stable targets for novel therapeutics. Here, we introduce a text-mining method to estimate the mutability of genomic segments directly from a reference (ancestral) whole genome sequence. The method relies on calculating the importance of genomic segments based on their spatial distribution and frequency over the whole genome. To validate our approach, we perform a large-scale analysis of the viral mutations in nearly 80,000 publicly available SARS-CoV-2 predecessor whole genome sequences and show that the observed mutation patterns are highly correlated with the segments predicted by the statistical method used for keyword detection. Importantly, these correlations are found to hold at the codon and gene levels, as well as for gene coding regions. Using the text-mining method, we further identify codon sequences that are potential candidates for siRNA-based antiviral drugs. Significantly, one of the candidates identified in this work corresponds to the first seven codons of an epitope of the spike glycoprotein, which is the only SARS-CoV-2 immunogenic peptide without a match to a human protein.
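
To make the idea of scoring segments by their spatial distribution more concrete, the following is a minimal sketch of one well-known keyword-detection statistic: the spread of the gaps between successive occurrences of a word, divided by the mean gap. It is an illustrative assumption, not necessarily the importance measure the authors use; the abstract does not specify the formula, and this sketch ignores the frequency component it mentions. The function names `codon_positions`, `clustering_score`, and `rank_segments`, the choice of codons in reading frame 0 as "words", and the toy sequence are all hypothetical.

```python
from collections import defaultdict
from statistics import mean, pstdev


def codon_positions(genome: str) -> dict[str, list[int]]:
    """Map each codon to the codon indices where it occurs (reading frame 0)."""
    positions = defaultdict(list)
    usable = len(genome) - len(genome) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        positions[genome[i:i + 3]].append(i // 3)
    return dict(positions)


def clustering_score(occurrences: list[int]) -> float:
    """Standard deviation of inter-occurrence gaps divided by the mean gap.

    Evenly spaced segments score 0; segments that cluster score higher.
    """
    if len(occurrences) < 3:
        return 0.0  # fewer than two gaps: no meaningful spread (assumed convention)
    gaps = [b - a for a, b in zip(occurrences, occurrences[1:])]
    return pstdev(gaps) / mean(gaps)


def rank_segments(genome: str) -> list[tuple[str, float]]:
    """Rank codons from most to least clustered along the sequence."""
    scores = {c: clustering_score(p) for c, p in codon_positions(genome).items()}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Toy sequence (not real viral data): TTT is clustered, ACG is evenly spaced.
toy = "TTT" * 3 + "ACG" * 6 + "TTT"
ranking = rank_segments(toy)
```

In this toy input the clustered codon `TTT` ranks above the evenly spaced `ACG`. Whether a high or low score corresponds to higher mutability is a question the paper addresses with real data; the sketch makes no claim about it.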


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/06b1/10392689/c540e4c0f730/S2633289221000132_fig1.jpg



