• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

生命的边缘:病毒RNA中核苷酸序列的分布

On the Verge of Life: Distribution of Nucleotide Sequences in Viral RNAs.

作者信息

Husev Mykola, Rovenchak Andrij

机构信息

Department for Theoretical Physics, Ivan Franko National University of Lviv, 12 Drahomanov St, UA-79005 Lviv, Ukraine.

出版信息

Biosemiotics. 2021;14(2):253-269. doi: 10.1007/s12304-021-09403-5. Epub 2021 Feb 17.

DOI:10.1007/s12304-021-09403-5
PMID:33613787
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7887720/
Abstract

The aim of the study is to analyze viruses using parameters obtained from distributions of nucleotide sequences in the viral RNA. Seeking for the input data homogeneity, we analyze single-stranded RNA viruses only. Two approaches are used to obtain the nucleotide sequences; In the first one, chunks of equal length (four nucleotides) are considered. In the second approach, the whole RNA genome is divided into parts by adenine or the most frequent nucleotide as a "space". Rank-frequency distributions are studied in both cases. The defined nucleotide sequences are signs comparable to a certain extent to syllables or words as seen from the nature of their rank-frequency distributions. Within the first approach, the Pólya and the negative hypergeometric distribution yield the best fit. For the distributions obtained within the second approach, we have calculated a set of parameters, including entropy, mean sequence length, and its dispersion. The calculated parameters became the basis for the classification of viruses. We observed that proximity of viruses on planes spanned on various pairs of parameters corresponds to related species. In certain cases, such a proximity is observed for unrelated species as well calling thus for the expansion of the set of parameters used in the classification. We also observed that the fifth most frequent nucleotide sequences obtained within the second approach are of different nature in case of human coronaviruses (different nucleotides for MERS, SARS-CoV, and SARS-CoV-2 versus identical nucleotides for four other coronaviruses). We expect that our findings will be useful as a supplementary tool in the classification of diseases caused by RNA viruses with respect to severity and contagiousness.

摘要

本研究的目的是利用从病毒RNA核苷酸序列分布中获得的参数来分析病毒。为了寻求输入数据的同质性,我们仅分析单链RNA病毒。使用两种方法来获取核苷酸序列;在第一种方法中,考虑等长的片段(四个核苷酸)。在第二种方法中,整个RNA基因组以腺嘌呤或最常见的核苷酸作为“间隔”被分成多个部分。在这两种情况下都研究了秩频分布。从其秩频分布的性质来看,所定义的核苷酸序列在一定程度上类似于音节或单词。在第一种方法中,波利亚分布和负超几何分布拟合效果最佳。对于在第二种方法中获得的分布,我们计算了一组参数,包括熵、平均序列长度及其离散度。计算出的参数成为病毒分类的基础。我们观察到,在由各种参数对构成的平面上病毒的接近程度对应于相关物种。在某些情况下,不相关物种之间也观察到这种接近程度,因此需要扩展分类中使用的参数集。我们还观察到,在第二种方法中获得的第五个最常见的核苷酸序列在人类冠状病毒的情况下具有不同的性质(中东呼吸综合征冠状病毒、严重急性呼吸综合征冠状病毒和严重急性呼吸综合征冠状病毒2的核苷酸不同,而其他四种冠状病毒的核苷酸相同)。我们期望我们的发现将作为一种补充工具,用于根据严重程度和传染性对由RNA病毒引起的疾病进行分类。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/34c2/7887720/118d704b0e39/12304_2021_9403_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/34c2/7887720/463c30199c49/12304_2021_9403_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/34c2/7887720/0335e9576268/12304_2021_9403_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/34c2/7887720/cf68893ec7f7/12304_2021_9403_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/34c2/7887720/8299a975d69a/12304_2021_9403_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/34c2/7887720/0929e69a0d11/12304_2021_9403_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/34c2/7887720/ed0d57ff2781/12304_2021_9403_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/34c2/7887720/118d704b0e39/12304_2021_9403_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/34c2/7887720/463c30199c49/12304_2021_9403_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/34c2/7887720/0335e9576268/12304_2021_9403_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/34c2/7887720/cf68893ec7f7/12304_2021_9403_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/34c2/7887720/8299a975d69a/12304_2021_9403_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/34c2/7887720/0929e69a0d11/12304_2021_9403_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/34c2/7887720/ed0d57ff2781/12304_2021_9403_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/34c2/7887720/118d704b0e39/12304_2021_9403_Fig7_HTML.jpg

相似文献

1
On the Verge of Life: Distribution of Nucleotide Sequences in Viral RNAs.生命的边缘:病毒RNA中核苷酸序列的分布
Biosemiotics. 2021;14(2):253-269. doi: 10.1007/s12304-021-09403-5. Epub 2021 Feb 17.
2
[Analysis of Nucleotide Changes in RT-PCR Primer/Probe Binding Regions in SARS-CoV-2 Isolates Reported from Turkey].[土耳其报告的严重急性呼吸综合征冠状病毒2(SARS-CoV-2)分离株逆转录聚合酶链反应(RT-PCR)引物/探针结合区域的核苷酸变化分析]
Mikrobiyol Bul. 2021 Jul;55(3):311-326. doi: 10.5578/mb.20219803.
3
Cluster Analysis of Coronavirus Sequences using Computational Sequence Descriptors: With Applications to SARS, MERS and SARS-CoV-2 (CoVID-19).冠状病毒序列的计算序列描述聚类分析:在 SARS、MERS 和 SARS-CoV-2(CoVID-19)中的应用。
Curr Comput Aided Drug Des. 2021;17(7):936-945. doi: 10.2174/1573409917666210202092646.
4
Evaluation on the use of Nanopore sequencing for direct characterization of coronaviruses from respiratory specimens, and a study on emerging missense mutations in partial RdRP gene of SARS-CoV-2.评估 Nanopore 测序在直接检测呼吸道样本中的冠状病毒中的应用,以及对 SARS-CoV-2 部分 RdRP 基因中新兴错义突变的研究。
Virol J. 2020 Nov 23;17(1):183. doi: 10.1186/s12985-020-01454-3.
5
Properties of Coronavirus and SARS-CoV-2.冠状病毒及新型冠状病毒(SARS-CoV-2)的特性
Malays J Pathol. 2020 Apr;42(1):3-11.
6
Detection and full genome characterization of two beta CoV viruses related to Middle East respiratory syndrome from bats in Italy.在意大利的蝙蝠中检测到与中东呼吸综合征相关的两种β-CoV 病毒,并对其全基因组进行了特征分析。
Virol J. 2017 Dec 19;14(1):239. doi: 10.1186/s12985-017-0907-1.
7
Detection of coronaviruses in & species of bats from different States of India.在来自印度不同邦的蝙蝠中检测到冠状病毒。
Indian J Med Res. 2020;151(2 & 3):226-235. doi: 10.4103/ijmr.IJMR_795_20.
8
The antiviral compound remdesivir potently inhibits RNA-dependent RNA polymerase from Middle East respiratory syndrome coronavirus.抗病毒化合物瑞德西韦能有效抑制中东呼吸综合征冠状病毒的 RNA 依赖性 RNA 聚合酶。
J Biol Chem. 2020 Apr 10;295(15):4773-4779. doi: 10.1074/jbc.AC120.013056. Epub 2020 Feb 24.
9
Genomic characterization of a newly discovered coronavirus associated with acute respiratory distress syndrome in humans.人类急性呼吸窘迫综合征相关新发现冠状病毒的基因组特征。
mBio. 2012 Nov 20;3(6):e00473-12. doi: 10.1128/mBio.00473-12.
10
[Molecular diagnosis and phylogenetic analysis of the first MERS case in Turkey].[土耳其首例中东呼吸综合征病例的分子诊断与系统发育分析]
Mikrobiyol Bul. 2015 Jul;49(3):414-22. doi: 10.5578/mb.9247.

本文引用的文献

1
Predicting enhancer-promoter interaction from genomic sequence with deep neural networks.利用深度神经网络从基因组序列预测增强子-启动子相互作用。
Quant Biol. 2019 Jun;7(2):122-137. doi: 10.1007/s40484-019-0154-0.
2
Identifying viruses from metagenomic data using deep learning.利用深度学习从宏基因组数据中识别病毒。
Quant Biol. 2020 Mar;8(1):64-77. doi: 10.1007/s40484-019-0187-4.
3
Network motif-based analysis of regulatory patterns in paralogous gene pairs.基于网络基序的旁系同源基因对调控模式分析。
J Bioinform Comput Biol. 2020 Jun;18(3):2040008. doi: 10.1142/S0219720020400089.
4
Understanding and Interpretation of Case Fatality Rate of Coronavirus Disease 2019.2019年冠状病毒病病死率的理解与解读
J Korean Med Sci. 2020 Mar 30;35(12):e137. doi: 10.3346/jkms.2020.35.e137.
5
MoRFPred_en: Sequence-based prediction of MoRFs using an ensemble learning strategy.MoRFPred_en:使用集成学习策略基于序列预测莫尔费(MoRFs)。
J Bioinform Comput Biol. 2019 Dec;17(6):1940015. doi: 10.1142/S0219720019400158.
6
A new coronavirus associated with human respiratory disease in China.一种在中国与人类呼吸道疾病相关的新型冠状病毒。
Nature. 2020 Mar;579(7798):265-269. doi: 10.1038/s41586-020-2008-3. Epub 2020 Feb 3.
7
Many human RNA viruses show extraordinarily stringent selective constraints on protein evolution.许多人类 RNA 病毒在蛋白质进化上表现出极高的严格选择约束。
Proc Natl Acad Sci U S A. 2019 Sep 17;116(38):19009-19018. doi: 10.1073/pnas.1907626116. Epub 2019 Sep 4.
8
Global, regional, and national age-sex-specific mortality for 282 causes of death in 195 countries and territories, 1980-2017: a systematic analysis for the Global Burden of Disease Study 2017.全球、区域和国家按年龄、性别和死因分类的死亡率,195 个国家和地区,1980-2017 年:2017 年全球疾病负担研究的系统分析。
Lancet. 2018 Nov 10;392(10159):1736-1788. doi: 10.1016/S0140-6736(18)32203-7. Epub 2018 Nov 8.
9
A planarian nidovirus expands the limits of RNA genome size.涡虫虹彩病毒拓展了 RNA 基因组大小的极限。
PLoS Pathog. 2018 Nov 1;14(11):e1007314. doi: 10.1371/journal.ppat.1007314. eCollection 2018 Nov.
10
Clustering DNA sequences using the out-of-place measure with reduced n-grams.使用减少的n元语法的异位度量对DNA序列进行聚类。
J Theor Biol. 2016 Oct 7;406:61-72. doi: 10.1016/j.jtbi.2016.06.029. Epub 2016 Jun 29.