• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

欧几里得距离分析助力病毒基因组中的核苷酸偏斜分析。

Euclidean Distance Analysis Enables Nucleotide Skew Analysis in Viral Genomes.

作者信息

van Hemert Formijn, Jebbink Maarten, van der Ark Andries, Scholer Frits, Berkhout Ben

机构信息

Laboratory of Experimental Virology, Medical Microbiology, Amsterdam UMC, University of Amsterdam, Amsterdam, Netherlands.

Research Institute of Child Development and Education, University of Amsterdam, Amsterdam, Netherlands.

出版信息

Comput Math Methods Med. 2018 Oct 30;2018:6490647. doi: 10.1155/2018/6490647. eCollection 2018.

DOI:10.1155/2018/6490647
PMID:30510593
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6232797/
Abstract

Nucleotide skew analysis is a versatile method to study the nucleotide composition of RNA/DNA molecules, in particular to reveal characteristic sequence signatures. For instance, skew analysis of the nucleotide bias of several viral RNA genomes indicated that it is enriched in the unpaired, single-stranded genome regions, thus creating an even more striking virus-specific signature. The comparison of skew graphs for many virus isolates or families is difficult, time-consuming, and nonquantitative. Here, we present a procedure for a more simple identification of similarities and dissimilarities between nucleotide skew data of coronavirus, flavivirus, picornavirus, and HIV-1 RNA genomes. Window and step sizes were normalized to correct for differences in length of the viral genome. Cumulative skew data are converted into pairwise Euclidean distance matrices, which can be presented as neighbor-joining trees. We present skew value trees for the four virus families and show that closely related viruses are placed in small clusters. Importantly, the skew value trees are similar to the trees constructed by a "classical" model of evolutionary nucleotide substitution. Thus, we conclude that the simple calculation of Euclidean distances between nucleotide skew data allows an easy and quantitative comparison of characteristic sequence signatures of virus genomes. These results indicate that the Euclidean distance analysis of nucleotide skew data forms a nice addition to the virology toolbox.

摘要

核苷酸偏性分析是一种用于研究RNA/DNA分子核苷酸组成的通用方法,尤其有助于揭示特征性序列特征。例如,对几种病毒RNA基因组的核苷酸偏性进行偏性分析表明,其在未配对的单链基因组区域中富集,从而形成了更为显著的病毒特异性特征。对许多病毒分离株或病毒家族的偏性图进行比较既困难、耗时,又缺乏定量性。在此,我们提出了一种程序,可更简便地识别冠状病毒、黄病毒、小RNA病毒和HIV-1 RNA基因组的核苷酸偏性数据之间的异同。对窗口大小和步长进行归一化处理,以校正病毒基因组长度的差异。累积偏性数据被转换为成对的欧几里得距离矩阵,可将其呈现为邻接树。我们给出了这四个病毒家族的偏性值树,并表明亲缘关系相近的病毒被归为小簇。重要的是,偏性值树与通过“经典”进化核苷酸替换模型构建的树相似。因此,我们得出结论,对核苷酸偏性数据进行简单的欧几里得距离计算,就能轻松且定量地比较病毒基因组的特征性序列特征。这些结果表明,对核苷酸偏性数据进行欧几里得距离分析是病毒学工具箱中的一个很好补充。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81b1/6232797/7f9e8614afdb/CMMM2018-6490647.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81b1/6232797/20a170875d45/CMMM2018-6490647.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81b1/6232797/bca573483ced/CMMM2018-6490647.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81b1/6232797/b9872e085606/CMMM2018-6490647.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81b1/6232797/0d84f951c969/CMMM2018-6490647.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81b1/6232797/cf3366501ca8/CMMM2018-6490647.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81b1/6232797/7f9e8614afdb/CMMM2018-6490647.006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81b1/6232797/20a170875d45/CMMM2018-6490647.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81b1/6232797/bca573483ced/CMMM2018-6490647.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81b1/6232797/b9872e085606/CMMM2018-6490647.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81b1/6232797/0d84f951c969/CMMM2018-6490647.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81b1/6232797/cf3366501ca8/CMMM2018-6490647.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/81b1/6232797/7f9e8614afdb/CMMM2018-6490647.006.jpg

相似文献

1
Euclidean Distance Analysis Enables Nucleotide Skew Analysis in Viral Genomes.欧几里得距离分析助力病毒基因组中的核苷酸偏斜分析。
Comput Math Methods Med. 2018 Oct 30;2018:6490647. doi: 10.1155/2018/6490647. eCollection 2018.
2
On the nucleotide composition and structure of retroviral RNA genomes.关于逆转录病毒RNA基因组的核苷酸组成与结构
Virus Res. 2014 Nov 26;193:16-23. doi: 10.1016/j.virusres.2014.03.019. Epub 2014 Mar 24.
3
Impact of the biased nucleotide composition of viral RNA genomes on RNA structure and codon usage.病毒RNA基因组的偏向性核苷酸组成对RNA结构和密码子使用的影响
J Gen Virol. 2016 Oct;97(10):2608-2619. doi: 10.1099/jgv.0.000579. Epub 2016 Aug 11.
4
Genomic analysis of the Chinese genotype 1F rubella virus that disappeared after 2002 in China.中国 1F 基因型风疹病毒的基因组分析,该病毒于 2002 年后在中国消失。
J Med Virol. 2014 Dec;86(12):2114-21. doi: 10.1002/jmv.23936. Epub 2014 Jun 24.
5
A common RNA motif in the 3' end of the genomes of astroviruses, avian infectious bronchitis virus and an equine rhinovirus.星状病毒、禽传染性支气管炎病毒和一种马鼻病毒基因组3'端的一种常见RNA基序。
J Gen Virol. 1998 Apr;79 ( Pt 4):715-8. doi: 10.1099/0022-1317-79-4-715.
6
Detection and sequence characterization of the 3'-end of coronavirus genomes harboring the highly conserved RNA motif s2m.携带高度保守RNA基序s2m的冠状病毒基因组3'端的检测与序列特征分析
Methods Mol Biol. 2008;454:27-34. doi: 10.1007/978-1-59745-181-9_3.
7
Codon and amino acid usage in retroviral genomes is consistent with virus-specific nucleotide pressure.逆转录病毒基因组中的密码子和氨基酸使用情况与病毒特异性核苷酸压力一致。
AIDS Res Hum Retroviruses. 2002 Jan 20;18(2):133-41. doi: 10.1089/08892220252779674.
8
Comparative analysis of complete genome sequences of three avian coronaviruses reveals a novel group 3c coronavirus.三种禽冠状病毒全基因组序列的比较分析揭示了一种新型3c组冠状病毒。
J Virol. 2009 Jan;83(2):908-17. doi: 10.1128/JVI.01977-08. Epub 2008 Oct 29.
9
Circularizing picornavirus genomes to rapidly obtain terminal sequence.使微小核糖核酸病毒基因组环化以快速获得末端序列。
J Clin Virol. 2013 Sep;58(1):286-7. doi: 10.1016/j.jcv.2013.04.020. Epub 2013 May 21.
10
Sequencing and phylogenetic analysis identifies candidate members of a new picornavirus genus in terrestrial tortoise species.测序和系统发育分析确定了陆龟物种中一种新型小RNA病毒属的候选成员。
Arch Virol. 2015 Mar;160(3):811-6. doi: 10.1007/s00705-014-2292-z. Epub 2014 Nov 29.

引用本文的文献

1
Identification of technology frontiers of artificial intelligence-assisted pathology based on patent citation network.基于专利引文网络的人工智能辅助病理学技术前沿识别。
PLoS One. 2022 Aug 22;17(8):e0273355. doi: 10.1371/journal.pone.0273355. eCollection 2022.
2
Features of Action Potentials from Identified Thalamic Nuclei in Anesthetized Patients.麻醉患者中已识别丘脑核团动作电位的特征
Brain Sci. 2020 Dec 17;10(12):1002. doi: 10.3390/brainsci10121002.
3
HDNA methylation data-based molecular subtype classification related to the prognosis of patients with hepatocellular carcinoma.

本文引用的文献

1
Impact of the biased nucleotide composition of viral RNA genomes on RNA structure and codon usage.病毒RNA基因组的偏向性核苷酸组成对RNA结构和密码子使用的影响
J Gen Virol. 2016 Oct;97(10):2608-2619. doi: 10.1099/jgv.0.000579. Epub 2016 Aug 11.
2
Nucleotide composition of the Zika virus RNA genome and its codon usage.寨卡病毒RNA基因组的核苷酸组成及其密码子使用情况。
Virol J. 2016 Jun 8;13:95. doi: 10.1186/s12985-016-0551-1.
3
MEGA7: Molecular Evolutionary Genetics Analysis Version 7.0 for Bigger Datasets.MEGA7:适用于更大数据集的分子进化遗传学分析版本7.0
基于 HDNA 甲基化数据的分子亚型分类与肝癌患者预后相关。
BMC Med Genomics. 2020 Aug 24;13(1):118. doi: 10.1186/s12920-020-00770-5.
4
Computer-Assisted and Data Driven Approaches for Surveillance, Drug Discovery, and Vaccine Design for the Zika Virus.用于寨卡病毒监测、药物研发和疫苗设计的计算机辅助及数据驱动方法
Pharmaceuticals (Basel). 2019 Oct 16;12(4):157. doi: 10.3390/ph12040157.
Mol Biol Evol. 2016 Jul;33(7):1870-4. doi: 10.1093/molbev/msw054. Epub 2016 Mar 22.
4
On the biased nucleotide composition of the human coronavirus RNA genome.关于人类冠状病毒RNA基因组的偏向性核苷酸组成
Virus Res. 2015 Apr 16;202:41-7. doi: 10.1016/j.virusres.2014.11.031. Epub 2015 Feb 2.
5
Sequence analysis of origins of replication in the Saccharomyces cerevisiae genomes.酿酒酵母基因组中复制起点的序列分析。
Front Microbiol. 2014 Nov 18;5:574. doi: 10.3389/fmicb.2014.00574. eCollection 2014.
6
MAFFT: iterative refinement and additional methods.MAFFT:迭代优化及其他方法。
Methods Mol Biol. 2014;1079:131-46. doi: 10.1007/978-1-62703-646-7_8.
7
The A-nucleotide preference of HIV-1 in the context of its structured RNA genome.HIV-1 在其结构 RNA 基因组背景下的 A-核苷酸偏好性。
RNA Biol. 2013 Feb;10(2):211-5. doi: 10.4161/rna.22896. Epub 2012 Dec 12.
8
ViralZone: recent updates to the virus knowledge resource.病毒地带:病毒知识库的最新更新。
Nucleic Acids Res. 2013 Jan;41(Database issue):D579-83. doi: 10.1093/nar/gks1220. Epub 2012 Nov 28.
9
The biased nucleotide composition of the HIV genome: a constant factor in a highly variable virus.HIV 基因组的偏性核苷酸组成:高度可变病毒中的一个恒定因素。
Retrovirology. 2012 Nov 6;9:92. doi: 10.1186/1742-4690-9-92.
10
The GB viruses: a review and proposed classification of GBV-A, GBV-C (HGV), and GBV-D in genus Pegivirus within the family Flaviviridae.GB 病毒:黄病毒科中的杯状病毒属内的 GBV-A、GBV-C(HGV)和 GBV-D 的综述和分类建议。
J Gen Virol. 2011 Feb;92(Pt 2):233-46. doi: 10.1099/vir.0.027490-0. Epub 2010 Nov 17.