Base-By-Base: single nucleotide-level analysis of whole viral genome alignments.

Authors

Brodie Ryan, Smith Alex J, Roper Rachel L, Tcherepanov Vasily, Upton Chris

Affiliation

Biochemistry and Microbiology, University of Victoria, B.C., V8W 3P6, Canada.

Publication details

BMC Bioinformatics. 2004 Jul 14;5:96. doi: 10.1186/1471-2105-5-96.

DOI:10.1186/1471-2105-5-96

PMID:15253776

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC481056/

Abstract

BACKGROUND

With ever increasing numbers of closely related virus genomes being sequenced, it has become desirable to be able to compare two genomes at a level more detailed than gene content because two strains of an organism may share the same set of predicted genes but still differ in their pathogenicity profiles. For example, detailed comparison of multiple isolates of the smallpox virus genome (each approximately 200 kb, with 200 genes) is not feasible without new bioinformatics tools.

RESULTS

A software package, Base-By-Base, has been developed that provides visualization tools to enable researchers to 1) rapidly identify and correct alignment errors in large, multiple genome alignments; and 2) generate tabular and graphical output of differences between the genomes at the nucleotide level. Base-By-Base uses detailed annotation information about the aligned genomes and can list each predicted gene with nucleotide differences, display whether variations occur within promoter regions or coding regions and whether these changes result in amino acid substitutions. Base-By-Base can connect to our mySQL database (Virus Orthologous Clusters; VOCs) to retrieve detailed annotation information about the aligned genomes or use information from text files.
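The kind of report described above (listing nucleotide differences and noting whether each falls in a coding region and changes an amino acid) can be illustrated with a minimal Python sketch. This is not Base-By-Base's own code, which is a Java application; the function names `list_differences` and `classify_differences`, the coordinate convention, and the example sequences are all hypothetical. The sketch assumes two already-aligned, uppercase sequences of equal length and a single forward-strand gene given as 0-based, half-open alignment columns.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def list_differences(ref, other):
    """Return (column, ref_base, other_base) for each differing alignment column."""
    if len(ref) != len(other):
        raise ValueError("aligned sequences must have the same length")
    return [(i, a, b) for i, (a, b) in enumerate(zip(ref, other)) if a != b]


def classify_differences(ref, other, cds_start, cds_end):
    """Label each difference as non-coding, indel, silent, or an amino acid change.

    Assumption (hypothetical convention): cds_start/cds_end are 0-based,
    half-open alignment columns of one forward-strand gene whose length is a
    multiple of three.
    """
    report = []
    for col, a, b in list_differences(ref, other):
        if not (cds_start <= col < cds_end):
            report.append((col, a, b, "non-coding"))
            continue
        # Locate the codon that contains this column.
        codon_start = cds_start + (col - cds_start) // 3 * 3
        ref_codon = ref[codon_start:codon_start + 3]
        alt_codon = other[codon_start:codon_start + 3]
        ref_aa = CODON_TABLE.get(ref_codon)
        alt_aa = CODON_TABLE.get(alt_codon)
        if ref_aa is None or alt_aa is None:
            # Gap characters or ambiguous bases: no simple codon comparison.
            report.append((col, a, b, "indel"))
        elif ref_aa == alt_aa:
            report.append((col, a, b, "silent"))
        else:
            report.append((col, a, b, f"{ref_aa}->{alt_aa}"))
    return report


# Toy example: the gene ATG GCT GAA TAA occupies columns 2..13.
ref_seq = "TTATGGCTGAATAAC"
alt_seq = "CTATGGCCAAATAAC"
for row in classify_differences(ref_seq, alt_seq, 2, 14):
    print(row)
```

A real comparison of 200 kb genomes would also need to handle genes on the reverse strand, gaps that shift the reading frame, and promoter annotations, none of which this sketch attempts.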

CONCLUSION

Base-By-Base enables users to quickly and easily compare large viral genomes; it highlights small differences that may be responsible for important phenotypic differences such as virulence. It is available via the Internet using Java Web Start and runs on Macintosh, PC and Linux operating systems with the Java 1.4 virtual machine.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d9e7/481056/52e6bc21f060/1471-2105-5-96-1.jpg

Similar articles

GATA: a graphic alignment tool for comparative sequence analysis.

BMC Bioinformatics. 2005 Jan 17;6:9. doi: 10.1186/1471-2105-6-9.

CGAT: a comparative genome analysis tool for visualizing alignments in the analysis of complex evolutionary changes between closely related genomes.

BMC Bioinformatics. 2006 Oct 24;7:472. doi: 10.1186/1471-2105-7-472.

SeqTools: visual tools for manual analysis of sequence alignments.

BMC Res Notes. 2016 Jan 22;9:39. doi: 10.1186/s13104-016-1847-3.

[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].

Yi Chuan Xue Bao. 2004 May;31(5):431-43.

Combo: a whole genome comparative browser.

Bioinformatics. 2006 Jul 15;22(14):1782-3. doi: 10.1093/bioinformatics/btl193. Epub 2006 May 18.

Bioinformatics for analysis of poxvirus genomes.

Methods Mol Biol. 2012;890:233-58. doi: 10.1007/978-1-61779-876-4_14.

EnteriX 2003: Visualization tools for genome alignments of Enterobacteriaceae.

Nucleic Acids Res. 2003 Jul 1;31(13):3527-32. doi: 10.1093/nar/gkg551.

New bioinformatics tools for viral genome analyses at Viral Bioinformatics--Canada.

Pharmacogenomics. 2005 Apr;6(3):271-80. doi: 10.1517/14622416.6.3.271.

PLATCOM: a Platform for Computational Comparative Genomics.

Bioinformatics. 2005 May 15;21(10):2514-6. doi: 10.1093/bioinformatics/bti350. Epub 2005 Feb 24.

Cited by

The Involvement of Genes in the Long-Term Persistence of the African Swine Fever Virus in Gastropods.

Viruses. 2025 Jun 7;17(6):824. doi: 10.3390/v17060824.

Genomic analysis of two Chinese isolates of hyphantria cunea nucleopolyhedrovirus reveals a novel species of alphabaculovirus that infects hyphantria cunea drury (lepidoptera: arctiidae).

BMC Genomics. 2022 May 13;23(1):367. doi: 10.1186/s12864-022-08604-7.

Computational strategies to combat COVID-19: useful tools to accelerate SARS-CoV-2 and coronavirus research.

Brief Bioinform. 2021 Mar 22;22(2):642-663. doi: 10.1093/bib/bbaa232.

Base-By-Base Version 3: New Comparative Tools for Large Virus Genomes.

Viruses. 2018 Nov 15;10(11):637. doi: 10.3390/v10110637.

Complete genome sequence of a novel sea otterpox virus.

Virus Genes. 2018 Dec;54(6):756-767. doi: 10.1007/s11262-018-1594-8. Epub 2018 Sep 17.

Genomic Sequencing of High-Efficiency Transducing Streptococcal Bacteriophage A25: Consequences of Escape from Lysogeny.

J Bacteriol. 2018 Nov 6;200(23). doi: 10.1128/JB.00358-18. Print 2018 Dec 1.

Pathogenicity and Whole Genome Sequence Analysis of a Pseudorabies Virus Strain FJ-2012 Isolated from Fujian, Southern China.

Can J Infect Dis Med Microbiol. 2017;2017:9073172. doi: 10.1155/2017/9073172. Epub 2017 Dec 31.

Construction of an infectious horsepox virus vaccine from chemically synthesized DNA fragments.

PLoS One. 2018 Jan 19;13(1):e0188453. doi: 10.1371/journal.pone.0188453. eCollection 2018.

Insights into the ancestral organisation of the mammalian MHC class II region from the genome of the pteropid bat, Pteropus alecto.

BMC Genomics. 2017 May 18;18(1):388. doi: 10.1186/s12864-017-3760-0.

Genomic characterization of two novel pathogenic avipoxviruses isolated from pacific shearwaters (Ardenna spp.).

BMC Genomics. 2017 Apr 13;18(1):298. doi: 10.1186/s12864-017-3680-z.
