Synthetic Biology and Biotechnology Unit, Department of Biology, University of Padua, 35131 Padua, Italy.
Viruses. 2024 Sep 6;16(9):1425. doi: 10.3390/v16091425.
Computer-aided analysis of proteins or nucleic acids seems like a matter of course nowadays; however, the history of Bioinformatics and Computational Biology is quite recent. The advent of high-throughput sequencing has led to the production of "big data", which has also affected the field of virology. The collaboration between the communities of bioinformaticians and virologists already started a few decades ago and it was strongly enhanced by the recent SARS-CoV-2 pandemics. In this article, which is the first in a series on how bioinformatics can enhance virus research, we show that highly useful information is retrievable from selected general and dedicated databases. Indeed, an enormous amount of information-both in terms of nucleotide/protein sequences and their annotation-is deposited in the general databases of international organisations participating in the International Nucleotide Sequence Database Collaboration (INSDC). However, more and more virus-specific databases have been established and are progressively enriched with the contents and features reported in this article. Since viruses are intracellular obligate parasites, a special focus is given to host-pathogen protein-protein interaction databases. Finally, we illustrate several phylogenetic and phylodynamic tools, combining information on algorithms and features with practical information on how to use them and case studies that validate their usefulness. Databases and tools for functional inference will be covered in the next article of this series: .
计算机辅助分析蛋白质或核酸在当今似乎是理所当然的事情;然而,生物信息学和计算生物学的历史相当短暂。高通量测序的出现导致了“大数据”的产生,这也影响了病毒学领域。生物信息学家和病毒学家社区之间的合作早在几十年前就已经开始,最近的 SARS-CoV-2 大流行更是加强了这种合作。在本系列文章的第一篇中,我们展示了如何从选定的通用和专用数据库中检索到非常有用的信息,这篇文章探讨了生物信息学如何增强病毒研究。事实上,无论是在核苷酸/蛋白质序列及其注释方面,都有大量的信息被存储在参与国际核苷酸序列数据库协作(INSDC)的国际组织的通用数据库中。然而,越来越多的病毒专用数据库已经建立起来,并逐步充实了本文所报道的内容和特点。由于病毒是细胞内专性寄生虫,因此特别关注宿主-病原体蛋白质-蛋白质相互作用数据库。最后,我们展示了几种系统发育和系统进化动力学工具,结合了算法和功能的信息,以及如何使用它们的实用信息和案例研究,验证了它们的有用性。本系列的下一篇文章将介绍用于功能推断的数据库和工具: 。