Hu Gang, Kurgan Lukasz
School of Mathematical Sciences and LPMC, Nankai University, Tianjin, China.
Department of Computer Science, Virginia Commonwealth University, Richmond, Virginia.
Curr Protoc Protein Sci. 2019 Feb;95(1):e71. doi: 10.1002/cpps.71. Epub 2018 Aug 13.
Sequence similarity searching has become an important part of the daily routine of molecular biologists, bioinformaticians and biophysicists. With the rapidly growing sequence databanks, this computational approach is commonly applied to determine functions and structures of unannotated sequences, to investigate relationships between sequences, and to construct phylogenetic trees. We introduce arguably the most popular BLAST-based family of the sequence similarity search tools. We explain basic concepts related to the sequence alignment and demonstrate how to search the current databanks using Web site versions of BLASTP, PSI-BLAST and BLASTN. We also describe the standalone BLAST+ tool. Moreover, this unit discusses the inputs, parameter settings and outputs of these tools. Lastly, we cover recent advances in the sequence similarity searching, focusing on the fast MMseqs2 method. © 2018 by John Wiley & Sons, Inc.
序列相似性搜索已成为分子生物学家、生物信息学家和生物物理学家日常工作的重要组成部分。随着序列数据库的迅速增长,这种计算方法通常用于确定未注释序列的功能和结构、研究序列之间的关系以及构建系统发育树。我们介绍了可以说是最流行的基于BLAST的序列相似性搜索工具家族。我们解释了与序列比对相关的基本概念,并演示了如何使用BLASTP、PSI-BLAST和BLASTN的网站版本搜索当前数据库。我们还描述了独立的BLAST+工具。此外,本单元讨论了这些工具的输入、参数设置和输出。最后,我们介绍了序列相似性搜索的最新进展,重点是快速的MMseqs2方法。© 2018 John Wiley & Sons, Inc.