Pearson, William R.
University of Virginia School of Medicine, Charlottesville, Virginia.
Curr Protoc Bioinformatics. 2013 Jun;Chapter 3:3.1.1-3.1.8. doi: 10.1002/0471250953.bi0301s42.
Sequence similarity searching, typically with BLAST, is the most widely used and most reliable strategy for characterizing newly determined sequences. Sequence similarity searches can identify "homologous" proteins or genes by detecting excess similarity: statistically significant similarity that reflects common ancestry. This unit provides an overview of the inference of homology from significant similarity and introduces other units in this chapter that provide more details on effective strategies for identifying homologs.