Thompson J D, Plewniak F, Thierry J, Poch O
Laboratoire de Biologie et Génomique Structurales, Institut de Génétique et de Biologie Moléculaire et Cellulaire, CNRS/INSERM/ULP, BP 163, 67404 Illkirch Cedex, France.
Nucleic Acids Res. 2000 Aug 1;28(15):2919-26. doi: 10.1093/nar/28.15.2919.
DbClustal addresses the problem of automatically building a multiple alignment of the top-scoring full-length sequences detected by a database homology search. By combining the advantages of local and global alignment algorithms in a single system, DbClustal provides accurate global alignments of highly divergent, complex sequence sets. Local alignment information is incorporated into a ClustalW global alignment as a list of anchor points between pairs of sequences. The method is demonstrated using anchors supplied by Ballast, a Blast post-processing program. The speed and reliability of DbClustal were demonstrated on the recently annotated Pyrococcus abyssi proteome, where the proportion of alignments containing totally misaligned sequences fell from 20% to <2%. A web site has been implemented that offers BlastP database searches with automatic alignment of the top hits by DbClustal.
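The idea of feeding local anchor points into a global alignment can be illustrated with a small sketch. The code below is not DbClustal's implementation: it is a plain pairwise Needleman-Wunsch alignment with a linear gap penalty, in which each anchored residue pair receives an extra bonus added to its substitution score. The function name `anchored_global_align`, the scoring values, and the representation of anchors as a dictionary from 0-based index pairs to bonus scores are all assumptions made for illustration. ClustalW itself aligns profiles with position-specific affine gap penalties, which this sketch does not attempt to reproduce.

```python
from typing import Dict, List, Tuple


def anchored_global_align(
    a: str,
    b: str,
    anchors: Dict[Tuple[int, int], int],
    match: int = 1,
    mismatch: int = -1,
    gap: int = -2,
) -> Tuple[str, str, int]:
    """Globally align a and b, rewarding anchored residue pairs.

    anchors maps (i, j), 0-based positions in a and b, to a bonus that is
    added to the substitution score when a[i] is aligned with b[j].
    This is an illustrative assumption, not DbClustal's actual scoring.
    """
    n, m = len(a), len(b)
    score: List[List[int]] = [[0] * (m + 1) for _ in range(n + 1)]
    # move[i][j] records how cell (i, j) was reached: "D", "U" or "L".
    move: List[List[str]] = [[""] * (m + 1) for _ in range(n + 1)]

    # First column and row: prefixes aligned against gaps only.
    for i in range(1, n + 1):
        score[i][0] = i * gap
        move[i][0] = "U"
    for j in range(1, m + 1):
        score[0][j] = j * gap
        move[0][j] = "L"

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            sub += anchors.get((i - 1, j - 1), 0)  # anchor bonus, if any
            candidates = [
                (score[i - 1][j - 1] + sub, "D"),  # align a[i-1] with b[j-1]
                (score[i - 1][j] + gap, "U"),      # gap in b
                (score[i][j - 1] + gap, "L"),      # gap in a
            ]
            score[i][j], move[i][j] = max(candidates, key=lambda c: c[0])

    # Trace back from the bottom-right corner to recover the alignment.
    out_a: List[str] = []
    out_b: List[str] = []
    i, j = n, m
    while i > 0 or j > 0:
        step = move[i][j]
        if step == "D":
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif step == "U":
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1

    return "".join(reversed(out_a)), "".join(reversed(out_b)), score[n][m]
```

For example, calling `anchored_global_align("AAAC", "CAAA", {})` leaves the two sequences ungapped, whereas passing the anchor `{(3, 0): 20}` forces the two C residues into the same column at the cost of six gaps. A sufficiently strong anchor thus overrides the alignment that the global scoring alone would choose, which is the effect the abstract attributes to incorporating local alignment information.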