Suppr超能文献

指南2:考虑多个参数的不确定性,准确检测不可靠的比对区域。

GUIDANCE2: accurate detection of unreliable alignment regions accounting for the uncertainty of multiple parameters.

作者信息

Sela Itamar, Ashkenazy Haim, Katoh Kazutaka, Pupko Tal

机构信息

Department of Cell Research and Immunology, George S. Wise Faculty of Life Sciences, Tel-Aviv University, Tel-Aviv 6997801, Israel.

Immunology Frontier Research Center, Osaka University, Suita, Osaka 565-0871, Japan Computational Biology Research Center, The National Institute of Advanced Industrial Science and Technology (AIST), Tokyo 135-0064, Japan.

出版信息

Nucleic Acids Res. 2015 Jul 1;43(W1):W7-14. doi: 10.1093/nar/gkv318. Epub 2015 Apr 16.

Abstract

Inference of multiple sequence alignments (MSAs) is a critical part of phylogenetic and comparative genomics studies. However, from the same set of sequences different MSAs are often inferred, depending on the methodologies used and the assumed parameters. Much effort has recently been devoted to improving the ability to identify unreliable alignment regions. Detecting such unreliable regions was previously shown to be important for downstream analyses relying on MSAs, such as the detection of positive selection. Here we developed GUIDANCE2, a new integrative methodology that accounts for: (i) uncertainty in the process of indel formation, (ii) uncertainty in the assumed guide tree and (iii) co-optimal solutions in the pairwise alignments, used as building blocks in progressive alignment algorithms. We compared GUIDANCE2 with seven methodologies to detect unreliable MSA regions using extensive simulations and empirical benchmarks. We show that GUIDANCE2 outperforms all previously developed methodologies. Furthermore, GUIDANCE2 also provides a set of alternative MSAs which can be useful for downstream analyses. The novel algorithm is implemented as a web-server, available at: http://guidance.tau.ac.il.

摘要

多序列比对(MSA)的推断是系统发育和比较基因组学研究的关键部分。然而,根据所使用的方法和假定的参数,从同一组序列中常常会推断出不同的MSA。最近,人们投入了大量精力来提高识别不可靠比对区域的能力。先前已证明,检测此类不可靠区域对于依赖MSA的下游分析(如正选择检测)很重要。在此,我们开发了GUIDANCE2,这是一种新的综合方法,该方法考虑了:(i)插入缺失形成过程中的不确定性,(ii)假定引导树中的不确定性,以及(iii)作为渐进比对算法构建块的两两比对中的共同最优解。我们使用广泛的模拟和实证基准,将GUIDANCE2与七种检测不可靠MSA区域的方法进行了比较。我们表明,GUIDANCE2优于所有先前开发的方法。此外,GUIDANCE2还提供了一组可供选择的MSA,这对于下游分析可能是有用的。该新算法作为一个网络服务器实现,网址为:http://guidance.tau.ac.il

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验