FEAST：敏感的多进化速率局部比对。

FEAST: sensitive local alignment with multiple rates of evolution.

机构信息

David R. Cheriton School of Computer Science, University of Waterloo, 200 University Avenue West, Waterloo, Ontario N2L 3G1, Canada.

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2011 May-Jun;8(3):698-709. doi: 10.1109/TCBB.2010.76.

DOI:10.1109/TCBB.2010.76

PMID:20733242

Abstract

We present a pairwise local aligner, FEAST, which uses two new techniques: a sensitive extension algorithm for identifying homologous subsequences, and a descriptive probabilistic alignment model. We also present a new procedure for training alignment parameters and apply it to the human and mouse genomes, producing a better parameter set for these sequences. Our extension algorithm identifies homologous subsequences by considering all evolutionary histories. It has higher maximum sensitivity than Viterbi extensions, and better balances specificity. We model alignments with several submodels, each with unique statistical properties, describing strongly similar and weakly similar regions of homologous DNA. Training parameters using two submodels produces superior alignments, even when we align with only the parameters from the weaker submodel. Our extension algorithm combined with our new parameter set achieves sensitivity 0.59 on synthetic tests. In contrast, LASTZ with default settings achieves sensitivity 0.35 with the same false positive rate. Using the weak submodel as parameters for LASTZ increases its sensitivity to 0.59 with high error. FEAST is available at http://monod.uwaterloo.ca/feast/.

摘要

我们提出了一种新的两两局部比对程序 FEAST，它使用了两种新技术：一种用于识别同源序列的敏感扩展算法和一种描述性概率比对模型。我们还提出了一种新的对齐参数训练程序，并将其应用于人类和小鼠基因组，为这些序列生成了更好的参数集。我们的扩展算法通过考虑所有进化历史来识别同源序列。它比维特比扩展具有更高的最大灵敏度，并且更好地平衡了特异性。我们使用多个子模型对比对进行建模，每个子模型都具有独特的统计特性，描述同源 DNA 的强相似区域和弱相似区域。使用两个子模型训练参数可产生更好的比对结果，即使我们仅使用较弱子模型的参数进行比对也是如此。我们的扩展算法结合新的参数集，在合成测试中实现了 0.59 的灵敏度。相比之下，LASTZ 默认设置的灵敏度为 0.35，假阳性率相同。使用弱子模型作为 LASTZ 的参数可以将其灵敏度提高到 0.59，但错误率很高。FEAST 可在 http://monod.uwaterloo.ca/feast/ 上获得。

相似文献

FEAST: sensitive local alignment with multiple rates of evolution.FEAST：敏感的多进化速率局部比对。

IEEE/ACM Trans Comput Biol Bioinform. 2011 May-Jun;8(3):698-709. doi: 10.1109/TCBB.2010.76.

An algorithm for progressive multiple alignment of sequences with insertions.一种用于含插入序列的渐进多序列比对算法。

Proc Natl Acad Sci U S A. 2005 Jul 26;102(30):10557-62. doi: 10.1073/pnas.0409137102. Epub 2005 Jul 6.

Bayesian coestimation of phylogeny and sequence alignment.系统发育与序列比对的贝叶斯联合估计

BMC Bioinformatics. 2005 Apr 1;6:83. doi: 10.1186/1471-2105-6-83.

A novel heuristic for local multiple alignment of interspersed DNA repeats.一种用于散布DNA重复序列局部多重比对的新型启发式方法。

IEEE/ACM Trans Comput Biol Bioinform. 2009 Apr-Jun;6(2):180-9. doi: 10.1109/TCBB.2009.9.

Mulan: multiple-sequence local alignment and visualization for studying function and evolution.木兰：用于研究功能和进化的多序列局部比对与可视化

Genome Res. 2005 Jan;15(1):184-94. doi: 10.1101/gr.3007205. Epub 2004 Dec 8.

DIALIGN-T: an improved algorithm for segment-based multiple sequence alignment.DIALIGN-T：一种改进的基于片段的多序列比对算法。

BMC Bioinformatics. 2005 Mar 22;6:66. doi: 10.1186/1471-2105-6-66.

PhyloGibbs: a Gibbs sampling motif finder that incorporates phylogeny.PhyloGibbs：一种整合了系统发育的吉布斯采样基序查找器。

PLoS Comput Biol. 2005 Dec;1(7):e67. doi: 10.1371/journal.pcbi.0010067. Epub 2005 Dec 9.

Accurate anchoring alignment of divergent sequences.发散序列的精确锚定比对。

Bioinformatics. 2006 Jan 1;22(1):29-34. doi: 10.1093/bioinformatics/bti772. Epub 2005 Nov 13.

Dynamic use of multiple parameter sets in sequence alignment.在序列比对中动态地依次使用多个参数集。

Nucleic Acids Res. 2007;35(2):678-86. doi: 10.1093/nar/gkl1063. Epub 2006 Dec 19.

A local multiple alignment method for detection of non-coding RNA sequences.一种用于检测非编码RNA序列的局部多重比对方法。

Bioinformatics. 2009 Jun 15;25(12):1498-505. doi: 10.1093/bioinformatics/btp261. Epub 2009 Apr 17.

引用本文的文献

Split-alignment of genomes finds orthologies more accurately.基因组的分裂比对能更准确地找到直系同源基因。

Genome Biol. 2015 May 21;16(1):106. doi: 10.1186/s13059-015-0670-9.

Sequence and expression analysis of gaps in human chromosome 20.人类染色体 20 区带序列及表达分析

Nucleic Acids Res. 2012 Aug;40(14):6660-72. doi: 10.1093/nar/gks302. Epub 2012 Apr 17.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

FEAST：敏感的多进化速率局部比对。

FEAST: sensitive local alignment with multiple rates of evolution.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献