• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

广义树比对问题的局部搜索。

Local search for the generalized tree alignment problem.

机构信息

Division of Invertebrate Zoology, American Museum of Natural History, New York, NY 10024, USA.

出版信息

BMC Bioinformatics. 2013 Feb 26;14:66. doi: 10.1186/1471-2105-14-66.

DOI:10.1186/1471-2105-14-66
PMID:23441880
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3637488/
Abstract

BACKGROUND

A phylogeny postulates shared ancestry relationships among organisms in the form of a binary tree. Phylogenies attempt to answer an important question posed in biology: what are the ancestor-descendent relationships between organisms? At the core of every biological problem lies a phylogenetic component. The patterns that can be observed in nature are the product of complex interactions, constrained by the template that our ancestors provide. The problem of simultaneous tree and alignment estimation under Maximum Parsimony is known in combinatorial optimization as the Generalized Tree Alignment Problem (GTAP). The GTAP is the Steiner Tree Problem for the sequence edit distance. Like many biologically interesting problems, the GTAP is NP-Hard. Typically the Steiner Tree is presented under the Manhattan or the Hamming distances.

RESULTS

Experimentally, the accuracy of the GTAP has been subjected to evaluation. Results show that phylogenies selected using the GTAP from unaligned sequences are competitive with the best methods and algorithms available. Here, we implement and explore experimentally existing and new local search heuristics for the GTAP using simulated and real data.

CONCLUSIONS

The methods presented here improve by more than three orders of magnitude in execution time the best local search heuristics existing to date when applied to real data.

摘要

背景

系统发育树假设有共同的祖先关系的生物以二进制树的形式。系统发育树试图回答生物学中的一个重要问题:生物体之间的祖先-后代关系是什么?每个生物学问题的核心都有一个系统发育成分。在自然界中可以观察到的模式是复杂相互作用的产物,受限于我们祖先提供的模板。最大简约法下同时进行树和比对估计的问题在组合优化中被称为广义树比对问题(GTAP)。GTAP 是序列编辑距离的 Steiner 树问题。像许多有趣的生物学问题一样,GTAP 是 NP 难的。通常,Steiner 树是在曼哈顿或汉明距离下呈现的。

结果

实验中,GTAP 的准确性已经过评估。结果表明,使用 GTAP 从未对齐的序列中选择的系统发育与现有的最佳方法和算法具有竞争力。在这里,我们使用模拟和真实数据实现和实验性地探索了 GTAP 的现有和新的局部搜索启发式方法。

结论

当应用于真实数据时,与迄今为止存在的最佳局部搜索启发式方法相比,本文提出的方法在执行时间上提高了三个数量级以上。

相似文献

1
Local search for the generalized tree alignment problem.广义树比对问题的局部搜索。
BMC Bioinformatics. 2013 Feb 26;14:66. doi: 10.1186/1471-2105-14-66.
2
The tree alignment problem.树对齐问题。
BMC Bioinformatics. 2012 Nov 9;13:293. doi: 10.1186/1471-2105-13-293.
3
The deferred path heuristic for the generalized tree alignment problem.广义树对齐问题的延迟路径启发式算法。
J Comput Biol. 1997 Fall;4(3):415-31. doi: 10.1089/cmb.1997.4.415.
4
SATe-II: very fast and accurate simultaneous estimation of multiple sequence alignments and phylogenetic trees.SATe-II:一种非常快速且准确的同时估计多个序列比对和系统发育树的方法。
Syst Biol. 2012 Jan;61(1):90-106. doi: 10.1093/sysbio/syr095. Epub 2011 Dec 1.
5
Simultaneous phylogeny reconstruction and multiple sequence alignment.同时进行系统发育重建和多序列比对。
BMC Bioinformatics. 2009 Jan 30;10 Suppl 1(Suppl 1):S11. doi: 10.1186/1471-2105-10-S1-S11.
6
Efficient parsimony-based methods for phylogenetic network reconstruction.基于简约法的高效系统发育网络重建方法。
Bioinformatics. 2007 Jan 15;23(2):e123-8. doi: 10.1093/bioinformatics/btl313.
7
Settling the intractability of multiple alignment.解决多重比对的棘手问题。
J Comput Biol. 2006 Sep;13(7):1323-39. doi: 10.1089/cmb.2006.13.1323.
8
A Provably Efficient Algorithm for the k-Mismatch Average Common Substring Problem.一种用于解决k错配平均公共子串问题的可证明高效算法。
J Comput Biol. 2016 Jun;23(6):472-82. doi: 10.1089/cmb.2015.0235. Epub 2016 Apr 8.
9
Efficient algorithms for knowledge-enhanced supertree and supermatrix phylogenetic problems.基于知识增强的超级树和超级矩阵系统发育问题的高效算法。
IEEE/ACM Trans Comput Biol Bioinform. 2013 Nov-Dec;10(6):1432-41. doi: 10.1109/TCBB.2012.162.
10
AxML: a fast program for sequential and parallel phylogenetic tree calculations based on the maximum likelihood method.AxML:一种基于最大似然法进行序列和并行系统发育树计算的快速程序。
Proc IEEE Comput Soc Bioinform Conf. 2002;1:21-8.

引用本文的文献

1
Characterization of VP1 sequence of Coxsackievirus A16 isolates by Bayesian evolutionary method.基于贝叶斯进化方法对柯萨奇病毒A16分离株VP1序列的特征分析
Virol J. 2016 Jul 28;13:130. doi: 10.1186/s12985-016-0578-3.

本文引用的文献

1
AN EMPIRICAL COMPARISON OF MICROCOMPUTER PARSIMONY PROGRAMS.
Cladistics. 1987 Jun;3(2):121-144. doi: 10.1111/j.1096-0031.1987.tb00502.x.
2
CHARACTER OPTIMIZATION AND CALCULATION OF TREE LENGTHS.字符优化与树长计算
Cladistics. 1993 Dec;9(4):433-436. doi: 10.1111/j.1096-0031.1993.tb00236.x.
3
Efficient Incremental Character Optimization.高效增量字符优化
Cladistics. 1997 Mar;13(1-2):21-26. doi: 10.1111/j.1096-0031.1997.tb00239.x.
4
Tree Searches Under Sankoff Parsimony.基于 Sankoff 简约法的树形搜索
Cladistics. 1998 Sep;14(3):229-237. doi: 10.1111/j.1096-0031.1998.tb00336.x.
5
Analyzing Large Data Sets in Reasonable Times: Solutions for Composite Optima.在合理时间内分析大型数据集:复合最优解的解决方案。
Cladistics. 1999 Dec;15(4):415-428. doi: 10.1111/j.1096-0031.1999.tb00278.x.
6
POY version 4: phylogenetic analysis using dynamic homologies.POY版本4:使用动态同源性的系统发育分析。
Cladistics. 2010 Feb;26(1):72-85. doi: 10.1111/j.1096-0031.2009.00282.x. Epub 2009 Sep 24.
7
The tree alignment problem.树对齐问题。
BMC Bioinformatics. 2012 Nov 9;13:293. doi: 10.1186/1471-2105-13-293.
8
Simultaneous phylogeny reconstruction and multiple sequence alignment.同时进行系统发育重建和多序列比对。
BMC Bioinformatics. 2009 Jan 30;10 Suppl 1(Suppl 1):S11. doi: 10.1186/1471-2105-10-S1-S11.
9
Barking up the wrong treelength: the impact of gap penalty on alignment and tree accuracy.错误的树长探寻:空位罚分对序列比对和树准确性的影响
IEEE/ACM Trans Comput Biol Bioinform. 2009 Jan-Mar;6(1):7-21. doi: 10.1109/TCBB.2008.63.
10
Phylogeny estimation and alignment via POY versus Clustal + PAUP*: a response to Ogden and Rosenberg (2007).通过POY与Clustal + PAUP*进行系统发育估计和序列比对:对奥格登和罗森伯格(2007年)的回应
Syst Biol. 2008 Aug;57(4):653-7. doi: 10.1080/10635150802302476.