• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

AntiClustal:通过反极聚类和线性近似1-中位数计算进行多序列比对。

AntiClustal: Multiple Sequence Alignment by antipole clustering and linear approximate 1-median computation.

作者信息

Di Pietro C, Di Pietro V, Emmanuele G, Ferro A, Maugeri T, Modica E, Pigola G, Pulvirenti A, Purrello M, Ragusa M, Scalia M, Shasha D, Travali S, Zimmitti V

机构信息

Dipartimento di Scienze Biomediche, Università di Catania.

出版信息

Proc IEEE Comput Soc Bioinform Conf. 2003;2:326-36.

PMID:16452808
Abstract

In this paper we present a new Multiple Sequence Alignment (MSA) algorithm called AntiClusAl. The method makes use of the commonly use idea of aligning homologous sequences belonging to classes generated by some clustering algorithm, and then continue the alignment process ina bottom-up way along a suitable tree structure. The final result is then read at the root of the tree. Multiple sequence alignment in each cluster makes use of the progressive alignment with the 1-median (center) of the cluster. The 1-median of set S of sequences is the element of S which minimizes the average distance from any other sequence in S. Its exact computation requires quadratic time. The basic idea of our proposed algorithm is to make use of a simple and natural algorithmic technique based on randomized tournaments which has been successfully applied to large size search problems in general metric spaces. In particular a clustering algorithm called Antipole tree and an approximate linear 1-median computation are used. Our algorithm compared with Clustal W, a widely used tool to MSA, shows a better running time results with fully comparable alignment quality. A successful biological application showing high aminoacid conservation during evolution of Xenopus laevis SOD2 is also cited.

摘要

在本文中,我们提出了一种名为AntiClusAl的新的多序列比对(MSA)算法。该方法利用了一种常用的思路,即对属于由某种聚类算法生成的类别的同源序列进行比对,然后沿着合适的树结构以自底向上的方式继续比对过程。最终结果在树的根节点处读取。每个聚类中的多序列比对利用与聚类的1-中位数(中心)的渐进比对。序列集合S的1-中位数是S中的元素,它使与S中任何其他序列的平均距离最小化。其精确计算需要二次时间。我们提出的算法的基本思想是利用一种基于随机锦标赛的简单自然的算法技术,该技术已成功应用于一般度量空间中的大规模搜索问题。特别地,使用了一种名为反极树的聚类算法和一种近似线性的1-中位数计算。我们的算法与广泛用于多序列比对的工具Clustal W相比,在比对质量完全可比的情况下,显示出更好的运行时间结果。还引用了一个成功的生物学应用,该应用显示了非洲爪蟾SOD2在进化过程中的高氨基酸保守性。

相似文献

1
AntiClustal: Multiple Sequence Alignment by antipole clustering and linear approximate 1-median computation.AntiClustal:通过反极聚类和线性近似1-中位数计算进行多序列比对。
Proc IEEE Comput Soc Bioinform Conf. 2003;2:326-36.
2
Efficient constrained multiple sequence alignment with performance guarantee.具有性能保证的高效约束多序列比对
Proc IEEE Comput Soc Bioinform Conf. 2003;2:337-46.
3
A comparative analysis of multiple sequence alignments for biological data.生物数据多序列比对的比较分析。
Biomed Mater Eng. 2015;26 Suppl 1:S1781-9. doi: 10.3233/BME-151479.
4
A probabilistic coding based quantum genetic algorithm for multiple sequence alignment.一种基于概率编码的用于多序列比对的量子遗传算法。
Comput Syst Bioinformatics Conf. 2008;7:15-26.
5
CLAGen: a tool for clustering and annotating gene sequences using a suffix tree algorithm.CLAGen:一种使用后缀树算法对基因序列进行聚类和注释的工具。
Biosystems. 2006 Jun;84(3):175-82. doi: 10.1016/j.biosystems.2005.11.001. Epub 2005 Dec 27.
6
PartTree: an algorithm to build an approximate tree from a large number of unaligned sequences.PartTree:一种从大量未比对序列构建近似树的算法。
Bioinformatics. 2007 Feb 1;23(3):372-4. doi: 10.1093/bioinformatics/btl592. Epub 2006 Nov 21.
7
High similarity sequence comparison in clustering large sequence databases.在大型序列数据库聚类中的高相似性序列比较。
Proc IEEE Comput Soc Bioinform Conf. 2002;1:228-36.
8
DIALIGN-T: an improved algorithm for segment-based multiple sequence alignment.DIALIGN-T:一种改进的基于片段的多序列比对算法。
BMC Bioinformatics. 2005 Mar 22;6:66. doi: 10.1186/1471-2105-6-66.
9
A simple genetic algorithm for multiple sequence alignment.一种用于多序列比对的简单遗传算法。
Genet Mol Res. 2007 Oct 5;6(4):964-82.
10
Multiple Sequence Alignment Computation Using the T-Coffee Regressive Algorithm Implementation.使用T-Coffee回归算法实现的多序列比对计算
Methods Mol Biol. 2021;2231:89-97. doi: 10.1007/978-1-0716-1036-7_6.

引用本文的文献

1
Identifying heterogeneous subtypes of gastric cancer and subtype‑specific subpaths of microRNA‑target pathways.鉴定胃癌的异质亚型和 microRNA 靶途径的亚型特异性亚路径。
Mol Med Rep. 2018 Mar;17(3):3583-3590. doi: 10.3892/mmr.2017.8329. Epub 2017 Dec 20.
2
Identification of functional pathways associated with the conditional ablation of serum response factor in Dstncorn1 mice.Dstncorn1小鼠中与血清反应因子条件性缺失相关的功能通路的鉴定。
Mol Med Rep. 2017 Jan;15(1):139-145. doi: 10.3892/mmr.2016.5984. Epub 2016 Dec 5.
3
Identification potential biomarkers in pulmonary tuberculosis and latent infection based on bioinformatics analysis.
基于生物信息学分析鉴定肺结核和潜伏感染中的潜在生物标志物。
BMC Infect Dis. 2016 Sep 21;16(1):500. doi: 10.1186/s12879-016-1822-6.
4
Analysis of gene expression profile identifies potential biomarkers for atherosclerosis.基因表达谱分析可识别动脉粥样硬化的潜在生物标志物。
Mol Med Rep. 2016 Oct;14(4):3052-8. doi: 10.3892/mmr.2016.5650. Epub 2016 Aug 19.