Suppr超能文献

一种使用遗传算法的蛋白质序列多序列比对增强算法。

An enhanced algorithm for multiple sequence alignment of protein sequences using genetic algorithm.

作者信息

Kumar Manish

机构信息

Department of Computer Science and Engineering, Indian School of Mines, Dhanbad, Jharkhand, India.

出版信息

EXCLI J. 2015 Dec 15;14:1232-55. doi: 10.17179/excli2015-302. eCollection 2015.

Abstract

One of the most fundamental operations in biological sequence analysis is multiple sequence alignment (MSA). The basic of multiple sequence alignment problems is to determine the most biologically plausible alignments of protein or DNA sequences. In this paper, an alignment method using genetic algorithm for multiple sequence alignment has been proposed. Two different genetic operators mainly crossover and mutation were defined and implemented with the proposed method in order to know the population evolution and quality of the sequence aligned. The proposed method is assessed with protein benchmark dataset, e.g., BALIBASE, by comparing the obtained results to those obtained with other alignment algorithms, e.g., SAGA, RBT-GA, PRRP, HMMT, SB-PIMA, CLUSTALX, CLUSTAL W, DIALIGN and PILEUP8 etc. Experiments on a wide range of data have shown that the proposed algorithm is much better (it terms of score) than previously proposed algorithms in its ability to achieve high alignment quality.

摘要

多重序列比对(MSA)是生物序列分析中最基本的操作之一。多重序列比对问题的基础是确定蛋白质或DNA序列最符合生物学原理的比对方式。本文提出了一种使用遗传算法进行多重序列比对的方法。定义并实现了两种不同的遗传算子,主要是交叉和变异,通过该方法来了解序列比对的群体进化和质量。通过将所得结果与其他比对算法(如SAGA、RBT-GA、PRRP、HMMT、SB-PIMA、CLUSTALX、CLUSTAL W、DIALIGN和PILEUP8等)所得结果进行比较,使用蛋白质基准数据集(如BALIBASE)对所提出的方法进行评估。在广泛的数据上进行的实验表明,所提出的算法在实现高比对质量的能力方面(就得分而言)比先前提出的算法要好得多。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/11c8/4820728/ec6b5b5c1e4c/EXCLI-14-1232-t-001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验