An enhanced algorithm for multiple sequence alignment of protein sequences using genetic algorithm.

Author information

Kumar Manish

Affiliation

Department of Computer Science and Engineering, Indian School of Mines, Dhanbad, Jharkhand, India.

Publication information

EXCLI J. 2015 Dec 15;14:1232-55. doi: 10.17179/excli2015-302. eCollection 2015.

DOI:10.17179/excli2015-302

PMID:27065770

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC4820728/

Abstract

One of the most fundamental operations in biological sequence analysis is multiple sequence alignment (MSA). The goal of the MSA problem is to determine the most biologically plausible alignment of a set of protein or DNA sequences. This paper proposes a method that uses a genetic algorithm for multiple sequence alignment. Two genetic operators, namely crossover and mutation, were defined and implemented in the proposed method in order to study the evolution of the population and the quality of the aligned sequences. The method is assessed on protein benchmark data such as BALIBASE by comparing its results with those of other alignment algorithms, including SAGA, RBT-GA, PRRP, HMMT, SB-PIMA, CLUSTALX, CLUSTAL W, DIALIGN and PILEUP8. Experiments on a wide range of data show that the proposed algorithm achieves much better alignment quality (in terms of score) than previously proposed algorithms.
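The abstract does not describe the chromosome encoding, the details of the operators, or the scoring function, so the following is only a minimal sketch of how a genetic algorithm can be applied to MSA and should not be read as the paper's method. It assumes a sum-of-pairs objective with hypothetical match, mismatch and gap scores, a row-wise one-point crossover, and a mutation that moves a single gap. All function names and parameter values are illustrative.

```python
import random

GAP = "-"

def sp_score(alignment, match=1, mismatch=-1, gap=-2):
    """Sum-of-pairs score over all columns and row pairs (assumed scoring values)."""
    total = 0
    for column in zip(*alignment):
        for i in range(len(column)):
            for j in range(i + 1, len(column)):
                a, b = column[i], column[j]
                if a == GAP and b == GAP:
                    continue  # gap-gap pairs contribute nothing
                if a == GAP or b == GAP:
                    total += gap
                elif a == b:
                    total += match
                else:
                    total += mismatch
    return total

def random_alignment(seqs, extra, rng):
    """Pad every sequence to a common width by inserting gaps at random positions."""
    width = max(len(s) for s in seqs) + extra
    rows = []
    for s in seqs:
        chars = list(s)
        for _ in range(width - len(s)):
            chars.insert(rng.randint(0, len(chars)), GAP)
        rows.append("".join(chars))
    return rows

def crossover(parent_a, parent_b, rng):
    """Row-wise one-point crossover: top rows from one parent, the rest from the other."""
    cut = rng.randint(1, len(parent_a) - 1)  # needs at least two sequences
    return parent_a[:cut] + parent_b[cut:]

def mutate(alignment, rng):
    """Move one gap in one randomly chosen row to a new random position."""
    rows = list(alignment)
    r = rng.randrange(len(rows))
    chars = list(rows[r])
    gap_positions = [i for i, c in enumerate(chars) if c == GAP]
    if gap_positions:
        chars.pop(rng.choice(gap_positions))
        chars.insert(rng.randint(0, len(chars)), GAP)
    rows[r] = "".join(chars)
    return rows

def evolve(seqs, pop_size=30, generations=200, extra=2, mutation_rate=0.3, seed=0):
    """Evolve a population of candidate alignments and return the best one found."""
    rng = random.Random(seed)
    population = [random_alignment(seqs, extra, rng) for _ in range(pop_size)]
    for _ in range(generations):
        population.sort(key=sp_score, reverse=True)
        survivors = population[: pop_size // 2]  # truncation selection, keeps the best
        children = []
        while len(survivors) + len(children) < pop_size:
            a, b = rng.sample(survivors, 2)
            child = crossover(a, b, rng)
            if rng.random() < mutation_rate:
                child = mutate(child, rng)
            children.append(child)
        population = survivors + children
    return max(population, key=sp_score)

# Toy input: three short, made-up peptide strings (not benchmark data).
best = evolve(["MKVLAT", "MKLAT", "MVLT"])
for row in best:
    print(row)
print("sum-of-pairs score:", sp_score(best))
```

Because every row keeps the same width and only gap positions change, both operators always produce a valid alignment of the original sequences. A realistic implementation would typically replace the flat match/mismatch values with a substitution matrix and affine gap penalties.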

Similar articles

Vertical decomposition with Genetic Algorithm for Multiple Sequence Alignment.

BMC Bioinformatics. 2011 Aug 25;12:353. doi: 10.1186/1471-2105-12-353.

Optimizing multiple sequence alignments using a genetic algorithm based on three objectives: structural information, non-gaps percentage and totally conserved columns.

Bioinformatics. 2013 Sep 1;29(17):2112-21. doi: 10.1093/bioinformatics/btt360. Epub 2013 Jun 21.

A Novel Approach to Multiple Sequence Alignment Using Multiobjective Evolutionary Algorithm Based on Decomposition.

IEEE J Biomed Health Inform. 2016 Mar;20(2):717-27. doi: 10.1109/JBHI.2015.2403397. Epub 2015 Feb 12.

An approach for COFFEE objective function to global DNA multiple sequence alignment.

Comput Biol Chem. 2018 Aug;75:39-44. doi: 10.1016/j.compbiolchem.2018.04.012. Epub 2018 Apr 25.

A simple genetic algorithm for multiple sequence alignment.

Genet Mol Res. 2007 Oct 5;6(4):964-82.

Multiple sequence alignment using multi-objective based bacterial foraging optimization algorithm.

Biosystems. 2016 Dec;150:177-189. doi: 10.1016/j.biosystems.2016.10.005. Epub 2016 Oct 23.

Resolving the multiple sequence alignment problem using biogeography-based optimization with multiple populations.

J Bioinform Comput Biol. 2015 Aug;13(4):1550016. doi: 10.1142/S021972001550016X. Epub 2015 Apr 30.

A probabilistic coding based quantum genetic algorithm for multiple sequence alignment.

Comput Syst Bioinformatics Conf. 2008;7:15-26.

RBT-GA: a novel metaheuristic for solving the Multiple Sequence Alignment problem.

BMC Genomics. 2009 Jul 7;10 Suppl 1(Suppl 1):S10. doi: 10.1186/1471-2164-10-S1-S10.

References cited in this article

An improved scoring method for protein residue conservation and multiple sequence alignment.

IEEE Trans Nanobioscience. 2011 Dec;10(4):275-85. doi: 10.1109/TNB.2011.2179553.

iPBA: a tool for protein structure comparison using sequence alignment strategies.

Nucleic Acids Res. 2011 Jul;39(Web Server issue):W18-23. doi: 10.1093/nar/gkr333. Epub 2011 May 17.

A comprehensive benchmark study of multiple sequence alignment methods: current challenges and future perspectives.

PLoS One. 2011 Mar 31;6(3):e18093. doi: 10.1371/journal.pone.0018093.

Prediction of missense mutation functionality depends on both the algorithm and sequence alignment employed.

Hum Mutat. 2011 Jun;32(6):661-8. doi: 10.1002/humu.21490. Epub 2011 Apr 7.

More than 1,001 problems with protein domain databases: transmembrane regions, signal peptides and the issue of sequence homology.

PLoS Comput Biol. 2010 Jul 29;6(7):e1000867. doi: 10.1371/journal.pcbi.1000867.

Issues in bioinformatics benchmarking: the case study of multiple sequence alignment.

Nucleic Acids Res. 2010 Nov;38(21):7353-63. doi: 10.1093/nar/gkq625. Epub 2010 Jul 17.

Heuristic reusable dynamic programming: efficient updates of local sequence alignment.

IEEE/ACM Trans Comput Biol Bioinform. 2009 Oct-Dec;6(4):570-82. doi: 10.1109/TCBB.2009.30.

RBT-GA: a novel metaheuristic for solving the Multiple Sequence Alignment problem.

BMC Genomics. 2009 Jul 7;10 Suppl 1(Suppl 1):S10. doi: 10.1186/1471-2164-10-S1-S10.

Phylogeny-aware gap placement prevents errors in sequence alignment and evolutionary analysis.

Science. 2008 Jun 20;320(5883):1632-5. doi: 10.1126/science.1158395.

Bioinformatics challenges of new sequencing technology.

Trends Genet. 2008 Mar;24(3):142-9. doi: 10.1016/j.tig.2007.12.006. Epub 2008 Feb 11.
