一种多方法引导的遗传算法及其在操纵子预测中的应用。

A multi-approaches-guided genetic algorithm with application to operon prediction.

作者信息

Wang Shuqin, Wang Yan, Du Wei, Sun Fangxun, Wang Xiumei, Zhou Chunguang, Liang Yanchun

机构信息

College of Computer Science and Technology, Jilin University, Key Laboratory of Symbol Computation and Knowledge Engineering of the Ministry of Education, Changchun 130012, China.

出版信息

Artif Intell Med. 2007 Oct;41(2):151-9. doi: 10.1016/j.artmed.2007.07.010. Epub 2007 Sep 14.

DOI:10.1016/j.artmed.2007.07.010

PMID:17869072

Abstract

OBJECTIVE

The prediction of operons is critical to the reconstruction of regulatory networks at the whole genome level. Multiple genome features have been used for predicting operons. However, multiple genome features are usually dealt with using only single method in the literatures. The aim of this paper is to develop a combined method for operon prediction by using different methods to preprocess different genome features in order for exerting their unique characteristics.

METHODS

A novel multi-approach-guided genetic algorithm for operon prediction is presented. We exploit different methods for intergenic distance, cluster of orthologous groups (COG) gene functions, metabolic pathway and microarray expression data. A novel local-entropy-minimization method is proposed to partition intergenic distance. Our program can be used for other newly sequenced genomes by transferring the knowledge that has been obtained from Escherichia coli data. We calculate the log-likelihood for COG gene functions and Pearson correlation coefficient for microarray expression data. The genetic algorithm is used for integrating the four types of data.

RESULTS

The proposed method is examined on E. coli K12 genome, Bacillus subtilis genome, and Pseudomonas aeruginosa PAO1 genome. The accuracies of prediction for these three genomes are 85.9987%, 88.296%, and 81.2384%, respectively.

CONCLUSION

Simulated experimental results demonstrate that in the genetic algorithm the preprocessing for genome data using multiple approaches ensures the effective utilization of different biological characteristics. Experimental results also show that the proposed method is applicable for predicting operons in prokaryote.

摘要

目的

操纵子预测对于全基因组水平调控网络的重建至关重要。多种基因组特征已被用于预测操纵子。然而，在文献中多种基因组特征通常仅用单一方法处理。本文的目的是开发一种组合方法，通过使用不同方法对不同基因组特征进行预处理，以发挥它们的独特特性来进行操纵子预测。

方法

提出了一种用于操纵子预测的新型多方法引导遗传算法。我们利用不同方法处理基因间距离、直系同源簇（COG）基因功能、代谢途径和微阵列表达数据。提出了一种新的局部熵最小化方法来划分基因间距离。通过转移从大肠杆菌数据中获得的知识，我们的程序可用于其他新测序的基因组。我们计算COG基因功能的对数似然值和微阵列表达数据的皮尔逊相关系数。遗传算法用于整合这四类数据。

结果

在大肠杆菌K12基因组、枯草芽孢杆菌基因组和铜绿假单胞菌PAO1基因组上检验了所提出的方法。这三个基因组的预测准确率分别为85.9987%、88.296%和81.2384%。

结论

模拟实验结果表明，在遗传算法中使用多种方法对基因组数据进行预处理可确保有效利用不同的生物学特征。实验结果还表明所提出的方法适用于预测原核生物中的操纵子。

相似文献

A multi-approaches-guided genetic algorithm with application to operon prediction.一种多方法引导的遗传算法及其在操纵子预测中的应用。

Artif Intell Med. 2007 Oct;41(2):151-9. doi: 10.1016/j.artmed.2007.07.010. Epub 2007 Sep 14.

Computational prediction of operons in Synechococcus sp. WH8102.聚球藻属WH8102中操纵子的计算预测

Genome Inform. 2004;15(2):211-22.

Features for computational operon prediction in prokaryotes.原核生物计算操纵子预测的特征。

Brief Funct Genomics. 2012 Jul;11(4):291-9. doi: 10.1093/bfgp/els024. Epub 2012 Jun 28.

Detection of operons.操纵子的检测

Proteins. 2006 Aug 15;64(3):615-28. doi: 10.1002/prot.21021.

Bridge and brick network motifs: identifying significant building blocks from complex biological systems.桥接与砖块网络基序：从复杂生物系统中识别重要构建模块

Artif Intell Med. 2007 Oct;41(2):117-27. doi: 10.1016/j.artmed.2007.07.006. Epub 2007 Sep 7.

Operon prediction based on SVM.基于支持向量机的操纵子预测。

Comput Biol Chem. 2006 Jun;30(3):233-40. doi: 10.1016/j.compbiolchem.2006.03.002. Epub 2006 May 23.

Genome-wide partial correlation analysis of Escherichia coli microarray data.大肠杆菌微阵列数据的全基因组偏相关分析

Genet Mol Res. 2007 Oct 5;6(4):730-42.

A novel method for accurate operon predictions in all sequenced prokaryotes.一种用于在所有已测序原核生物中进行准确操纵子预测的新方法。

Nucleic Acids Res. 2005 Feb 8;33(3):880-92. doi: 10.1093/nar/gki232. Print 2005.

Inferring large-scale gene regulatory networks using a low-order constraint-based algorithm.使用基于低阶约束的算法推断大规模基因调控网络。

Mol Biosyst. 2010 Jun;6(6):988-98. doi: 10.1039/b917571g. Epub 2010 Feb 19.

The condition-dependent transcriptional network in Escherichia coli.大肠杆菌中条件依赖型转录网络。

Ann N Y Acad Sci. 2009 Mar;1158:29-35. doi: 10.1111/j.1749-6632.2008.03746.x.

引用本文的文献

Detecting operons in bacterial genomes via visual representation learning.通过可视化表示学习检测细菌基因组中的操纵子。

Sci Rep. 2021 Jan 22;11(1):2124. doi: 10.1038/s41598-021-81169-9.

Cautions about the reliability of pairwise gene correlations based on expression data.基于表达数据的成对基因相关性可靠性的注意事项。

Front Microbiol. 2015 Jun 26;6:650. doi: 10.3389/fmicb.2015.00650. eCollection 2015.

Single nucleotide polymorphism barcoding to evaluate oral cancer risk using odds ratio-based genetic algorithms.基于优势比值遗传算法的单核苷酸多态性条码评估口腔癌风险

Kaohsiung J Med Sci. 2012 Jul;28(7):362-8. doi: 10.1016/j.kjms.2012.02.002. Epub 2012 May 14.

Binary particle swarm optimization for operon prediction.二进制粒子群优化算法在操纵子预测中的应用。

Nucleic Acids Res. 2010 Jul;38(12):e128. doi: 10.1093/nar/gkq204. Epub 2010 Apr 12.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

一种多方法引导的遗传算法及其在操纵子预测中的应用。

A multi-approaches-guided genetic algorithm with application to operon prediction.

作者信息

机构信息

出版信息

OBJECTIVE

METHODS

RESULTS

CONCLUSION

目的

方法

结果

结论

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献