Taneda Akito
Department of Electronic and Information System Engineering, Faculty of Science and Technology, Hirosaki University, Hirosaki 036-8561, Japan.
Comput Biol Chem. 2005 Apr;29(2):111-9. doi: 10.1016/j.compbiolchem.2005.02.004.
In order to predict non-coding RNA genes and functions on the basis of genome sequences, accurate secondary structure prediction is useful. Although single-sequence folding programs such as mfold have been successful, it is of great importance to develop a novel approach for further improvement of the prediction performance. In the present paper, a secondary structure prediction method based on genetic algorithm, Cofolga, is proposed. The program developed performs folding and alignment of two homologous RNAs simultaneously. Cofolga was tested with a dataset composed of 13 tRNAs, seven 5S rRNAs, five RNase P RNAs, and five SRP RNAs; as a result, it turned out that the average prediction accuracies for the tRNAs, 5S rRNAs, RNase P RNAs, and SRP RNAs obtained by Cofolga with an optimal weight factor and default parameters were 83.6, 81.8, 73.5, and 67.7%, respectively. These results were superior to those obtained by a single-sequence folding based on free-energy minimization in which corresponding average prediction accuracies were 52.4, 47.4, 57.7, and 52.3%, respectively. Cofolga has a post-processing in which a single-sequence folding is performed after fixation of a predicted common structure; this post-processing enables Cofolga to predict a structure that is present in one of two RNAs alone. The executable files of Cofolga (for Windows/Unix/Mac) can be obtained by an e-mail request.
为了基于基因组序列预测非编码RNA基因及其功能,准确的二级结构预测很有用。尽管诸如mfold之类的单序列折叠程序已经取得成功,但开发一种新方法以进一步提高预测性能非常重要。在本文中,提出了一种基于遗传算法的二级结构预测方法Cofolga。所开发的程序可同时对两个同源RNA进行折叠和比对。使用由13个tRNA、7个5S rRNA、5个RNase P RNA和5个SRP RNA组成的数据集对Cofolga进行了测试;结果表明,Cofolga在最佳权重因子和默认参数下获得的tRNA、5S rRNA、RNase P RNA和SRP RNA的平均预测准确率分别为83.6%、81.8%、73.5%和67.7%。这些结果优于基于自由能最小化的单序列折叠所获得的结果,后者相应的平均预测准确率分别为52.4%、47.4%、57.7%和52.3%。Cofolga有一个后处理过程,即在固定预测的共同结构后进行单序列折叠;这种后处理使Cofolga能够预测仅存在于两个RNA之一中的结构。可通过电子邮件请求获得Cofolga(适用于Windows/Unix/Mac)的可执行文件。