用于优化与成对序列比对动态规划相关评分函数的无导数神经网络。

Derivative-free neural network for optimizing the scoring functions associated with dynamic programming of pairwise-profile alignment.

作者信息

Yamada Kazunori D

机构信息

1Graduate School of Information Sciences, Tohoku University, 6-3-09, Aramaki-Aza-Aoba, Aoba-ku, Sendai, 980-8579 Japan.

2Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST), Tokyo, Japan.

出版信息

Algorithms Mol Biol. 2018 Feb 15;13:5. doi: 10.1186/s13015-018-0123-6. eCollection 2018.

DOI:10.1186/s13015-018-0123-6

PMID:29467815

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5815186/

Abstract

BACKGROUND

A profile-comparison method with position-specific scoring matrix (PSSM) is among the most accurate alignment methods. Currently, cosine similarity and correlation coefficients are used as scoring functions of dynamic programming to calculate similarity between PSSMs. However, it is unclear whether these functions are optimal for profile alignment methods. By definition, these functions cannot capture nonlinear relationships between profiles. Therefore, we attempted to discover a novel scoring function, which was more suitable for the profile-comparison method than existing functions, using neural networks.

RESULTS

Although neural networks required derivative-of-cost functions, the problem being addressed in this study lacked them. Therefore, we implemented a novel derivative-free neural network by combining a conventional neural network with an evolutionary strategy optimization method used as a solver. Using this novel neural network system, we optimized the scoring function to align remote sequence pairs. Our results showed that the pairwise-profile aligner using the novel scoring function significantly improved both alignment sensitivity and precision relative to aligners using existing functions.

CONCLUSIONS

We developed and implemented a novel derivative-free neural network and aligner (Nepal) for optimizing sequence alignments. Nepal improved alignment quality by adapting to remote sequence alignments and increasing the expressiveness of similarity scores. Additionally, this novel scoring function can be realized using a simple matrix operation and easily incorporated into other aligners. Moreover our scoring function could potentially improve the performance of homology detection and/or multiple-sequence alignment of remote homologous sequences. The goal of the study was to provide a novel scoring function for profile alignment method and develop a novel learning system capable of addressing derivative-free problems. Our system is capable of optimizing the performance of other sophisticated methods and solving problems without derivative-of-cost functions, which do not always exist in practical problems. Our results demonstrated the usefulness of this optimization method for derivative-free problems.

摘要

背景

带有位置特异性评分矩阵（PSSM）的轮廓比较方法是最精确的比对方法之一。目前，余弦相似度和相关系数被用作动态规划的评分函数来计算PSSM之间的相似度。然而，尚不清楚这些函数对于轮廓比对方法是否是最优的。根据定义，这些函数无法捕捉轮廓之间的非线性关系。因此，我们试图使用神经网络发现一种比现有函数更适合轮廓比较方法的新型评分函数。

结果

尽管神经网络需要代价函数的导数，但本研究中要解决的问题却没有。因此，我们通过将传统神经网络与用作求解器的进化策略优化方法相结合，实现了一种新型的无导数神经网络。使用这个新型神经网络系统，我们优化了评分函数以比对远缘序列对。我们的结果表明，与使用现有函数的比对器相比，使用新型评分函数的成对轮廓比对器在比对灵敏度和精度方面均有显著提高。

结论

我们开发并实现了一种用于优化序列比对的新型无导数神经网络和比对器（Nepal）。Nepal通过适应远缘序列比对并提高相似性得分的表现力来提高比对质量。此外，这种新型评分函数可以通过简单的矩阵运算实现，并易于整合到其他比对器中。而且我们的评分函数有可能提高远缘同源序列的同源性检测和/或多序列比对的性能。本研究的目标是为轮廓比对方法提供一种新型评分函数，并开发一种能够解决无导数问题的新型学习系统。我们的系统能够优化其他复杂方法的性能，并解决没有代价函数导数的问题，而这些问题在实际问题中并不总是存在的。我们的结果证明了这种优化方法对于无导数问题的有效性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/68f9/5815186/a34083e9ee73/13015_2018_123_Fig1_HTML.jpg

相似文献

Derivative-free neural network for optimizing the scoring functions associated with dynamic programming of pairwise-profile alignment.

Algorithms Mol Biol. 2018 Feb 15;13:5. doi: 10.1186/s13015-018-0123-6. eCollection 2018.

Improving the alignment quality of consistency based aligners with an evaluation function using synonymous protein words.

PLoS One. 2011;6(12):e27872. doi: 10.1371/journal.pone.0027872. Epub 2011 Dec 2.

High quality protein sequence alignment by combining structural profile prediction and profile alignment using SABER-TOOTH.

BMC Bioinformatics. 2010 May 14;11:251. doi: 10.1186/1471-2105-11-251.

Protein sequence alignment with family-specific amino acid similarity matrices.

BMC Res Notes. 2011 Aug 16;4:296. doi: 10.1186/1756-0500-4-296.

STRUCTFAST: protein sequence remote homology detection and alignment using novel dynamic programming and profile-profile scoring.

Proteins. 2006 Sep 1;64(4):960-7. doi: 10.1002/prot.21049.

Pareto optimal pairwise sequence alignment.

IEEE/ACM Trans Comput Biol Bioinform. 2013 Mar-Apr;10(2):481-93. doi: 10.1109/TCBB.2013.2.

A comparison of scoring functions for protein sequence profile alignment.

Bioinformatics. 2004 May 22;20(8):1301-8. doi: 10.1093/bioinformatics/bth090. Epub 2004 Feb 12.

Fuse: multiple network alignment via data fusion.

Bioinformatics. 2016 Apr 15;32(8):1195-203. doi: 10.1093/bioinformatics/btv731. Epub 2015 Dec 14.

Global multiple protein-protein interaction network alignment by combining pairwise network alignments.

BMC Bioinformatics. 2015;16 Suppl 13(Suppl 13):S11. doi: 10.1186/1471-2105-16-S13-S11. Epub 2015 Sep 25.

ReformAlign: improved multiple sequence alignments using a profile-based meta-alignment approach.

BMC Bioinformatics. 2014 Aug 7;15(1):265. doi: 10.1186/1471-2105-15-265.

引用本文的文献

De novo profile generation based on sequence context specificity with the long short-term memory network.

BMC Bioinformatics. 2018 Jul 18;19(1):272. doi: 10.1186/s12859-018-2284-1.

本文引用的文献

Sequence-based prediction of protein protein interaction using a deep-learning algorithm.

BMC Bioinformatics. 2017 May 25;18(1):277. doi: 10.1186/s12859-017-1700-2.

DeepPPI: Boosting Prediction of Protein-Protein Interactions with Deep Neural Networks.

J Chem Inf Model. 2017 Jun 26;57(6):1499-1510. doi: 10.1021/acs.jcim.7b00028. Epub 2017 May 26.

Capturing non-local interactions by long short-term memory bidirectional recurrent neural networks for improving prediction of protein secondary structure, backbone angles, contact numbers and solvent accessibility.

Bioinformatics. 2017 Sep 15;33(18):2842-2849. doi: 10.1093/bioinformatics/btx218.

Protein Secondary Structure Prediction Using Deep Convolutional Neural Fields.

Sci Rep. 2016 Jan 11;6:18962. doi: 10.1038/srep18962.

Determinants of the rate of protein sequence evolution.

Nat Rev Genet. 2015 Jul;16(7):409-20. doi: 10.1038/nrg3950. Epub 2015 Jun 9.

Deep learning.

Nature. 2015 May 28;521(7553):436-44. doi: 10.1038/nature14539.

A Deep Learning Network Approach to ab initio Protein Secondary Structure Prediction.

IEEE/ACM Trans Comput Biol Bioinform. 2015 Jan-Feb;12(1):103-12. doi: 10.1109/TCBB.2014.2343960. Epub 2014 Aug 7.

MRFalign: protein homology detection through alignment of Markov random fields.

PLoS Comput Biol. 2014 Mar 27;10(3):e1003500. doi: 10.1371/journal.pcbi.1003500. eCollection 2014 Mar.

Revisiting amino acid substitution matrices for identifying distantly related proteins.

Bioinformatics. 2014 Feb 1;30(3):317-25. doi: 10.1093/bioinformatics/btt694. Epub 2013 Nov 26.

Discriminative modelling of context-specific amino acid substitution probabilities.

Bioinformatics. 2012 Dec 15;28(24):3240-7. doi: 10.1093/bioinformatics/bts622. Epub 2012 Oct 17.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

用于优化与成对序列比对动态规划相关评分函数的无导数神经网络。

Derivative-free neural network for optimizing the scoring functions associated with dynamic programming of pairwise-profile alignment.

作者信息

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献