Anbo Hiroto, Amagai Hiroki, Fukuchi Satoshi
Department of Life Science and Informatics, Faculty of Engineering, Maebashi Institute of Technology, Maebashi, Gunma 371-0816, Japan.
Biophys Physicobiol. 2020 Nov 3;17:147-154. doi: 10.2142/biophysico.BSJ-2020026. eCollection 2020.
Intrinsically disordered proteins are those proteins with intrinsically disordered regions. One of the unique characteristics of intrinsically disordered proteins is the existence of functional segments in intrinsically dis-ordered regions. These segments are involved in binding to partner molecules, such as protein and DNA, and play important roles in signaling pathways and/or transcriptional regulation. Although there are databases that gather information on such disordered binding regions, data remain limited. Therefore, it is desirable to develop programs to predict the disordered binding regions without using data for the binding regions. We developed a program, NeProc, to predict the disordered binding regions, which can be regarded as intrinsically disordered regions with a structural propensity. We only used data for the structural domains and intrinsically disordered regions to detect such regions. NeProc accepts a query amino acid sequence converted into a position specific score matrix, and uses two neural networks that employ different window sizes, a neural network of short windows, and a neural network of long windows. The performance of NeProc was comparable to that of existing programs of the disordered binding region prediction. This result presents the possibility to overcome the shortage of the disordered binding region data in the development of the prediction programs for these binding regions. NeProc is available at http://flab.neproc.org/neproc/index.html.
内在无序蛋白质是指那些具有内在无序区域的蛋白质。内在无序蛋白质的独特特征之一是在内在无序区域存在功能片段。这些片段参与与伴侣分子(如蛋白质和DNA)的结合,并在信号通路和/或转录调控中发挥重要作用。尽管有数据库收集关于此类无序结合区域的信息,但数据仍然有限。因此,期望开发不使用结合区域数据来预测无序结合区域的程序。我们开发了一个名为NeProc的程序来预测无序结合区域,该区域可被视为具有结构倾向的内在无序区域。我们仅使用结构域和内在无序区域的数据来检测此类区域。NeProc接受转换为位置特异性评分矩阵的查询氨基酸序列,并使用两个采用不同窗口大小的神经网络,即短窗口神经网络和长窗口神经网络。NeProc的性能与现有的无序结合区域预测程序相当。这一结果表明在开发这些结合区域的预测程序时有可能克服无序结合区域数据不足的问题。NeProc可在http://flab.neproc.org/neproc/index.html上获取。