Li Bian, Mendenhall Jeffrey, Nguyen Elizabeth Dong, Weiner Brian E, Fischer Axel W, Meiler Jens
Department of Chemistry, Vanderbilt University , Nashville, Tennessee 37232, United States.
Center for Structural Biology, Vanderbilt University , Nashville, Tennessee 37232, United States.
J Chem Inf Model. 2016 Feb 22;56(2):423-34. doi: 10.1021/acs.jcim.5b00517. Epub 2016 Feb 5.
Prediction of the three-dimensional (3D) structures of proteins by computational methods is acknowledged as an unsolved problem. Accurate prediction of important structural characteristics such as contact number is expected to accelerate the otherwise slow progress being made in the prediction of 3D structure of proteins. Here, we present a dropout neural network-based method, TMH-Expo, for predicting the contact number of transmembrane helix (TMH) residues from sequence. Neuronal dropout is a strategy where certain neurons of the network are excluded from back-propagation to prevent co-adaptation of hidden-layer neurons. By using neuronal dropout, overfitting was significantly reduced and performance was noticeably improved. For multi-spanning helical membrane proteins, TMH-Expo achieved a remarkable Pearson correlation coefficient of 0.69 between predicted and experimental values and a mean absolute error of only 1.68. In addition, among those membrane protein-membrane protein interface residues, 76.8% were correctly predicted. Mapping of predicted contact numbers onto structures indicates that contact numbers predicted by TMH-Expo reflect the exposure patterns of TMHs and reveal membrane protein-membrane protein interfaces, reinforcing the potential of predicted contact numbers to be used as restraints for 3D structure prediction and protein-protein docking. TMH-Expo can be accessed via a Web server at www.meilerlab.org .
通过计算方法预测蛋白质的三维(3D)结构被认为是一个尚未解决的问题。准确预测诸如接触数等重要结构特征,有望加速在蛋白质3D结构预测方面原本缓慢的进展。在此,我们提出一种基于随机失活神经网络的方法TMH-Expo,用于从序列预测跨膜螺旋(TMH)残基的接触数。神经元随机失活是一种策略,即网络中的某些神经元被排除在反向传播之外,以防止隐藏层神经元的共同适应。通过使用神经元随机失活,显著减少了过拟合并明显提高了性能。对于多跨膜螺旋膜蛋白,TMH-Expo在预测值与实验值之间实现了显著的皮尔逊相关系数0.69,平均绝对误差仅为1.68。此外,在那些膜蛋白-膜蛋白界面残基中,76.8%被正确预测。将预测的接触数映射到结构上表明,TMH-Expo预测的接触数反映了TMH的暴露模式并揭示了膜蛋白-膜蛋白界面,增强了预测接触数用作3D结构预测和蛋白质-蛋白质对接约束的潜力。可通过网站www.meilerlab.org上的网络服务器访问TMH-Expo。