Biocomputing Group, Department of Pharmacy and Biotechnology, University of Bologna, Italy.
Biocomputing Group, Department of Pharmacy and Biotechnology, University of Bologna, Italy.
J Mol Biol. 2021 May 28;433(11):166729. doi: 10.1016/j.jmb.2020.166729. Epub 2020 Dec 3.
TransMembrane β-Barrel (TMBB) proteins located in the outer membranes of Gram-negative bacteria are crucial for many important biological processes and primary candidates as drug targets. Structure determination of TMBB proteins is challenging and hence computational methods devised for the analysis of TMBB proteins are important for complementing experimental approaches. Here, we present a novel web server called BetAware-Deep that is able to accurately identify the topology of TMBB proteins (i.e. the number and orientation of membrane-spanning segments along the protein sequence) and to discriminate them from other protein types. The method in BetAware-Deep defines new features by exploiting a non-canonical computation of the hydrophobic moment and by adopting sequence-profile weighting of the White&Wimley hydrophobicity scale. These features are processed using a two-step approach based on deep learning and probabilistic graphical models. BetAware-Deep has been trained on a dataset comprising 58 TMBBs and benchmarked on a novel set of 15 TMBB proteins. Results showed that BetAware-Deep outperforms two recently released state-of-the-art methods for topology prediction, predicting correct topologies of 10 out of 15 proteins. TMBB detection was also assessed on a larger dataset comprising 1009 TMBB proteins and 7571 non-TMBB proteins. Even in this benchmark, BetAware-Deep scored at the level of top-performing methods. A web server has been developed allowing users to analyze input protein sequences and providing topology prediction together with a rich set of information including a graphical representation of the residue-level annotations and prediction probabilities. BetAware-Deep is available at https://busca.biocomp.unibo.it/betaware2.
跨膜β-桶(TMBB)蛋白位于革兰氏阴性菌的外膜中,对于许多重要的生物学过程至关重要,是药物靶点的主要候选者。TMBB 蛋白的结构测定具有挑战性,因此,设计用于分析 TMBB 蛋白的计算方法对于补充实验方法非常重要。在这里,我们提出了一个名为 BetAware-Deep 的新的网络服务器,它能够准确地识别 TMBB 蛋白的拓扑结构(即沿蛋白质序列的跨膜片段的数量和方向),并将其与其他蛋白质类型区分开来。BetAware-Deep 中的方法通过利用非规范的疏水矩计算和采用 White&Wimley 疏水性标度的序列轮廓加权来定义新的特征。这些特征使用基于深度学习和概率图形模型的两步方法进行处理。BetAware-Deep 是在包含 58 个 TMBB 的数据集上进行训练的,并在一个由 15 个 TMBB 蛋白组成的新数据集上进行了基准测试。结果表明,BetAware-Deep 优于两种最近发布的拓扑预测的最先进方法,能够正确预测 15 个蛋白质中的 10 个。在包含 1009 个 TMBB 蛋白和 7571 个非 TMBB 蛋白的更大数据集上也评估了 TMBB 检测。即使在这个基准中,BetAware-Deep 的得分也达到了表现最好的方法的水平。开发了一个网络服务器,允许用户分析输入的蛋白质序列,并提供拓扑预测以及包括残基级注释和预测概率的图形表示在内的丰富信息。BetAware-Deep 可在 https://busca.biocomp.unibo.it/betaware2 上获得。