School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA.
Methods Mol Biol. 2022;2499:221-260. doi: 10.1007/978-1-0716-2317-6_12.
Protein posttranslational modifications (PTMs) are a rapidly expanding feature class of significant importance in cell biology. Due to a high burden of experimental proof, the number of functionals PTMs in the eukaryotic proteome is currently underestimated. Furthermore, not all PTMs are functionally equivalent. Computational approaches that can confidently recommend PTMs of probable function can improve the heuristics of PTM investigation and alleviate these problems. To address this need, we developed SAPH-ire: a multifeature heuristic neural network model that takes community wisdom into account by recommending experimental PTMs similar to those which have previously been established as having regulatory impact. Here, we describe the principle behind the SAPH-ire model, how it is developed, how we evaluate its performance, and important caveats to consider when building and interpreting such models. Finally, we discus current limitations of functional PTM prediction models and highlight potential mechanisms for their improvement.
蛋白质翻译后修饰(PTMs)是细胞生物学中一个迅速发展的重要特征类别。由于实验验证的负担很重,目前真核蛋白质组中功能性 PTMs 的数量被低估了。此外,并非所有的 PTMs 都具有同等的功能。能够自信地推荐可能具有功能的 PTMs 的计算方法可以改进 PTM 研究的启发式方法,并缓解这些问题。为了满足这一需求,我们开发了 SAPH-ire:一种多特征启发式神经网络模型,通过推荐与先前被确定为具有监管影响的实验性 PTMs 相似的 PTMs,考虑到了社区的智慧。在这里,我们描述了 SAPH-ire 模型背后的原理、它是如何开发的、我们如何评估它的性能,以及在构建和解释此类模型时需要考虑的重要注意事项。最后,我们讨论了目前功能 PTM 预测模型的局限性,并强调了提高其性能的潜在机制。