Amare Andinet, Hummon Amanda B, Southey Bruce R, Zimmerman Tyler A, Rodriguez-Zas Sandra L, Sweedler Jonathan V
Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA.
J Proteome Res. 2006 May;5(5):1162-7. doi: 10.1021/pr0504541.
Neuropeptides are an important class of cell to cell signaling molecules that are difficult to predict from genetic information because of their large number of post-translational modifications. The transition from prohormone genetic sequence information to the determination of the biologically active neuropeptides requires the identification of the cleaved basic sites, among the many possible cleavage sites, that exist in the prohormone. We report a binary logistic regression model trained on mammalian prohormones that is more sensitive than existing methods in predicting these processing sites, and demonstrate the application of this method to mammalian neuropeptidomic studies. By comparing the predictive abilities of a binary logistic model trained on molluscan prohormone cleavages with the reported model, we establish the need for phyla-specific models.
神经肽是一类重要的细胞间信号分子,由于其大量的翻译后修饰,很难从遗传信息中预测。从前体激素遗传序列信息到确定生物活性神经肽,需要在众多可能的切割位点中识别出前体激素中存在的切割碱性位点。我们报告了一种基于哺乳动物前体激素训练的二元逻辑回归模型,该模型在预测这些加工位点方面比现有方法更敏感,并展示了该方法在哺乳动物神经肽组学研究中的应用。通过将基于软体动物前体激素切割训练的二元逻辑模型的预测能力与所报告的模型进行比较,我们确定了针对特定门的模型的必要性。