Department of Biological Chemistry, Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem 91904, Israel.
Toxins (Basel). 2012 Nov 16;4(11):1367-84. doi: 10.3390/toxins4111367.
Cnidaria is a rich phylum that includes thousands of marine species. In this study, we focused on Anthozoa and Hydrozoa, represented by the genomes of Nematostella vectensis (sea anemone) and Hydra magnipapillata, respectively. We present a method for ranking toxin-like candidates from complete proteomes of Cnidaria. Toxin-like functions were revealed using ClanTox, a statistical machine-learning predictor trained on ion channel inhibitors from venomous animals. Fundamental features that were emphasized in training ClanTox include cysteines and their spacing along the sequences. Among the 83,000 proteins derived from Cnidaria representatives, we found 170 candidates that fulfill the properties of toxin-like proteins, the vast majority of which were previously unrecognized as toxins. An additional 394 short proteins exhibit characteristics of toxin-like proteins with moderate confidence. Remarkably, only 11% of the predicted toxin-like proteins were previously classified as toxins. Based on our prediction methodology and manual annotation, we inferred functions for over 400 of these proteins. Such functions include protease inhibitors, membrane pore formation, ion channel blockers and metal-binding proteins. Many of the proteins belong to small families of paralogs. We conclude that the evolutionary expansion of toxin-like proteins in Cnidaria contributes to their fitness in the complex environment of the aquatic ecosystem.
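The abstract names cysteines and their spacing along the sequence as fundamental features emphasized in training ClanTox. As a rough illustration only, the sketch below computes simple cysteine descriptors for a single protein sequence. It is not the ClanTox feature set or algorithm: the function name `cysteine_features`, the returned keys, and the example peptide are all hypothetical and chosen purely for demonstration.

```python
from typing import Dict


def cysteine_features(sequence: str) -> Dict[str, object]:
    """Return simple cysteine descriptors for one protein sequence.

    Hypothetical helper for illustration; not part of ClanTox.
    """
    seq = sequence.strip().upper()
    # 0-based positions of every cysteine (one-letter code "C").
    positions = [i for i, aa in enumerate(seq) if aa == "C"]
    # Number of residues lying between each pair of consecutive cysteines.
    gaps = [b - a - 1 for a, b in zip(positions, positions[1:])]
    return {
        "length": len(seq),
        "cys_count": len(positions),
        "cys_fraction": len(positions) / len(seq) if seq else 0.0,
        "cys_gaps": gaps,
    }


if __name__ == "__main__":
    # Made-up peptide, not a real toxin sequence.
    print(cysteine_features("GCKDNCAPWCTRC"))
```

Descriptors of this kind could in principle be fed to any statistical classifier; how ClanTox actually encodes and weights such features is not described in the abstract.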