Shomali Aida, Vafaei Sadi Mohammad Sadegh, Bakhtiarizadeh Mohammad Reza, Aliniaeifard Sasan, Trewavas Anthony, Calvo Paco
Department of Horticulture, College of Aburaihan, University of Tehran, Tehran, Iran.
Department of Animal and Poultry Science, College of Aburaihan, University of Tehran, Tehran, Iran.
Commun Integr Biol. 2022 Nov 15;15(1):253-264. doi: 10.1080/19420889.2022.2143101. eCollection 2022.
In this study, we advance a robust methodology for identifying specific intelligence-related proteins across phyla. Our approach exploits a support vector machine-based classifier capable of predicting intelligence-related proteins based on a pool of meaningful protein features. For the sake of illustration of our proposed general method, we develop a novel computational two-layer predictor, Intell_Pred, to predict query sequences (proteins or transcripts) as intelligence-related or non-intelligence-related proteins or transcripts, subsequently classifying the former sequences into learning and memory-related classes. Based on a five-fold cross-validation and independent blind test, Intell_Pred obtained an average accuracy of 87.48 and 88.89, respectively. Our findings revealed that a score >0.75 (during prediction by Intell_Pred) is a well-grounded choice for predicting intelligence-related candidate proteins in most organisms across biological kingdoms. In particular, we assessed seismonastic movements and associate learning in plants and evaluated the proteins involved using Intell_Pred. Proteins related to seismonastic movement and associate learning showed high percentages of similarities with intelligence-related proteins. Our findings lead us to believe that Intell_Pred can help identify the intelligence-related proteins and their classes using a given protein/transcript sequence.
在本研究中,我们提出了一种强大的方法来识别跨门的特定智力相关蛋白质。我们的方法利用了一种基于支持向量机的分类器,该分类器能够根据一组有意义的蛋白质特征预测智力相关蛋白质。为了说明我们提出的通用方法,我们开发了一种新颖的计算双层预测器Intell_Pred,以将查询序列(蛋白质或转录本)预测为与智力相关或与非智力相关的蛋白质或转录本,随后将前一类序列分类为与学习和记忆相关的类别。基于五折交叉验证和独立盲测,Intell_Pred的平均准确率分别为87.48和88.89。我们的研究结果表明,在预测过程中(通过Intell_Pred)得分>0.75是预测生物界大多数生物体中与智力相关的候选蛋白质的合理选择。特别是,我们评估了植物中的感震运动和联想学习,并使用Intell_Pred评估了其中涉及的蛋白质。与感震运动和联想学习相关的蛋白质与智力相关蛋白质具有很高的相似百分比。我们的研究结果使我们相信,Intell_Pred可以帮助使用给定的蛋白质/转录本序列识别与智力相关蛋白质及其类别。