Rajput Akanksha, Gupta Amit Kumar, Kumar Manoj
Bioinformatics Centre, Institute of Microbial Technology, Council of Scientific and Industrial Research (CSIR), Sector 39-A, Chandigarh-160036, India.
PLoS One. 2015 Mar 17;10(3):e0120066. doi: 10.1371/journal.pone.0120066. eCollection 2015.
Quorum sensing peptides (QSPs) are the signaling molecules used by the Gram-positive bacteria in orchestrating cell-to-cell communication. In spite of their enormous importance in signaling process, their detailed bioinformatics analysis is lacking. In this study, QSPs and non-QSPs were examined according to their amino acid composition, residues position, motifs and physicochemical properties. Compositional analysis concludes that QSPs are enriched with aromatic residues like Trp, Tyr and Phe. At the N-terminal, Ser was a dominant residue at maximum positions, namely, first, second, third and fifth while Phe was a preferred residue at first, third and fifth positions from the C-terminal. A few motifs from QSPs were also extracted. Physicochemical properties like aromaticity, molecular weight and secondary structure were found to be distinguishing features of QSPs. Exploiting above properties, we have developed a Support Vector Machine (SVM) based predictive model. During 10-fold cross-validation, SVM achieves maximum accuracy of 93.00%, Mathew's correlation coefficient (MCC) of 0.86 and Receiver operating characteristic (ROC) of 0.98 on the training/testing dataset (T200p+200n). Developed models performed equally well on the validation dataset (V20p+20n). The server also integrates several useful analysis tools like "QSMotifScan", "ProtFrag", "MutGen" and "PhysicoProp". Our analysis reveals important characteristics of QSPs and on the basis of these unique features, we have developed a prediction algorithm "QSPpred" (freely available at: http://crdd.osdd.net/servers/qsppred).
群体感应肽(QSPs)是革兰氏阳性菌用于协调细胞间通讯的信号分子。尽管它们在信号传导过程中极为重要,但缺乏详细的生物信息学分析。在本研究中,根据氨基酸组成、残基位置、基序和理化性质对QSPs和非QSPs进行了研究。组成分析得出结论,QSPs富含色氨酸(Trp)、酪氨酸(Tyr)和苯丙氨酸(Phe)等芳香族残基。在N端,丝氨酸(Ser)在大多数位置占主导,即第一、第二、第三和第五位,而苯丙氨酸在C端的第一、第三和第五位是优先残基。还从QSPs中提取了一些基序。发现芳香性、分子量和二级结构等理化性质是QSPs的显著特征。利用上述性质,我们开发了一种基于支持向量机(SVM)的预测模型。在10折交叉验证中,SVM在训练/测试数据集(T200p + 200n)上的最大准确率为93.00%,马修斯相关系数(MCC)为0.86,受试者工作特征曲线(ROC)为0.98。开发的模型在验证数据集(V20p + 20n)上表现同样良好。该服务器还集成了几个有用的分析工具,如“QSMotifScan”、“ProtFrag”、“MutGen”和“PhysicoProp”。我们的分析揭示了QSPs的重要特征,并基于这些独特特征开发了一种预测算法“QSPpred”(可从http://crdd.osdd.net/servers/qsppred免费获取)。