Shameer Khader, Pugalenthi Ganesan, Kandaswamy Krishna Kumar, Sowdhamini Ramanathan
National Centre for Biological Sciences, UAS-GKVK Campus, Bellary Road, Bangalore 560 065, India.
Protein Pept Lett. 2011 Oct;18(10):1010-20. doi: 10.2174/092986611796378729.
3D domain swapping is a protein structural phenomenon that mediates the formation of the higher order oligomers in a variety of proteins with different structural and functional properties. 3D domain swapping is associated with a variety of biological functions ranging from oligomerization to pathological conformational diseases. 3D domain swapping is realised subsequent to structure determination where the protein is observed in the swapped conformation in the oligomeric state. This is a limiting step to understand this important structural phenomenon in a large scale from the growing sequence data. A new machine learning approach, 3dswap-pred, has been developed for the prediction of 3D domain swapping in protein structures from mere sequence data using the Random Forest approach. 3Dswap-pred is implemented using a positive sequence dataset derived from literature based structural curation of 297 structures. A negative sequence dataset is obtained from 462 SCOP domains using a new sequence data mining approach and a set of 126 sequencederived features. Statistical validation using an independent dataset of 68 positive sequences and 313 negative sequences revealed that 3dswap-pred achieved an accuracy of 63.8%. A webserver is also implemented using the 3dswap-pred Random Forest model. The server is available from the URL: http://caps.ncbs.res.in/3dswap-pred.
三维结构域交换是一种蛋白质结构现象,它介导多种具有不同结构和功能特性的蛋白质中高阶寡聚体的形成。三维结构域交换与从寡聚化到病理性构象疾病等多种生物学功能相关。三维结构域交换是在结构测定之后实现的,在寡聚状态下观察到蛋白质处于交换构象。从不断增长的序列数据大规模理解这一重要结构现象,这是一个限制步骤。一种新的机器学习方法3dswap-pred已经被开发出来,用于仅根据序列数据使用随机森林方法预测蛋白质结构中的三维结构域交换。3Dswap-pred使用从基于文献的297个结构的结构整理中获得的阳性序列数据集来实现。使用一种新的序列数据挖掘方法和一组126个序列衍生特征,从462个SCOP结构域中获得阴性序列数据集。使用由68个阳性序列和313个阴性序列组成的独立数据集进行统计验证,结果表明3dswap-pred的准确率达到了63.8%。还使用3dswap-pred随机森林模型实现了一个网络服务器。该服务器可从以下网址获取:http://caps.ncbs.res.in/3dswap-pred 。