Shameer Khader, Sowdhamini Ramanathan
National Centre for Biological Sciences (TIFR), GKVK Campus, Bangalore 560065, India.
J Clin Bioinforma. 2012 Apr 3;2(1):8. doi: 10.1186/2043-9113-2-8.
3D domain swapping is a novel structural phenomenon observed in diverse set of protein structures in oligomeric conformations. A distinct structural feature, where structural segments in a protein dimer or higher oligomer were shared between two or more chains of a protein structure, characterizes 3D domain swapping. 3D domain swapping was observed as a key mediator of numerous functional mechanisms and play pathogenic role in various diseases including conformational diseases like amyloidosis, Alzheimer's disease, Parkinson's disease and prion diseases. We report the first study with a focus on identifying functional classes, pathways and diseases mediated by 3D domain swapping in the human proteome.
We used a panel of four enrichment tools with two different ontologies and two annotations database to derive biological and clinical relevant information associated with 3D domain swapping. Protein domain enrichment analysis followed by Gene Ontology (GO) term enrichment analysis revealed the functional repertoire of proteins involved in swapping. Pathway analysis using KEGG annotations revealed diverse pathway associations of human proteins involved in 3D domain swapping. Disease Ontology was used to find statistically significant associations with proteins in swapped conformation and various disease categories (P-value < 0.05).
We report meta-analysis results of a literature-curated dataset of human gene products involved in 3D domain swapping and discuss new insights about the functional repertoire, pathway associations and disease implications of proteins involved in 3D domain swapping.
Our integrated bioinformatics pipeline comprising of four different enrichment tools, two ontologies and two annotations revealed new insights into the functional and disease correlations with 3D domain swapping. GO term enrichment were used to infer terms associated with three different GO categories. Protein domain enrichment was used to identify conserved domains enriched in swapped proteins. Pathway enrichment analysis using KEGG annotations revealed that proteins with swapped conformations are present in all six classes of KEGG BRITE hierarchy and significantly enriched KEGG pathways were observed in five classes. Five major classes of disease were found to be associated with 3D domain swapping using functional disease ontology based enrichment analysis. Five classes of human diseases: cancer, diseases of the respiratory or pulmonary system, degenerative diseases of the central nervous system, vascular disease and encephalitis were found to be significant. In conclusion, our study shows that bioinformatics based analytical approaches using curated data can enhance the understanding of functional and disease implications of 3D domain swapping.
三维结构域交换是在寡聚构象的多种蛋白质结构中观察到的一种新型结构现象。三维结构域交换的一个显著结构特征是,蛋白质二聚体或更高聚体中的结构片段在蛋白质结构的两条或多条链之间共享。三维结构域交换被认为是多种功能机制的关键介导因素,并且在包括淀粉样变性、阿尔茨海默病、帕金森病和朊病毒病等构象疾病在内的各种疾病中发挥致病作用。我们报告了第一项专注于识别由人类蛋白质组中的三维结构域交换介导的功能类别、途径和疾病的研究。
我们使用了一组四种富集工具,结合两种不同的本体和两个注释数据库,以获取与三维结构域交换相关的生物学和临床相关信息。蛋白质结构域富集分析随后进行基因本体(GO)术语富集分析,揭示了参与交换的蛋白质的功能库。使用KEGG注释进行的途径分析揭示了参与三维结构域交换的人类蛋白质的多种途径关联。疾病本体用于发现与处于交换构象的蛋白质和各种疾病类别之间的统计学显著关联(P值<0.05)。
我们报告了一个关于参与三维结构域交换的人类基因产物的文献整理数据集的荟萃分析结果,并讨论了关于参与三维结构域交换的蛋白质的功能库、途径关联和疾病影响的新见解。
我们由四种不同的富集工具、两种本体和两种注释组成的综合生物信息学管道揭示了与三维结构域交换相关的功能和疾病相关性的新见解。GO术语富集用于推断与三个不同GO类别相关的术语。蛋白质结构域富集用于识别在交换蛋白中富集的保守结构域。使用KEGG注释进行的途径富集分析表明,处于交换构象的蛋白质存在于KEGG BRITE层次结构的所有六个类别中,并且在五个类别中观察到显著富集的KEGG途径。使用基于功能疾病本体的富集分析发现,五类主要疾病与三维结构域交换相关。发现五类人类疾病:癌症、呼吸或肺部系统疾病、中枢神经系统退行性疾病、血管疾病和脑炎具有显著性。总之,我们的研究表明,使用整理数据的基于生物信息学的分析方法可以增强对三维结构域交换的功能和疾病影响的理解。