Department of Computer Science, National University of Singapore, 13 Computing Drive S(117417), Singapore.
Bioinformatics. 2010 Apr 15;26(8):1036-42. doi: 10.1093/bioinformatics/btq065. Epub 2010 Feb 18.
An important class of protein interactions involves the binding of a protein's domain to a short linear motif (SLiM) on its interacting partner. Extracting such motifs, either experimentally or computationally, is challenging because of their weak binding and high degree of degeneracy. Recent rapid increase of available protein structures provides an excellent opportunity to study SLiMs directly from their 3D structures.
Using domain interface extraction (Diet), we characterized 452 distinct SLiMs from the Protein Data Bank (PDB), of which 155 are validated in varying degrees-40 have literature validation, 54 are supported by at least one domain-peptide structural instance, and another 61 have overrepresentation in high-throughput PPI data. We further observed that the lacklustre coverage of existing computational SLiM detection methods could be due to the common assumption that most SLiMs occur outside globular domain regions. 198 of 452 SLiM that we reported are actually found on domain-domain interface; some of them are implicated in autoimmune and neurodegenerative diseases. We suggest that these SLiMs would be useful for designing inhibitors against the pathogenic protein complexes underlying these diseases. Our findings show that 3D structure-based SLiM detection algorithms can provide a more complete coverage of SLiM-mediated protein interactions than current sequence-based approaches.
一类重要的蛋白质相互作用涉及蛋白质结构域与相互作用伴侣上短线性基序 (SLiM) 的结合。由于其结合较弱且高度简并,因此无论是通过实验还是计算方法提取这些基序都具有挑战性。最近可获得的蛋白质结构数量的快速增加为直接从 3D 结构研究 SLiMs 提供了极好的机会。
我们使用结构域界面提取(Diet)方法,从蛋白质数据库(PDB)中鉴定出 452 个不同的 SLiM,其中 155 个在不同程度上得到验证-40 个具有文献验证,54 个得到至少一个结构域-肽结构实例的支持,另外 61 个在高通量 PPI 数据中过度表达。我们还观察到,现有的计算 SLiM 检测方法覆盖不足可能是由于普遍假设大多数 SLiMs 发生在球状结构域区域之外。我们报告的 452 个 SLiM 中有 198 个实际上存在于结构域-结构域界面上;其中一些与自身免疫和神经退行性疾病有关。我们建议这些 SLiM 可用于设计针对这些疾病潜在致病蛋白复合物的抑制剂。我们的研究结果表明,基于 3D 结构的 SLiM 检测算法比当前基于序列的方法能更完整地覆盖 SLiM 介导的蛋白质相互作用。