Vanhee Peter, Verschueren Erik, Baeten Lies, Stricher Francois, Serrano Luis, Rousseau Frederic, Schymkowitz Joost
VIB SWITCH Laboratory, Flanders Institute of Biotechnology, Free University of Brussels, Pleinlaan 2, 1050 Brussels, Belgium.
Nucleic Acids Res. 2011 Jan;39(Database issue):D435-42. doi: 10.1093/nar/gkq972. Epub 2010 Oct 23.
High-resolution structures of proteins remain the most valuable source for understanding their function in the cell and provide leads for drug design. Since the availability of sufficient protein structures to tackle complex problems such as modeling backbone moves or docking remains a problem, alternative approaches using small, recurrent protein fragments have been employed. Here we present two databases that provide a vast resource for implementing such fragment-based strategies. The BriX database contains fragments from over 7000 non-homologous proteins from the Astral collection, segmented in lengths from 4 to 14 residues and clustered according to structural similarity, summing up to a content of 2 million fragments per length. To overcome the lack of loops classified in BriX, we constructed the Loop BriX database of non-regular structure elements, clustered according to end-to-end distance between the regular residues flanking the loop. Both databases are available online (http://brix.crg.es) and can be accessed through a user-friendly web-interface. For high-throughput queries a web-based API is provided, as well as full database downloads. In addition, two exciting applications are provided as online services: (i) user-submitted structures can be covered on the fly with BriX classes, representing putative structural variation throughout the protein and (ii) gaps or low-confidence regions in these structures can be bridged with matching fragments.
蛋白质的高分辨率结构仍然是理解其在细胞中功能的最有价值来源,并为药物设计提供线索。由于获得足够的蛋白质结构来解决诸如建模主链移动或对接等复杂问题仍然是一个难题,因此已采用使用小的、重复的蛋白质片段的替代方法。在这里,我们展示了两个数据库,它们为实施这种基于片段的策略提供了丰富的资源。BriX数据库包含来自Astral集合中7000多种非同源蛋白质的片段,片段长度为4至14个残基,并根据结构相似性进行聚类,每个长度的片段总数达200万个。为了克服BriX中未分类环的不足,我们构建了非规则结构元件的Loop BriX数据库,该数据库根据环两侧规则残基之间的端到端距离进行聚类。这两个数据库均可在线获取(http://brix.crg.es),并可通过用户友好的网络界面进行访问。对于高通量查询,提供了基于网络的应用程序编程接口(API)以及完整的数据库下载。此外,还提供了两项令人兴奋的在线服务应用程序:(i)用户提交的结构可以即时用BriX类别覆盖,这代表了整个蛋白质中假定的结构变异;(ii)这些结构中的缺口或低置信度区域可以用匹配的片段来填补。