Pawlicki Sandrine, Le Béchec Antony, Delamarche Christian
Université de Rennes I and CNRS UMR 6026, Equipe Structure et Dynamique des Macromolécules, Campus de Beaulieu, Nb 13, 35042 RENNES Cedex, France.
BMC Bioinformatics. 2008 Jun 10;9:273. doi: 10.1186/1471-2105-9-273.
Misfolding and aggregation of proteins into ordered fibrillar structures is associated with a number of severe pathologies, including Alzheimer's disease, prion diseases, and type II diabetes. The rapid accumulation of knowledge about the sequences and structures of these proteins allows using of in silico methods to investigate the molecular mechanisms of their abnormal conformational changes and assembly. However, such an approach requires the collection of accurate data, which are inconveniently dispersed among several generalist databases.
We therefore created a free online knowledge database (AMYPdb) dedicated to amyloid precursor proteins and we have performed large scale sequence analysis of the included data. Currently, AMYPdb integrates data on 31 families, including 1,705 proteins from nearly 600 organisms. It displays links to more than 2,300 bibliographic references and 1,200 3D-structures. A Wiki system is available to insert data into the database, providing a sharing and collaboration environment. We generated and analyzed 3,621 amino acid sequence patterns, reporting highly specific patterns for each amyloid family, along with patterns likely to be involved in protein misfolding and aggregation.
AMYPdb is a comprehensive online database aiming at the centralization of bioinformatic data regarding all amyloid proteins and their precursors. Our sequence pattern discovery and analysis approach unveiled protein regions of significant interest. AMYPdb is freely accessible 1.
蛋白质错误折叠并聚集成有序的纤维状结构与多种严重病症相关,包括阿尔茨海默病、朊病毒病和II型糖尿病。关于这些蛋白质的序列和结构的知识迅速积累,这使得利用计算机方法来研究其异常构象变化和组装的分子机制成为可能。然而,这种方法需要收集准确的数据,而这些数据分散在几个通用数据库中,获取不便。
因此,我们创建了一个专门针对淀粉样前体蛋白的免费在线知识数据库(AMYPdb),并对其中的数据进行了大规模序列分析。目前,AMYPdb整合了31个家族的数据,包括来自近600种生物的1705种蛋白质。它显示了与2300多篇参考文献和1200个三维结构的链接。提供了一个维基系统,用于将数据插入数据库,营造了一个共享与协作的环境。我们生成并分析了3621个氨基酸序列模式,报告了每个淀粉样家族的高度特异性模式,以及可能参与蛋白质错误折叠和聚集的模式。
AMYPdb是一个综合性在线数据库,旨在集中有关所有淀粉样蛋白及其前体的生物信息学数据。我们的序列模式发现和分析方法揭示了非常值得关注的蛋白质区域。AMYPdb可免费访问。