Chen Jake Y, Pandey Ragini, Nguyen Thanh M
Wenzhou Medical University First Affiliate Hospital, Wenzhou, Zhejiang Province, China.
Medeolinx, LLC, Indianapolis, IN, 46280, USA.
BMC Genomics. 2017 Feb 17;18(1):182. doi: 10.1186/s12864-017-3512-1.
Human protein-protein interaction (PPI) data is essential to network and systems biology studies. PPI data can help biochemists hypothesize how proteins form complexes by binding to each other, how extracellular signals propagate through post-translational modification of de-activated signaling molecules, and how chemical reactions are coupled by enzymes involved in a complex biological process. Our capability to develop good public database resources for human PPI data has a direct impact on the quality of future research on genome biology and medicine.
The database of Human Annotated and Predicted Protein Interactions (HAPPI) version 2.0 is a major update to the original HAPPI 1.0 database. It contains 2,922,202 unique protein-protein interactions (PPI) linked by 23,060 human proteins, making it the most comprehensive database covering human PPI data today. These PPIs contain both physical/direct interactions and high-quality functional/indirect interactions. Compared with the HAPPI 1.0 database release, HAPPI database version 2.0 (HAPPI-2) represents a 485% of human PPI data coverage increase and a 73% protein coverage increase. The revamped HAPPI web portal provides users with a friendly search, curation, and data retrieval interface, allowing them to retrieve human PPIs and available annotation information on the interaction type, interaction quality, interacting partner drug targeting data, and disease information. The updated HAPPI-2 can be freely accessed by Academic users at http://discovery.informatics.uab.edu/HAPPI .
While the underlying data for HAPPI-2 are integrated from a diverse data sources, the new HAPPI-2 release represents a good balance between data coverage and data quality of human PPIs, making it ideally suited for network biology.
人类蛋白质-蛋白质相互作用(PPI)数据对于网络和系统生物学研究至关重要。PPI数据可以帮助生物化学家推测蛋白质如何通过相互结合形成复合物,细胞外信号如何通过失活信号分子的翻译后修饰进行传播,以及化学反应如何由参与复杂生物过程的酶进行偶联。我们开发高质量人类PPI数据公共数据库资源的能力直接影响未来基因组生物学和医学研究的质量。
人类注释与预测蛋白质相互作用(HAPPI)数据库2.0版本是对原始HAPPI 1.0数据库的重大更新。它包含由23,060种人类蛋白质连接的2,922,202种独特的蛋白质-蛋白质相互作用(PPI),使其成为当今涵盖人类PPI数据最全面的数据库。这些PPI既包含物理/直接相互作用,也包含高质量的功能/间接相互作用。与HAPPI 1.0数据库版本相比,HAPPI数据库2.0版本(HAPPI-2)的人类PPI数据覆盖率提高了485%,蛋白质覆盖率提高了73%。改进后的HAPPI门户网站为用户提供了友好的搜索、管理和数据检索界面,使他们能够检索人类PPI以及有关相互作用类型、相互作用质量、相互作用伙伴药物靶向数据和疾病信息的可用注释信息。学术用户可通过http://discovery.informatics.uab.edu/HAPPI免费访问更新后的HAPPI-2。
虽然HAPPI-2的基础数据是从多种数据源整合而来的,但新发布的HAPPI-2在人类PPI的数据覆盖率和数据质量之间取得了良好的平衡,使其非常适合网络生物学研究。