Krembil Research Institute, University Health Network, Toronto, ON M5T 0S8, Canada.
Department of Medicine, Toronto Western Hospital, University Health Network, Toronto, ON M5T 2S8, Canada.
Nucleic Acids Res. 2020 Jan 8;48(D1):D479-D488. doi: 10.1093/nar/gkz989.
PathDIP was introduced to increase proteome coverage of literature-curated human pathway databases. PathDIP 4 now integrates 24 major databases. To further reduce the number of proteins with no curated pathway annotation, pathDIP integrates pathways with physical protein-protein interactions (PPIs) to predict significant physical associations between proteins and curated pathways. For human, it provides pathway annotations for 5366 pathway orphans. Integrated pathway annotation now includes six model organisms and ten domesticated animals. A total of 6401 core and ortholog pathways have been curated from the literature or by annotating orthologs of human proteins in the literature-curated pathways. Extended pathways are the result of combining these pathways with protein-pathway associations that are predicted using organism-specific PPIs. Extended pathways expand proteome coverage from 81 088 to 120 621 proteins, making pathDIP 4 the largest publicly available pathway database for these organisms and providing a necessary platform for comprehensive pathway-enrichment analysis. PathDIP 4 users can customize their search and analysis by selecting organism, identifier and subset of pathways. Enrichment results and detailed annotations for input list can be obtained in different formats and views. To support automated bioinformatics workflows, Java, R and Python APIs are available for batch pathway annotation and enrichment analysis. PathDIP 4 is publicly available at http://ophid.utoronto.ca/pathDIP.
PathDIP 的推出旨在增加文献整理的人类途径数据库的蛋白质组覆盖范围。PathDIP 4 现在整合了 24 个主要数据库。为了进一步减少没有经过途径注释的蛋白质数量,pathDIP 将途径与物理蛋白质-蛋白质相互作用 (PPIs) 集成在一起,以预测蛋白质与经过整理的途径之间的重要物理关联。对于人类,它提供了 5366 条途径孤儿的途径注释。整合的途径注释现在包括六个模式生物和十种家养动物。总共从文献中或通过注释文献整理途径中的人类蛋白质的同源物来整理了 6401 个核心和同源途径。扩展途径是通过将这些途径与使用特定于生物体的 PPIs 预测的蛋白质-途径关联组合而产生的。扩展途径将蛋白质组覆盖范围从 81088 扩展到 120621 个蛋白质,使 PathDIP 4 成为这些生物体中最大的公开可用途径数据库,并为全面的途径富集分析提供了必要的平台。PathDIP 4 用户可以通过选择生物体、标识符和途径子集来定制他们的搜索和分析。可以以不同的格式和视图获得输入列表的富集结果和详细注释。为了支持自动化生物信息学工作流程,提供了 Java、R 和 Python API 用于批量途径注释和富集分析。PathDIP 4 可在 http://ophid.utoronto.ca/pathDIP 上公开获取。