Centre for Gene Regulation and Expression, School of Life Sciences, University of Dundee, Dow St, Dundee DD1 5EH, UK.
Nucleic Acids Res. 2018 Jan 4;46(D1):D1202-D1209. doi: 10.1093/nar/gkx807.
Driven by improvements in speed and resolution of mass spectrometers (MS), the field of proteomics, which involves the large-scale detection and analysis of proteins in cells, tissues and organisms, continues to expand in scale and complexity. There is a resulting growth in datasets of both raw MS files and processed peptide and protein identifications. MS-based proteomics technology is also used increasingly to measure additional protein properties affecting cellular function and disease mechanisms, including post-translational modifications, protein-protein interactions, subcellular and tissue distributions. Consequently, biologists and clinicians need innovative tools to conveniently analyse, visualize and explore such large, complex proteomics data and to integrate it with genomics and other related large-scale datasets. We have created the Encyclopedia of Proteome Dynamics (EPD) to meet this need (https://peptracker.com/epd/). The EPD combines a polyglot persistent database and web-application that provides open access to integrated proteomics data for >30 000 proteins from published studies on human cells and model organisms. It is designed to provide a user-friendly interface, featuring graphical navigation with interactive visualizations that facilitate powerful data exploration in an intuitive manner. The EPD offers a flexible and scalable ecosystem to integrate proteomics data with genomics information, RNA expression and other related, large-scale datasets.
受质谱仪(MS)速度和分辨率提高的推动,蛋白质组学领域(涉及细胞、组织和生物体中蛋白质的大规模检测和分析)的规模和复杂性不断扩大。由此产生的原始 MS 文件和处理后的肽和蛋白质鉴定数据集也在不断增长。基于 MS 的蛋白质组学技术也越来越多地用于测量影响细胞功能和疾病机制的其他蛋白质特性,包括翻译后修饰、蛋白质-蛋白质相互作用、亚细胞和组织分布。因此,生物学家和临床医生需要创新的工具来方便地分析、可视化和探索这些大型、复杂的蛋白质组学数据,并将其与基因组学和其他相关的大规模数据集集成。我们创建了蛋白质组动态百科全书 (EPD) 来满足这一需求(https://peptracker.com/epd/)。EPD 结合了多语言持久数据库和 Web 应用程序,为来自人类细胞和模式生物的已发表研究中 >30000 种蛋白质的综合蛋白质组学数据提供了开放访问。它旨在提供一个用户友好的界面,具有图形导航和交互式可视化功能,以直观的方式方便强大的数据探索。EPD 提供了一个灵活且可扩展的生态系统,可将蛋白质组学数据与基因组学信息、RNA 表达和其他相关的大规模数据集集成。