Huan Tianxiao, Sivachenko Andrey Y, Harrison Scott H, Chen Jake Y
School of Informatics, Indiana University - Purdue University, Indianapolis, IN 46202, USA.
BMC Bioinformatics. 2008 Aug 12;9 Suppl 9(Suppl 9):S5. doi: 10.1186/1471-2105-9-S9-S5.
New systems biology studies require researchers to understand how interplay among myriads of biomolecular entities is orchestrated in order to achieve high-level cellular and physiological functions. Many software tools have been developed in the past decade to help researchers visually navigate large networks of biomolecular interactions with built-in template-based query capabilities. To further advance researchers' ability to interrogate global physiological states of cells through multi-scale visual network explorations, new visualization software tools still need to be developed to empower the analysis. A robust visual data analysis platform driven by database management systems to perform bi-directional data processing-to-visualizations with declarative querying capabilities is needed.
We developed ProteoLens as a JAVA-based visual analytic software tool for creating, annotating and exploring multi-scale biological networks. It supports direct database connectivity to either Oracle or PostgreSQL database tables/views, on which SQL statements using both Data Definition Languages (DDL) and Data Manipulation languages (DML) may be specified. The robust query languages embedded directly within the visualization software help users to bring their network data into a visualization context for annotation and exploration. ProteoLens supports graph/network represented data in standard Graph Modeling Language (GML) formats, and this enables interoperation with a wide range of other visual layout tools. The architectural design of ProteoLens enables the de-coupling of complex network data visualization tasks into two distinct phases: 1) creating network data association rules, which are mapping rules between network node IDs or edge IDs and data attributes such as functional annotations, expression levels, scores, synonyms, descriptions etc; 2) applying network data association rules to build the network and perform the visual annotation of graph nodes and edges according to associated data values. We demonstrated the advantages of these new capabilities through three biological network visualization case studies: human disease association network, drug-target interaction network and protein-peptide mapping network.
The architectural design of ProteoLens makes it suitable for bioinformatics expert data analysts who are experienced with relational database management to perform large-scale integrated network visual explorations. ProteoLens is a promising visual analytic platform that will facilitate knowledge discoveries in future network and systems biology studies.
新的系统生物学研究要求研究人员了解无数生物分子实体之间的相互作用是如何协调的,以便实现高级细胞和生理功能。在过去十年中,已经开发了许多软件工具来帮助研究人员通过内置的基于模板的查询功能,以可视化方式浏览生物分子相互作用的大型网络。为了进一步提高研究人员通过多尺度可视化网络探索来探究细胞全局生理状态的能力,仍需要开发新的可视化软件工具来加强分析。需要一个由数据库管理系统驱动的强大可视化数据分析平台,以通过声明式查询功能执行双向数据处理到可视化。
我们开发了ProteoLens,这是一个基于JAVA的可视化分析软件工具,用于创建、注释和探索多尺度生物网络。它支持直接连接到Oracle或PostgreSQL数据库表/视图,在这些表/视图上可以指定使用数据定义语言(DDL)和数据操作语言(DML)的SQL语句。直接嵌入可视化软件中的强大查询语言可帮助用户将其网络数据带入可视化环境进行注释和探索。ProteoLens支持标准图形建模语言(GML)格式的图形/网络表示数据,并能够与广泛的其他可视化布局工具进行互操作。ProteoLens的架构设计能够将复杂的网络数据可视化任务解耦为两个不同阶段:1)创建网络数据关联规则,即网络节点ID或边ID与数据属性(如功能注释、表达水平、分数、同义词、描述等)之间的映射规则;2)应用网络数据关联规则来构建网络,并根据关联数据值对图形节点和边进行可视化注释。我们通过三个生物网络可视化案例研究展示了这些新功能的优势:人类疾病关联网络、药物-靶点相互作用网络和蛋白质-肽映射网络。
ProteoLens的架构设计使其适用于有关系数据库管理经验的生物信息学专家数据分析人员进行大规模综合网络可视化探索。ProteoLens是一个很有前途的可视化分析平台,将有助于未来网络和系统生物学研究中的知识发现。