BMC Bioinformatics. 2013;14 Suppl 19(Suppl 19):S1. doi: 10.1186/1471-2105-14-S19-S1. Epub 2013 Nov 12.
High throughput gene expression time-course experiments provide a perspective on biological functioning recognized as having huge value for the diagnosis, treatment, and prevention of diseases. There are however significant challenges to properly exploiting this data due to its massive scale and complexity. In particular, existing techniques are found to be ill suited to finding patterns of changing activity over a limited interval of an experiments time frame. The Time-Series Explorer (TSE) was developed to overcome this limitation by allowing users to explore their data by controlling an animated scatter-plot view. MaTSE improves and extends TSE by allowing users to visualize data with missing values, cross reference multiple conditions, highlight gene groupings, and collaborate by sharing their findings.
MaTSE was developed using an iterative software development cycle that involved a high level of user feedback and evaluation. The resulting software combines a variety of visualization and interaction techniques which work together to allow biologists to explore their data and reveal temporal patterns of gene activity. These include a scatter-plot that can be animated to view different temporal intervals of the data, a multiple coordinated view framework to support the cross reference of multiple experimental conditions, a novel method for highlighting overlapping groups in the scatter-plot, and a pattern browser component that can be used with scatter-plot box queries to support cooperative visualization. A final evaluation demonstrated the tools effectiveness in allowing users to find unexpected temporal patterns and the benefits of functionality such as the overlay of gene groupings and the ability to store patterns.
We have developed a new exploratory analysis tool, MaTSE, that allows users to find unexpected patterns of temporal activity in gene expression time-series data. Overall, the study acted well to demonstrate the benefits of an iterative software development life cycle and allowed us to investigate some visualization problems that are likely to be common in the field of bioinformatics. The subjects involved in the final evaluation were positive about the potential of MaTSE to help them find unexpected patterns in their data and characterized MaTSE as an exploratory tool valuable for hypothesis generation and the creation of new biological knowledge.
高通量基因表达时间序列实验为生物功能提供了一个视角,被认为对疾病的诊断、治疗和预防具有巨大价值。然而,由于数据规模庞大且复杂,要充分利用这些数据仍然面临重大挑战。特别是,现有的技术被发现不适合在实验时间框架的有限间隔内找到活动变化的模式。Time-Series Explorer(TSE)通过允许用户通过控制动画散点图视图来探索他们的数据来克服这一限制。MaTSE 通过允许用户可视化具有缺失值的数据、交叉引用多个条件、突出基因分组以及通过共享他们的发现进行协作,改进和扩展了 TSE。
MaTSE 是使用迭代软件开发周期开发的,该周期涉及大量用户反馈和评估。最终的软件结合了各种可视化和交互技术,共同允许生物学家探索他们的数据并揭示基因活动的时间模式。这些技术包括可以动画化以查看数据不同时间间隔的散点图、支持交叉引用多个实验条件的多个协调视图框架、用于突出散点图中重叠组的新方法,以及可以与散点图框查询一起使用的模式浏览器组件,以支持协作可视化。最终评估证明了这些工具在帮助用户发现意外时间模式方面的有效性,以及基因分组叠加和存储模式等功能的好处。
我们开发了一种新的探索性分析工具 MaTSE,它允许用户在基因表达时间序列数据中找到意外的时间活动模式。总体而言,该研究很好地展示了迭代软件开发生命周期的好处,并使我们能够研究生物信息学领域中可能常见的一些可视化问题。参与最终评估的对象对 MaTSE 帮助他们发现数据中意外模式的潜力持积极态度,并将 MaTSE 描述为有助于生成假设和创造新生物学知识的探索性工具。