Suppr超能文献

满足生物数据中可视化条件随机场的未满足需求。

Addressing the unmet need for visualizing conditional random fields in biological data.

作者信息

Ray William C, Wolock Samuel L, Callahan Nicholas W, Dong Min, Li Q Quinn, Liang Chun, Magliery Thomas J, Bartlett Christopher W

机构信息

Nationwide Children's Hospital, 575 Children's Crossroad, 43215 Columbus, OH, USA.

出版信息

BMC Bioinformatics. 2014 Jul 10;15:202. doi: 10.1186/1471-2105-15-202.

Abstract

BACKGROUND

The biological world is replete with phenomena that appear to be ideally modeled and analyzed by one archetypal statistical framework - the Graphical Probabilistic Model (GPM). The structure of GPMs is a uniquely good match for biological problems that range from aligning sequences to modeling the genome-to-phenome relationship. The fundamental questions that GPMs address involve making decisions based on a complex web of interacting factors. Unfortunately, while GPMs ideally fit many questions in biology, they are not an easy solution to apply. Building a GPM is not a simple task for an end user. Moreover, applying GPMs is also impeded by the insidious fact that the "complex web of interacting factors" inherent to a problem might be easy to define and also intractable to compute upon.

DISCUSSION

We propose that the visualization sciences can contribute to many domains of the bio-sciences, by developing tools to address archetypal representation and user interaction issues in GPMs, and in particular a variety of GPM called a Conditional Random Field(CRF). CRFs bring additional power, and additional complexity, because the CRF dependency network can be conditioned on the query data.

CONCLUSIONS

In this manuscript we examine the shared features of several biological problems that are amenable to modeling with CRFs, highlight the challenges that existing visualization and visual analytics paradigms induce for these data, and document an experimental solution called StickWRLD which, while leaving room for improvement, has been successfully applied in several biological research projects. Software and tutorials are available at http://www.stickwrld.org/.

摘要

背景

生物界充满了各种现象,这些现象似乎可以通过一种典型的统计框架——图形概率模型(GPM)进行理想的建模和分析。GPM的结构与从序列比对到基因组与表型关系建模等一系列生物学问题具有独特的良好匹配性。GPM所解决的基本问题涉及基于复杂的相互作用因素网络做出决策。不幸的是,尽管GPM非常适合生物学中的许多问题,但它们并非易于应用的解决方案。对于终端用户来说,构建一个GPM并非易事。此外,应用GPM还受到一个潜在事实的阻碍,即问题中固有的“复杂的相互作用因素网络”可能易于定义,但计算起来却很棘手。

讨论

我们提出,可视化科学可以通过开发工具来解决GPM中典型的表示和用户交互问题,特别是一种称为条件随机场(CRF)的GPM,从而为生物科学中的许多领域做出贡献。CRF带来了额外的能力和额外的复杂性,因为CRF依赖网络可以以查询数据为条件。

结论

在本手稿中,我们研究了几个适合用CRF建模的生物学问题的共同特征,强调了现有可视化和视觉分析范式对这些数据带来的挑战,并记录了一种称为StickWRLD的实验性解决方案,该方案虽然还有改进的空间,但已成功应用于多个生物学研究项目。软件和教程可在http://www.stickwrld.org/获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a89b/4227292/fb0df6f5a4bc/1471-2105-15-202-1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验