Department of Psychiatry, Washington University, Saint Louis, MO 63110, USA.
Bioinformatics. 2012 Apr 15;28(8):1189-91. doi: 10.1093/bioinformatics/bts117. Epub 2012 Mar 16.
Public genomic databases, which are often used to guide genetic studies of human disease, are now being applied to genomic medicine through in silico integrative genomics. These databases, however, often lack tools for systematically determining the experimental origins of the data.
We introduce a new data provenance model that we have implemented in a public web application, BioQ, for assessing the reliability of the data by systematically tracing its experimental origins to the original subjects and biologics. BioQ allows investigators to both visualize data provenance as well as explore individual elements of experimental process flow using precise tools for detailed data exploration and documentation. It includes a number of human genetic variation databases such as the HapMap and 1000 Genomes projects.
BioQ is freely available to the public at http://bioq.saclab.net.
公共基因组数据库通常被用于指导人类疾病的遗传研究,现正通过计算机综合基因组学应用于基因组医学。然而,这些数据库往往缺乏系统性确定数据实验来源的工具。
我们引入了一种新的数据起源模型,我们已在一个公共网络应用程序 BioQ 中实现了该模型,通过系统性地将数据的实验起源追溯到原始个体和生物样本,从而评估数据的可靠性。BioQ 允许调查人员通过精确的工具进行详细的数据探索和记录,可视化数据起源,并探索实验流程的各个元素。它包括一些人类遗传变异数据库,如 HapMap 和 1000 基因组计划。
BioQ 可在 http://bioq.saclab.net 上免费供公众使用。