Flexible informatics for linking experimental data to mathematical models via DataRail.

Authors

Saez-Rodriguez Julio, Goldsipe Arthur, Muhlich Jeremy, Alexopoulos Leonidas G, Millard Bjorn, Lauffenburger Douglas A, Sorger Peter K

Affiliation

Center for Cell Decision Processes, Department of Systems Biology, Harvard Medical School, Boston, MA 02115, USA.

Publication information

Bioinformatics. 2008 Mar 15;24(6):840-7. doi: 10.1093/bioinformatics/btn018. Epub 2008 Jan 24.

Abstract

MOTIVATION

Linking experimental data to mathematical models in biology is impeded by the lack of suitable software to manage and transform data. Model calibration would be facilitated and models would increase in value were it possible to preserve links to training data along with a record of all normalization, scaling, and fusion routines used to assemble the training data from primary results.

RESULTS

We describe the implementation of DataRail, an open source MATLAB-based toolbox that stores experimental data in flexible multi-dimensional arrays, transforms arrays so as to maximize information content, and then constructs models using internal or external tools. Data integrity is maintained via a containment hierarchy for arrays, imposition of a metadata standard based on a newly proposed MIDAS format, assignment of semantically typed universal identifiers, and implementation of a procedure for storing the history of all transformations with the array. We illustrate the utility of DataRail by processing a newly collected set of approximately 22 000 measurements of protein activities obtained from cytokine-stimulated primary and transformed human liver cells.
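The idea of keeping the history of all transformations together with the array can be sketched in a few lines. The following is a minimal illustration in Python, not DataRail's MATLAB implementation: the class `TrackedArray`, its `apply` method, the step names, and the dimension labels are all hypothetical, and a nested list stands in for a multi-dimensional array.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class TrackedArray:
    """Hypothetical container: data plus a record of every transformation."""
    values: List[List[float]]   # e.g. rows = conditions, columns = signals
    dims: List[str]             # names of the dimensions (assumed labels)
    history: List[dict] = field(default_factory=list)

    def apply(self, name: str, func: Callable[[float], float], **params) -> "TrackedArray":
        """Return a new array with func applied element-wise and the step logged."""
        new_values = [[func(v) for v in row] for row in self.values]
        step = {"step": name, "params": params}
        # The source array is left untouched; the new one carries the longer history.
        return TrackedArray(new_values, list(self.dims), self.history + [step])


raw = TrackedArray([[2.0, 4.0], [6.0, 8.0]], dims=["condition", "signal"])
peak = max(max(row) for row in raw.values)
scaled = raw.apply("scale_to_max", lambda v: v / peak, divisor=peak)
shifted = scaled.apply("subtract_offset", lambda v: v - 0.25, offset=0.25)
```

Because each call returns a new object, the primary data in `raw` stay unchanged, while `shifted.history` lists the scaling and offset steps in the order they were applied, which is the kind of provenance record the abstract describes.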

AVAILABILITY

DataRail is distributed under the GNU General Public License and available at http://code.google.com/p/sbpipeline/
