1Knowledge Discovery and Informatics, Pacific Northwest National Laboratory, Richland, WA 99352, USA.
BMC Biotechnol. 2013 Jan 14;13:2. doi: 10.1186/1472-6750-13-2.
The high-throughput genomics communities have been successfully using standardized spreadsheet-based formats to capture and share data within labs and among public repositories. The nanomedicine community has yet to adopt similar standards to share the diverse and multi-dimensional types of data (including metadata) pertaining to the description and characterization of nanomaterials. Owing to the lack of standardization in representing and sharing nanomaterial data, most of the data currently shared via publications and data resources are incomplete, poorly-integrated, and not suitable for meaningful interpretation and re-use of the data. Specifically, in its current state, data cannot be effectively utilized for the development of predictive models that will inform the rational design of nanomaterials.
We have developed a specification called ISA-TAB-Nano, which comprises four spreadsheet-based file formats for representing and integrating various types of nanomaterial data. Three file formats (Investigation, Study, and Assay files) have been adapted from the established ISA-TAB specification; while the Material file format was developed de novo to more readily describe the complexity of nanomaterials and associated small molecules. In this paper, we have discussed the main features of each file format and how to use them for sharing nanomaterial descriptions and assay metadata.
The ISA-TAB-Nano file formats provide a general and flexible framework to record and integrate nanomaterial descriptions, assay data (metadata and endpoint measurements) and protocol information. Like ISA-TAB, ISA-TAB-Nano supports the use of ontology terms to promote standardized descriptions and to facilitate search and integration of the data. The ISA-TAB-Nano specification has been submitted as an ASTM work item to obtain community feedback and to provide a nanotechnology data-sharing standard for public development and adoption.
高通量基因组学社区已经成功地使用标准化的电子表格格式在实验室内部和公共存储库之间捕获和共享数据。然而,纳米医学社区尚未采用类似的标准来共享与纳米材料描述和特征有关的各种多维类型的数据(包括元数据)。由于在表示和共享纳米材料数据方面缺乏标准化,目前通过出版物和数据资源共享的大多数数据都不完整、集成度差,并且不适合对数据进行有意义的解释和再利用。具体来说,在当前状态下,数据无法有效地用于开发预测模型,从而为纳米材料的合理设计提供信息。
我们开发了一种名为 ISA-TAB-Nano 的规范,它由四个基于电子表格的文件格式组成,用于表示和集成各种类型的纳米材料数据。三个文件格式(调查、研究和分析文件)是从已建立的 ISA-TAB 规范中改编而来的;而材料文件格式是从头开发的,以便更轻松地描述纳米材料及其相关小分子的复杂性。在本文中,我们讨论了每个文件格式的主要特点以及如何使用它们来共享纳米材料描述和分析元数据。
ISA-TAB-Nano 文件格式为记录和集成纳米材料描述、分析数据(元数据和端点测量值)和协议信息提供了一个通用且灵活的框架。与 ISA-TAB 一样,ISA-TAB-Nano 支持使用本体论术语来促进标准化描述,并促进数据的搜索和集成。ISA-TAB-Nano 规范已作为 ASTM 工作项目提交,以获得社区反馈,并为公共开发和采用提供纳米技术数据共享标准。