University of Bayreuth, Universitätsstraße 30, Bayreuth, Germany.
Max Planck Institute for Marine Microbiology, Celsiusstraße 1, Bremen, Germany.
Database (Oxford). 2019 Jan 1;2019:baz002. doi: 10.1093/database/baz002.
The advent of advanced molecular meta-omics techniques and methods has opened a new era for analysing and characterizing historic collection specimens as well as recently collected environmental samples. Nucleic acid and protein sequencing-based analyses are increasingly applied to determine the origin, identity and traits of environmental (biological) objects and organisms. This creates a need for new data structures, and earlier approaches to data processing have to be extended to accommodate the new meta-omics techniques and operational standards. Existing schemas and community standards in the biodiversity and molecular domains concentrate on terms important for data exchange and publication; detailed operational aspects of origin and laboratory, as well as object and data management issues, are frequently neglected. Meta-omics Data and Collection Objects (MOD-CO) has therefore been set up as a new schema for meta-omics research. It organizes hierarchically the concepts that describe collection samples as well as the products and data objects generated during operational workflows. The schema focusses on object trait descriptions and on operational aspects, and may thereby serve as a backbone for R&D laboratory information management systems with the functions of an electronic laboratory notebook. Version 1.0 of the schema comprises 653 concepts and 1810 predefined concept values, equivalent to descriptors and descriptor states, respectively. It is published in several representations, including a Semantic MediaWiki publication with 2463 interlinked wiki pages for concepts and concept values, grouped into 37 concept collections and subcollections. The SQL database application DiversityDescriptions, a generic tool for maintaining descriptive data and schemas, was used to set up and test MOD-CO and to map its concepts to elements of corresponding schemas.
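The hierarchical organization described in the abstract can be pictured with a small sketch: concepts (descriptors) form a tree, and each concept may carry predefined concept values (descriptor states). This is only an illustration under stated assumptions, not the MOD-CO schema or the DiversityDescriptions data model; the class `Concept`, the helper `count_pages` and all concept names below are hypothetical. The sketch also assumes one wiki page per concept and per concept value, which is consistent with the reported totals (653 + 1810 = 2463).

```python
from __future__ import annotations

from dataclasses import dataclass, field
from typing import Iterator


@dataclass
class Concept:
    """Hypothetical stand-in for a schema concept (descriptor)."""
    name: str
    # Predefined concept values (descriptor states), if any.
    values: list[str] = field(default_factory=list)
    # Child concepts, giving the hierarchical organization.
    children: list[Concept] = field(default_factory=list)

    def walk(self) -> Iterator[Concept]:
        """Yield this concept and all of its descendants, depth first."""
        yield self
        for child in self.children:
            yield from child.walk()


def count_pages(root: Concept) -> tuple[int, int, int]:
    """Return (concepts, values, pages), assuming one page per concept and per value."""
    concepts = 0
    values = 0
    for concept in root.walk():
        concepts += 1
        values += len(concept.values)
    return concepts, values, concepts + values


# Illustrative tree only; these names are NOT taken from MOD-CO.
sample = Concept(
    "Sample",
    children=[
        Concept(
            "Sample origin",
            values=["historic collection specimen", "environmental sample"],
        ),
        Concept(
            "Sequencing",
            children=[
                Concept("Molecule type", values=["nucleic acid", "protein"]),
            ],
        ),
    ],
)

concepts, values, pages = count_pages(sample)
print(concepts, values, pages)

# The totals given in the abstract add up under the same assumption.
print(653 + 1810 == 2463)
```

A tree of plain records keeps the distinction between a concept and its predefined values explicit, which is the same distinction the abstract draws between descriptors and descriptor states.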