Stoeckert Christian J, Parkinson Helen
Department of Genetics and Center for Bioinformatics, University of Pennsylvania, Philadelphia, PA 19104, USA.
Comp Funct Genomics. 2003;4(1):127-32. doi: 10.1002/cfg.234.
The Microarray Gene Expression Data (MGED) society was formed with an initial focus on experiments involving microarray technology. Despite the diversity of applications, there are common concepts used and a common need to capture experimental information in a standardized manner. In building the MGED ontology, it was recognized that it would be impractical to cover all the different types of experiments on all the different types of organisms by listing and defining all the types of organisms and their properties. Our solution was to create a framework for describing microarray experiments with an initial focus on the biological sample and its manipulation. For concepts that are common for many species, we could provide a manageable listing of controlled terms. For concepts that are species-specific or whose values cannot be readily listed, we created an 'OntologyEntry' concept that referenced an external resource. The MGED ontology is a work in progress that needs additional instances and particularly needs constraints to be added. The ontology currently covers the experimental sample and design, and we have begun capturing aspects of the microarrays themselves as well. The primary application of the ontology will be to develop forms for entering information into databases, and consequently allowing queries, taking advantage of the structure provided by the ontology. The application of an ontology of experimental conditions extends beyond microarray experiments and, as the scope of MGED includes other aspects of functional genomics, so too will the MGED ontology.
微阵列基因表达数据(MGED)协会成立之初专注于涉及微阵列技术的实验。尽管应用具有多样性,但存在一些共同使用的概念,并且有以标准化方式捕获实验信息的共同需求。在构建MGED本体时,人们认识到通过列出并定义所有类型的生物体及其属性来涵盖所有不同类型生物体上的所有不同类型实验是不切实际的。我们的解决方案是创建一个用于描述微阵列实验的框架,最初侧重于生物样本及其操作。对于许多物种共有的概念,我们可以提供一份可控术语的可管理列表。对于特定于物种的概念或其值不易列出的概念,我们创建了一个引用外部资源的“本体条目”概念。MGED本体仍在不断完善中,需要更多实例,尤其需要添加约束条件。该本体目前涵盖实验样本和设计,并且我们也已开始捕获微阵列本身的一些方面。本体的主要应用将是开发用于将信息输入数据库的表单,从而利用本体提供的结构进行查询。实验条件本体的应用不仅限于微阵列实验,而且由于MGED的范围包括功能基因组学的其他方面,MGED本体也将如此。