Brochhausen Mathias, Zheng Jie, Birtwell David, Williams Heather, Masci Anna Maria, Ellis Helena Judge, Stoeckert Christian J
Department of Biomedical Informatics, University of Arkansas for Medical Sciences, 4301 W. Markham St., #782, Little Rock, AR 72205-7199 USA.
Department of Genetics, Institute for Translational Medicine and Therapeutics, Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, USA.
J Biomed Semantics. 2016 May 2;7:23. doi: 10.1186/s13326-016-0068-y. eCollection 2016.
Biobanking necessitates extensive integration of data to allow data analysis and specimen sharing. Ontologies have been demonstrated to be a promising approach in fostering better semantic integration of biobank-related data. Hitherto no ontology provided the coverage needed to capture a broad spectrum of biobank user scenarios.
Based in the principles laid out by the Open Biological and Biomedical Ontologies Foundry two biobanking ontologies have been developed. These two ontologies were merged using a modular approach consistent with the initial development principles. The merging was facilitated by the fact that both ontologies use the same Upper Ontology and re-use classes from a similar set of pre-existing ontologies.
Based on the two previous ontologies the Ontology for Biobanking (http://purl.obolibrary.org/obo/obib.owl) was created. Due to the fact that there was no overlap between the two source ontologies the coverage of the resulting ontology is significantly larger than of the two source ontologies. The ontology is successfully used in managing biobank information of the Penn Medicine BioBank.
Sharing development principles and Upper Ontologies facilitates subsequent merging of ontologies to achieve a broader coverage.
生物样本库需要广泛整合数据,以实现数据分析和样本共享。本体已被证明是促进生物样本库相关数据更好语义整合的一种有前景的方法。迄今为止,尚无本体能够提供涵盖广泛生物样本库用户场景所需的覆盖范围。
基于开放生物和生物医学本体铸造厂制定的原则,开发了两个生物样本库本体。这两个本体使用与初始开发原则一致的模块化方法进行合并。由于两个本体都使用相同的上层本体,并从一组类似的现有本体中重用类,因此合并过程较为顺利。
基于之前的两个本体,创建了生物样本库本体(http://purl.obolibrary.org/obo/obib.owl)。由于两个源本体之间没有重叠,因此所得本体的覆盖范围明显大于两个源本体。该本体已成功用于管理宾夕法尼亚大学医学中心生物样本库的信息。
共享开发原则和上层本体有助于后续本体合并,以实现更广泛的覆盖范围。