Suppr超能文献

层次与构建模块——迈向生命科学领域粒度框架

Levels and building blocks-toward a domain granularity framework for the life sciences.

作者信息

Vogt Lars

机构信息

Rheinische Friedrich-Wilhelms-Universität Bonn, Institut für Evolutionsbiologie und Ökologie, An der Immenburg 1, 53121, Bonn, Germany.

出版信息

J Biomed Semantics. 2019 Jan 28;10(1):4. doi: 10.1186/s13326-019-0196-2.

Abstract

BACKGROUND

With the emergence of high-throughput technologies, Big Data and eScience, the use of online data repositories and the establishment of new data standards that require data to be computer-parsable become increasingly important. As a consequence, there is an increasing need for an integrated system of hierarchies of levels of different types of material entities that helps with organizing, structuring and integrating data from disparate sources to facilitate data exploration, data comparison and analysis. Theories of granularity provide such integrated systems.

RESULTS

On the basis of formal approaches to theories of granularity authored by information scientists and ontology researchers, I discuss the shortcomings of some applications of the concept of levels and argue that the general theory of granularity proposed by Keet circumvents these problems. I introduce the concept of building blocks, which gives rise to a hierarchy of levels that can be formally characterized by Keet's theory. This hierarchy functions as an organizational backbone for integrating various other hierarchies that I briefly discuss, resulting in a domain granularity framework for the life sciences. I also discuss the consequences of this granularity framework for the structure of the top-level category of 'material entity' in Basic Formal Ontology.

CONCLUSIONS

The domain granularity framework suggested here is meant to provide the basis on which a more comprehensive information framework for the life sciences can be developed, which would provide the much needed conceptual framework for representing domains that cover multiple granularity levels. This framework can be used for intuitively structuring data in the life sciences, facilitating data exploration, and it can be employed for reasoning over different granularity levels across different hierarchies. It would provide a methodological basis for establishing comparability between data sets and for quantitatively measuring their degree of semantic similarity.

摘要

背景

随着高通量技术、大数据和电子科学的出现,在线数据存储库的使用以及要求数据可由计算机解析的新数据标准的建立变得越来越重要。因此,越来越需要一个不同类型物质实体层次结构的集成系统,以帮助组织、构建和整合来自不同来源的数据,促进数据探索、数据比较和分析。粒度理论提供了这样的集成系统。

结果

基于信息科学家和本体研究者提出的粒度理论的形式化方法,我讨论了层次概念某些应用的缺点,并认为基特提出的粒度通用理论规避了这些问题。我引入了构建块的概念,它产生了一个层次结构,该层次结构可以由基特的理论进行形式化表征。这个层次结构作为一个组织主干,用于整合我简要讨论的各种其他层次结构,从而形成生命科学的领域粒度框架。我还讨论了这个粒度框架对基础形式本体中“物质实体”顶级类别的结构的影响。

结论

这里提出的领域粒度框架旨在为开发更全面的生命科学信息框架提供基础,该框架将为表示涵盖多个粒度层次的领域提供急需的概念框架。这个框架可用于直观地构建生命科学中的数据,促进数据探索,并且可用于跨不同层次结构对不同粒度层次进行推理。它将为建立数据集之间的可比性以及定量测量它们的语义相似程度提供方法基础。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验