Suppr超能文献

基于化学本体的化合物自动分类。

Automated compound classification using a chemical ontology.

机构信息

OntoChem GmbH, Heinrich-Damerow-Str, 4, Halle/Saale, D-06120, Germany.

出版信息

J Cheminform. 2012 Dec 29;4(1):40. doi: 10.1186/1758-2946-4-40.

Abstract

BACKGROUND

Classification of chemical compounds into compound classes by using structure derived descriptors is a well-established method to aid the evaluation and abstraction of compound properties in chemical compound databases. MeSH and recently ChEBI are examples of chemical ontologies that provide a hierarchical classification of compounds into general compound classes of biological interest based on their structural as well as property or use features. In these ontologies, compounds have been assigned manually to their respective classes. However, with the ever increasing possibilities to extract new compounds from text documents using name-to-structure tools and considering the large number of compounds deposited in databases, automated and comprehensive chemical classification methods are needed to avoid the error prone and time consuming manual classification of compounds.

RESULTS

In the present work we implement principles and methods to construct a chemical ontology of classes that shall support the automated, high-quality compound classification in chemical databases or text documents. While SMARTS expressions have already been used to define chemical structure class concepts, in the present work we have extended the expressive power of such class definitions by expanding their structure-based reasoning logic. Thus, to achieve the required precision and granularity of chemical class definitions, sets of SMARTS class definitions are connected by OR and NOT logical operators. In addition, AND logic has been implemented to allow the concomitant use of flexible atom lists and stereochemistry definitions. The resulting chemical ontology is a multi-hierarchical taxonomy of concept nodes connected by directed, transitive relationships.

CONCLUSIONS

A proposal for a rule based definition of chemical classes has been made that allows to define chemical compound classes more precisely than before. The proposed structure-based reasoning logic allows to translate chemistry expert knowledge into a computer interpretable form, preventing erroneous compound assignments and allowing automatic compound classification. The automated assignment of compounds in databases, compound structure files or text documents to their related ontology classes is possible through the integration with a chemical structure search engine. As an application example, the annotation of chemical structure files with a prototypic ontology is demonstrated.

摘要

背景

通过使用结构衍生描述符将化合物分类为化合物类,这是一种成熟的方法,可以帮助评估和抽象化学化合物数据库中的化合物性质。MeSH 和最近的 ChEBI 是化学本体的示例,它们基于结构以及性质或用途特征,将化合物分类为具有生物学意义的一般化合物类。在这些本体中,化合物是手动分配到各自的类别的。然而,随着使用名称到结构工具从文本文档中提取新化合物的可能性不断增加,并且考虑到数据库中存储的大量化合物,需要自动化和全面的化学分类方法来避免化合物的易错和耗时的手动分类。

结果

在本工作中,我们实现了构建化学类本体的原理和方法,以支持在化学数据库或文本文档中进行自动化、高质量的化合物分类。虽然 SMARTS 表达式已经用于定义化学结构类概念,但在本工作中,我们通过扩展其基于结构的推理逻辑扩展了此类定义的表达能力。因此,为了实现化学类定义所需的精度和粒度,SMARTS 类定义集通过 OR 和 NOT 逻辑运算符连接。此外,实现了 AND 逻辑以允许同时使用灵活的原子列表和立体化学定义。由此产生的化学本体是通过有向、传递关系连接的概念节点的多层次分类法。

结论

提出了一种基于规则的化学类定义建议,该建议允许比以前更精确地定义化学化合物类。所提出的基于结构的推理逻辑允许将化学专家知识转化为计算机可解释的形式,防止错误的化合物分配,并允许自动化合物分类。通过与化学结构搜索引擎集成,可以将数据库、化合物结构文件或文本文档中的化合物自动分配给相关的本体类。作为应用示例,演示了用原型本体注释化学结构文件。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6a67/3547776/a2fc233e2c90/1758-2946-4-40-1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验