Suppr超能文献

化学实体语义规范:用于高效语义化学信息学和便捷数据集成的知识表示。

Chemical Entity Semantic Specification: Knowledge representation for efficient semantic cheminformatics and facile data integration.

机构信息

Department of Biology, Carleton University, Ottawa, Canada.

出版信息

J Cheminform. 2011 May 19;3(1):20. doi: 10.1186/1758-2946-3-20.

Abstract

BACKGROUND

Over the past several centuries, chemistry has permeated virtually every facet of human lifestyle, enriching fields as diverse as medicine, agriculture, manufacturing, warfare, and electronics, among numerous others. Unfortunately, application-specific, incompatible chemical information formats and representation strategies have emerged as a result of such diverse adoption of chemistry. Although a number of efforts have been dedicated to unifying the computational representation of chemical information, disparities between the various chemical databases still persist and stand in the way of cross-domain, interdisciplinary investigations. Through a common syntax and formal semantics, Semantic Web technology offers the ability to accurately represent, integrate, reason about and query across diverse chemical information.

RESULTS

Here we specify and implement the Chemical Entity Semantic Specification (CHESS) for the representation of polyatomic chemical entities, their substructures, bonds, atoms, and reactions using Semantic Web technologies. CHESS provides means to capture aspects of their corresponding chemical descriptors, connectivity, functional composition, and geometric structure while specifying mechanisms for data provenance. We demonstrate that using our readily extensible specification, it is possible to efficiently integrate multiple disparate chemical data sources, while retaining appropriate correspondence of chemical descriptors, with very little additional effort. We demonstrate the impact of some of our representational decisions on the performance of chemically-aware knowledgebase searching and rudimentary reaction candidate selection. Finally, we provide access to the tools necessary to carry out chemical entity encoding in CHESS, along with a sample knowledgebase.

CONCLUSIONS

By harnessing the power of Semantic Web technologies with CHESS, it is possible to provide a means of facile cross-domain chemical knowledge integration with full preservation of data correspondence and provenance. Our representation builds on existing cheminformatics technologies and, by the virtue of RDF specification, remains flexible and amenable to application- and domain-specific annotations without compromising chemical data integration. We conclude that the adoption of a consistent and semantically-enabled chemical specification is imperative for surviving the coming chemical data deluge and supporting systems science research.

摘要

背景

在过去的几个世纪中,化学几乎渗透到人类生活方式的各个方面,丰富了医学、农业、制造业、战争和电子等众多领域。不幸的是,由于化学的这种多样化应用,出现了特定于应用的、不兼容的化学信息格式和表示策略。尽管已经做出了许多努力来统一化学信息的计算表示,但是各种化学数据库之间仍然存在差异,这阻碍了跨领域、跨学科的研究。通过通用语法和形式语义,语义网技术提供了在不同化学信息之间准确表示、集成、推理和查询的能力。

结果

在这里,我们使用语义网技术为多原子化学实体、它们的子结构、键、原子和反应指定和实现了化学实体语义规范 (CHESS),用于表示它们。CHESS 提供了捕获它们相应的化学描述符、连通性、功能组成和几何结构的方法,同时指定了数据来源的机制。我们证明,使用我们易于扩展的规范,可以高效地集成多个不同的化学数据源,同时保留适当的化学描述符对应关系,而无需额外的努力。我们展示了一些表示决策对具有化学意识的知识库搜索和基本反应候选选择的性能的影响。最后,我们提供了在 CHESS 中进行化学实体编码所需的工具以及一个示例知识库。

结论

通过利用 CHESS 的语义网技术的力量,我们可以提供一种简便的跨领域化学知识集成方法,同时完全保留数据对应关系和来源。我们的表示形式建立在现有的化学信息学技术之上,并且由于 RDF 规范的存在,保持了灵活性和对特定于应用程序和领域的注释的适应性,而不会影响化学数据集成。我们得出结论,采用一致的和语义化的化学规范对于应对即将到来的化学数据洪流和支持系统科学研究是必要的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2bf2/3121712/bac98f2b038a/1758-2946-3-20-1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验