

A possible extension to the RInChI as a means of providing machine readable process data.

Author information

Jacob Philipp-Maximilian, Lan Tian, Goodman Jonathan M, Lapkin Alexei A

Affiliations

Department of Chemical Engineering and Biotechnology, University of Cambridge, Philippa Fawcett Drive, Cambridge, CB3 0AS, UK.

Department of Chemistry, University of Cambridge, Cambridge, CB2 1EW, UK.

Publication information

J Cheminform. 2017 Apr 11;9(1):23. doi: 10.1186/s13321-017-0210-6.

Abstract

The algorithmic, large-scale use and analysis of reaction databases such as Reaxys is currently hindered by the absence of widely adopted standards for publishing reaction data in machine-readable formats. Crucial data such as the yields of all products or the stoichiometry are frequently not stated explicitly in published papers and, hence, are not reported in the database entries for those reactions, which limits their usefulness for algorithmic analysis. This paper presents a possible extension to the IUPAC RInChI standard via an auxiliary layer, termed ProcAuxInfo: a standardised, extensible form in which to report certain key reaction parameters, such as a declaration of all products, reactants and auxiliaries known in the reaction, the reaction stoichiometry, the amounts of substances used, conversion, yield and operating conditions. The standard is demonstrated by creating the RInChI, including the ProcAuxInfo layer, for three published reactions, and accurate data recoverability is shown via reverse translation of the created strings. Implementation of this or another method of reporting process data by the publishing community would ensure that databases such as Reaxys are able to abstract crucial data for big data analysis of their contents.
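The round trip the abstract describes (process data written into a string, then recovered by reverse translation) can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration: the `ProcessRecord` fields merely mirror the parameters listed in the abstract, the `ExampleProcLayer=` prefix and JSON payload are invented and are not the ProcAuxInfo syntax defined in the paper, and the species names and numbers are placeholders, not data from any published reaction.

```python
import json
from dataclasses import asdict, dataclass
from typing import Dict, List

# Hypothetical prefix for this sketch; NOT the real ProcAuxInfo layer syntax.
PREFIX = "ExampleProcLayer"


@dataclass
class ProcessRecord:
    """Hypothetical container for the parameters named in the abstract."""
    reactants: Dict[str, float]    # species -> stoichiometric coefficient
    products: Dict[str, float]     # species -> stoichiometric coefficient
    auxiliaries: List[str]         # solvents, catalysts and similar
    amounts_mol: Dict[str, float]  # species -> amount used, in mol
    conversion: float              # fraction between 0 and 1
    yields: Dict[str, float]       # product -> fractional yield
    conditions: Dict[str, float]   # e.g. temperature_K, pressure_bar


def encode(record: ProcessRecord) -> str:
    """Serialise a record to a single-line string (illustrative encoding)."""
    payload = json.dumps(asdict(record), sort_keys=True, separators=(",", ":"))
    return f"{PREFIX}={payload}"


def decode(layer: str) -> ProcessRecord:
    """Reverse translation: rebuild the record from the encoded string."""
    prefix, _, payload = layer.partition("=")
    if prefix != PREFIX:
        raise ValueError(f"unexpected layer prefix: {prefix!r}")
    return ProcessRecord(**json.loads(payload))


# Placeholder values only; they do not describe a real reaction.
record = ProcessRecord(
    reactants={"A": 1.0, "B": 2.0},
    products={"C": 1.0},
    auxiliaries={"solvent_X": None}.keys().__class__ and ["solvent_X"],
    amounts_mol={"A": 0.01, "B": 0.02},
    conversion=0.95,
    yields={"C": 0.9},
    conditions={"temperature_K": 298.15},
)
layer = encode(record)
recovered = decode(layer)
```

In this sketch, recoverability simply means that `recovered == record` holds after decoding. A real implementation would follow the layer grammar given in the paper and append the layer to an actual RInChI string.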


