Suppr超能文献

自动化实验为化学领域的数据科学提供动力。

Automated Experimentation Powers Data Science in Chemistry.

机构信息

Department of Chemistry, University of British Columbia, 2036 Main Mall, Vancouver, British Columbia V6T 1Z3, Canada.

出版信息

Acc Chem Res. 2021 Feb 2;54(3):546-555. doi: 10.1021/acs.accounts.0c00736. Epub 2021 Jan 20.

Abstract

Data science has revolutionized chemical research and continues to break down barriers with new interdisciplinary studies. The introduction of computational models and machine learning (ML) algorithms in combination with automation and traditional experimental techniques has enabled scientific advancement across nearly every discipline of chemistry, from materials discovery, to process optimization, to synthesis planning. However, predictive tools powered by data science are only as good as their data sets and, currently, many of the data sets used to train models suffer from several limitations, including being sparse, limited in scope and requiring human curation. Likewise, computational data faces limitations in terms of accurate modeling of nonideal systems and can suffer from low translation fidelity from simulation to real conditions. The lack of diverse data and the need to be able to test it experimentally reduces both the accuracy and scope of the predictive models derived from data science. This Account contextualizes the need for more complex and diverse experimental data and highlights how the seamless integration of robotics, machine learning, and data-rich monitoring techniques can be used to access it with minimal human labor.We propose three broad categories of data in chemistry: data on fundamental properties, data on reaction outcomes, and data on reaction mechanics. We highlight flexible, automated platforms that can be deployed to acquire and leverage these data. The first platform combines solid- and liquid-dosing modules with computer vision to automate solubility screening, thereby gathering fundamental data that are necessary for almost every experimental design. Using computer vision offers the additional benefit of creating a visual record, which can be referenced and used to further interrogate and gain insight on the data collected. The second platform iteratively tests reaction variables proposed by a ML algorithm in a closed-loop fashion. Experimental data related to reaction outcomes are fed back into the algorithm to drive the discovery and optimization of new materials and chemical processes. The third platform uses automated process analytical technology to gather real-time data related to reaction kinetics. This system allows the researcher to directly interrogate the reaction mechanisms in granular detail to determine exactly how and why a reaction proceeds, thereby enabling reaction optimization and deployment.

摘要

数据科学已经彻底改变了化学研究,并通过新的跨学科研究继续打破障碍。计算模型和机器学习 (ML) 算法的引入与自动化和传统实验技术相结合,使几乎每一个化学领域都取得了科学进步,从材料发现到工艺优化,再到合成规划。然而,由数据科学驱动的预测工具与其数据集一样好,目前,许多用于训练模型的数据集中存在多种限制,包括稀疏性、范围有限且需要人工管理。同样,计算数据在准确模拟非理想系统方面存在限制,并且可能在从模拟到实际条件的转换过程中保真度较低。缺乏多样化的数据以及需要能够对其进行实验测试,这降低了从数据科学中得出的预测模型的准确性和范围。本专题介绍了对更复杂和多样化实验数据的需求,并强调了机器人技术、机器学习和数据丰富的监测技术的无缝集成如何用于以最小的人工劳动来获取这些数据。我们提出了化学中的三类数据:基本性质数据、反应结果数据和反应力学数据。我们重点介绍了可以部署这些数据的灵活、自动化平台。第一个平台结合了固液加样模块和计算机视觉来自动化溶解度筛选,从而收集几乎每个实验设计都需要的基本数据。使用计算机视觉还具有创建可视记录的额外好处,可参考并用于进一步查询和深入了解所收集的数据。第二个平台以闭环方式迭代测试 ML 算法提出的反应变量。与反应结果相关的实验数据被反馈到算法中,以推动新材料和化学工艺的发现和优化。第三个平台使用自动化过程分析技术来收集与反应动力学相关的实时数据。该系统允许研究人员直接详细地查询反应机制,以确定反应是如何以及为什么进行的,从而实现反应优化和部署。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验