Martinez Aritz D, Navajas-Guerrero Adriana, Bediaga-Bañeres Harbil, Sánchez-Bodón Julia, Ortiz Pablo, Vilas-Vilela Jose Luis, Moreno-Benitez Isabel, Gil-Lopez Sergio
TECNALIA, Basque Research & Technology Alliance (BRTA), Technological Park of Bizkaia, 48160 Derio, Spain.
Grupo de Química Macromolecular (LABQUIMAC), Departamento de Química Física, Facultad de Ciencia y Tecnología, Universidad del País Vasco UPV/EHU, 48940 Leioa, Spain.
Polymers (Basel). 2024 Oct 19;16(20):2936. doi: 10.3390/polym16202936.
Recent advancements in materials science have garnered significant attention within the research community. Over the past decade, substantial efforts have been directed towards the exploration of innovative methodologies for developing new materials. These efforts encompass enhancements to existing products or processes and the design of novel materials. Of particular significance is the synthesis of specific polymers through the copolymerization of epoxides with CO. However, several uncertainties emerge in this chemical process, including challenges associated with successful polymerization and the properties of the resulting materials. These uncertainties render the design of new polymers a trial-and-error endeavor, often resulting in failed outcomes that entail significant financial, human resource, and time investments due to unsuccessful experimentation. Artificial Intelligence (AI) emerges as a promising technology to mitigate these drawbacks during the experimental phase. Nonetheless, the availability of high-quality data remains crucial, posing particular challenges in the context of polymeric materials, mainly because of the stochastic nature of polymers, which impedes their homogeneous representation, and the variation in their properties based on their processing. In this study, the first dataset linking the structure of the epoxy comonomer, the catalyst employed, and the experimental conditions of polymerization to the reaction's success is described. A novel analytical pipeline based on ML to effectively exploit the constructed database is introduced. The initial results underscore the importance of addressing the dimensionality problem. The outcomes derived from the proposed analytical pipeline, which infer the molecular weight, polydispersity index, and conversion rate, demonstrate promising adjustment values for all target parameters. The best results are measured in terms of the (Determination Coefficient) R2 between real and predicted values for all three target magnitudes. The best proposed solution provides a R2 equal to 0.79, 0.86, and 0.93 for the molecular weight, polydispersity index, and conversion rate, respectively. The proposed analytical pipeline is automatized (including AutoML techniques for ML models hyperparameter tuning), allowing easy scalability as the database grows, laying the foundation for future research.
材料科学的最新进展在研究界引起了广泛关注。在过去十年中,人们投入了大量精力探索开发新材料的创新方法。这些努力包括改进现有产品或工艺以及设计新型材料。特别重要的是通过环氧化物与一氧化碳的共聚来合成特定聚合物。然而,这个化学过程中出现了一些不确定性,包括成功聚合的挑战以及所得材料的性能。这些不确定性使得新聚合物的设计成为一项反复试验的工作,由于实验不成功,往往会导致失败的结果,需要大量的资金、人力资源和时间投入。人工智能(AI)作为一种有前途的技术,有望在实验阶段减轻这些缺点。尽管如此,高质量数据的可用性仍然至关重要,在聚合物材料的背景下尤其具有挑战性,主要是因为聚合物的随机性质阻碍了它们的均匀表示,以及它们的性能会因加工而变化。在本研究中,描述了第一个将环氧共聚单体的结构、所用催化剂以及聚合实验条件与反应成功与否联系起来的数据集。介绍了一种基于机器学习(ML)的新型分析流程,以有效利用构建的数据库。初步结果强调了解决维度问题的重要性。从所提出的分析流程得出的推断分子量、多分散指数和转化率的结果,显示出所有目标参数都有很有前景的调整值。最佳结果是根据所有三个目标量的实际值与预测值之间的(决定系数)R2来衡量的。所提出的最佳解决方案分别为分子量、多分散指数和转化率提供了等于0.79、0.86和0.93的R2。所提出的分析流程是自动化的(包括用于ML模型超参数调整的自动机器学习技术),随着数据库的增长便于轻松扩展,为未来的研究奠定了基础。