Qu Mengyang, Sharma Gyanendra, Wada Naoki, Ikebata Hisaki, Matsunami Shigeyuki, Takahashi Kenji
Faculty of Biological Science and Technology, Institute of Science and Engineering, Kanazawa University, Kakuma-Machi, Kanazawa, 920-1192, Japan.
CrowdChem, Inc, 302, 1-5-28, Hiromachi, Shinagawa-ku, Tokyo, 140-0005, Japan.
J Cheminform. 2025 May 21;17(1):78. doi: 10.1186/s13321-025-01018-z.
Cellulose, a highly versatile material, faces challenges in processing due to its limited solubility in common solvents. Ionic liquids have been found to possess high solvating capacities for cellulose. However, the experimental development of ionic liquids with optimal cellulose solubilities remains a time-consuming trial-and-error process. In this work, a virtual molecular library containing billions of potentially de novo ionic liquid candidates has been generated utilizing Monte Carlo tree search and recurrent neural network techniques. The library is subsequently screened through two predictive machine learning models, which have been pre-trained for predicting cellulose solubility and melting point of ionic liquids. The promising candidates were further validated and screened using the Conductor-like Screening Model for Real Solvents (COSMO-RS) model. Our work offers an efficient workflow and virtual molecular library, which should facilitate theoretical and experimental development of novel ionic liquids.
纤维素是一种用途广泛的材料,但由于其在常见溶剂中的溶解度有限,在加工过程中面临挑战。人们发现离子液体对纤维素具有高溶解能力。然而,开发具有最佳纤维素溶解度的离子液体的实验过程仍然是一个耗时的试错过程。在这项工作中,利用蒙特卡罗树搜索和递归神经网络技术生成了一个包含数十亿潜在从头设计离子液体候选物的虚拟分子库。随后通过两个预测性机器学习模型对该库进行筛选,这两个模型已经过预训练,用于预测离子液体的纤维素溶解度和熔点。使用真实溶剂的导体类筛选模型(COSMO-RS)模型对有前景的候选物进行进一步验证和筛选。我们的工作提供了一个高效的工作流程和虚拟分子库,这应该有助于新型离子液体的理论和实验开发。