Nguyen Danh, Tao Lei, Li Ying
Department of Mechanical Engineering, University of Connecticut, Mansfield, CT, United States.
Polymer Program, Institute of Materials Science, University of Connecticut, Mansfield, CT, United States.
Front Chem. 2022 Jan 24;9:820417. doi: 10.3389/fchem.2021.820417. eCollection 2021.
In recent years, the synthesis of monomer sequence-defined polymers has expanded into broad-spectrum applications in biomedical, chemical, and materials science fields. Pursuing the characterization and inverse design of these polymer systems requires our fundamental understanding not only at the individual monomer level, but also considering the chain scales, such as polymer configuration, self-assembly, and phase separation. However, our accessibility to this field is still rudimentary due to the limitations of traditional design approaches, the complexity of chemical space along with the burdened cost and time issues that prevent us from unveiling the underlying monomer sequence-structure-property relationships. Fortunately, thanks to the recent advancements in molecular dynamics simulations and machine learning (ML) algorithms, the bottlenecks in the tasks of establishing the structure-function correlation of the polymer chains can be overcome. In this review, we will discuss the applications of the integration between ML techniques and coarse-grained molecular dynamics (CGMD) simulations to solve the current issues in polymer science at the chain level. In particular, we focus on the case studies in three important topics-polymeric configuration characterization, feed-forward property prediction, and inverse design-in which CGMD simulations are leveraged to generate training datasets to develop ML-based surrogate models for specific polymer systems and designs. By doing so, this computational hybridization allows us to well establish the monomer sequence-functional behavior relationship of the polymers as well as guide us toward the best polymer chain candidates for the inverse design in undiscovered chemical space with reasonable computational cost and time. Even though there are still limitations and challenges ahead in this field, we finally conclude that this CGMD/ML integration is very promising, not only in the attempt of bridging the monomeric and macroscopic characterizations of polymer materials, but also enabling further tailored designs for sequence-specific polymers with superior properties in many practical applications.
近年来,单体序列定义聚合物的合成已扩展到生物医学、化学和材料科学领域的广泛应用。要对这些聚合物体系进行表征和逆向设计,不仅需要我们在单个单体层面有基本的了解,还需要考虑链尺度,如聚合物构型、自组装和相分离。然而,由于传统设计方法的局限性、化学空间的复杂性以及成本和时间负担问题,阻碍我们揭示潜在的单体序列-结构-性能关系,因此我们对该领域的了解仍然很初步。幸运的是,得益于分子动力学模拟和机器学习(ML)算法的最新进展,聚合物链结构-功能相关性建立任务中的瓶颈可以被克服。在这篇综述中,我们将讨论ML技术与粗粒化分子动力学(CGMD)模拟相结合在解决聚合物科学链层面当前问题中的应用。特别是,我们专注于三个重要主题的案例研究——聚合物构型表征、前馈性能预测和逆向设计,其中利用CGMD模拟生成训练数据集,为特定聚合物体系和设计开发基于ML的替代模型。通过这样做,这种计算杂交使我们能够很好地建立聚合物的单体序列-功能行为关系,并以合理的计算成本和时间引导我们在未发现的化学空间中寻找逆向设计的最佳聚合物链候选物。尽管该领域仍面临限制和挑战,但我们最终得出结论,这种CGMD/ML集成非常有前景,不仅有助于弥合聚合物材料的单体和宏观表征之间的差距,还能在许多实际应用中为具有优异性能的序列特异性聚合物实现进一步的定制设计。