Dražić Ena, Jelušić Darijan, Janković Bevandić Patrizia, Mauša Goran, Kalafatovic Daniela
University of Rijeka, Center for Artificial Intelligence and Cybersecurity, 51000 Rijeka, Croatia.
University of Rijeka, Faculty of Engineering, 51000 Rijeka, Croatia.
ACS Nano. 2025 Jun 10;19(22):20295-20320. doi: 10.1021/acsnano.5c00670. Epub 2025 May 29.
Peptides can serve as building blocks for supramolecular materials because of their unique ability to self-assemble, offering potential applications in drug delivery, tissue engineering, and nanotechnology. In this review, we describe peptide self-assembly as a sequence- and context-dependent process and its resulting complexity due to the heterogeneity of the sequences and experimental conditions, which makes cross-laboratory reproducibility a serious challenge and standardized reporting a necessity. Given the large number of possible peptide permutations, machine learning (ML) is suitable for navigating the peptide search space with the aim of reducing trial-and-error experimentation and speeding up the discovery of self-assembling peptides. However, we point out that ML is not a point-and-shoot tool that can be applied directly to any problem and requires careful consideration, domain knowledge, and proper data preparation to achieve meaningful results. In addition, we discuss the lack of negative data reported to be the main limiting factor in the effective application of ML. Considering the transformative potential of artificial intelligence, we conclude that grasping the power of large language models and generative approaches, coupled with explainability techniques, will expedite peptide nanomaterials discovery.
肽由于其独特的自组装能力,可作为超分子材料的构建单元,在药物递送、组织工程和纳米技术等领域具有潜在应用。在本综述中,我们将肽的自组装描述为一个依赖于序列和环境的过程,以及由于序列和实验条件的异质性而导致的复杂性,这使得跨实验室的可重复性成为一个严峻挑战,标准化报告成为必要。鉴于肽的排列组合数量众多,机器学习(ML)适用于探索肽的搜索空间,以减少试错实验并加速自组装肽的发现。然而,我们指出,ML并非一种可以直接应用于任何问题的现成工具,需要仔细考虑、领域知识和适当的数据准备才能获得有意义的结果。此外,我们讨论了缺乏阴性数据被认为是ML有效应用的主要限制因素。考虑到人工智能的变革潜力,我们得出结论,掌握大语言模型和生成方法的力量,再结合可解释性技术,将加速肽纳米材料的发现。