Al-Jubouri B, Desiati I, Wijanarko W, Espallargas N
The Norwegian Tribology Center, Department of Mechanical and Industrial Engineering, Norwegian University of Science and Technology (NTNU), R. Birkelandsvei 2B, Trondheim 7491, Norway.
Department of Computer Science, York St John University, Lord Mayor's Walk, York, York YO31 7EX, United Kingdom.
ACS Appl Mater Interfaces. 2025 Mar 19;17(11):16725-16737. doi: 10.1021/acsami.4c10622. Epub 2025 Mar 10.
Lubricants are complex mixtures of chemicals that help machines function at the right level of friction and wear. Lubricant formulation methods are based on empirical experience of chemical substances that have been used as lubricants for decades. In the last years, the discussion about their environmental problem has triggered new legislations resulting in the search for Environmentally Acceptable Lubricants, which should be biodegradable, minimally toxic, and nonbioaccumulative. Finding new chemicals that comply with these three criteria is a long and expensive process that can be boosted by machine learning (ML). In this paper, we are addressing toxicity prediction with machine learning models by exploring the application of ensemble learners to chemicals having imbalanced data distribution. We investigated the effectiveness of sampling techniques to balance the data and improve the performance of the ensemble learning model. The model can predict toxicity for nonundersampled groups, which in our case corresponds to the moderately to highly toxic groups. The results of this work are useful for lubricant formulators since regulations accept moderate-to-highly toxic chemicals in lubricants if their concentration is below 20 wt %.
润滑剂是化学物质的复杂混合物,有助于机器在适当的摩擦和磨损水平下运行。润滑剂配方方法基于数十年来用作润滑剂的化学物质的经验。近年来,关于其环境问题的讨论引发了新的立法,促使人们寻找环境可接受的润滑剂,这种润滑剂应具有可生物降解、低毒性和非生物累积性。寻找符合这三个标准的新化学物质是一个漫长而昂贵的过程,机器学习(ML)可以加速这一过程。在本文中,我们通过探索集成学习器在数据分布不均衡的化学物质中的应用,用机器学习模型解决毒性预测问题。我们研究了采样技术平衡数据和提高集成学习模型性能的有效性。该模型可以预测未进行欠采样的组的毒性,在我们的案例中,这些组对应于中度至高度毒性组。这项工作的结果对润滑剂配方师很有用,因为如果润滑剂中中度至高度毒性化学物质的浓度低于20 wt%,法规是允许的。