Kotzabasaki Marianna, Sotiropoulos Iason, Charitidis Costas, Sarimveis Haralambos
School of Chemical Engineering, National Technical University of Athens 9 Heroon Polytechneiou Street, Zografou Campus 15780 Athens Greece
Nanoscale Adv. 2021 Apr 12;3(11):3167-3176. doi: 10.1039/d0na00600a. eCollection 2021 Jun 1.
Multi-walled carbon nanotubes (MWCNTs) are made of multiple single-walled carbon nanotubes (SWCNTs) which are nested inside one another forming concentric cylinders. These nanomaterials are widely used in industrial and biomedical applications, due to their unique physicochemical characteristics. However, previous studies have shown that exposure to MWCNTs may lead to toxicity and some of the physicochemical properties of MWCNTs can influence their toxicological profiles. modelling can be applied as a faster and less costly alternative to experimental ( and ) testing for the hazard characterization of MWCNTs. This study aims at developing a fully validated predictive nanoinformatics model based on statistical and machine learning approaches for the accurate prediction of genotoxicity of different types of MWCNTs. Towards this goal, a number of different computational workflows were designed, combining unsupervised (Principal Component Analysis, PCA) and supervised classification techniques (Support Vectors Machine, "SVM", Random Forest, "RF", Logistic Regression, "LR" and Naïve Bayes, "NB") and Bayesian optimization. The Recursive Feature Elimination (RFE) method was applied for selecting the most important variables. An RF model using only three features was selected as the most efficient for predicting the genotoxicity of MWCNTs, exhibiting 80% accuracy on external validation and high classification probabilities. The most informative features selected by the model were "Length", "Zeta average" and "Purity".
多壁碳纳米管(MWCNTs)由多个单壁碳纳米管(SWCNTs)组成,这些单壁碳纳米管相互嵌套形成同心圆柱。由于其独特的物理化学特性,这些纳米材料被广泛应用于工业和生物医学领域。然而,先前的研究表明,接触多壁碳纳米管可能会导致毒性,并且多壁碳纳米管的一些物理化学性质会影响其毒理学特征。建模可以作为一种更快且成本更低的替代方法,用于多壁碳纳米管危害特征的实验(和)测试。本研究旨在基于统计和机器学习方法开发一个经过充分验证的预测性纳米信息学模型,以准确预测不同类型多壁碳纳米管的遗传毒性。为实现这一目标,设计了许多不同的计算工作流程,结合了无监督(主成分分析,PCA)和监督分类技术(支持向量机,“SVM”,随机森林,“RF”,逻辑回归,“LR”和朴素贝叶斯,“NB”)以及贝叶斯优化。应用递归特征消除(RFE)方法来选择最重要的变量。选择仅使用三个特征的随机森林模型作为预测多壁碳纳米管遗传毒性最有效的模型,在外部验证中表现出80%的准确率和高分类概率。该模型选择的最具信息性的特征是“长度”、“ζ平均”和“纯度”。