Department of Artificial Intelligence, Sejong University, Seoul 05006, Republic of Korea.
Molecules. 2023 Jun 16;28(12):4821. doi: 10.3390/molecules28124821.
Drug-phospholipid complexing is a promising formulation technology for improving the low bioavailability of active pharmaceutical ingredients (APIs). However, identifying whether phospholipid and candidate drug can form a complex through in vitro tests can be costly and time-consuming due to the physicochemical properties and experimental environment. In a previous study, the authors developed seven machine learning models to predict drug-phospholipid complex formation, and the lightGBM model demonstrated the best performance. However, the previous study was unable to sufficiently address the degradation of test performance caused by the small size of the training data with class imbalance, and it had the limitation of considering only machine learning techniques. To overcome these limitations, we propose a new deep learning-based prediction model that employs variational autoencoder (VAE) and principal component analysis (PCA) techniques to improve prediction performance. The model uses a multi-layer one-dimensional convolutional neural network (CNN) with a skip connection to effectively capture the complex relationship between drugs and lipid molecules. The computer simulation results demonstrate that our proposed model performs better than the previous model in all performance metrics.
药物-磷脂复合物是一种有前途的制剂技术,可提高活性药物成分(API)的低生物利用度。然而,由于物理化学性质和实验环境的原因,通过体外试验来确定磷脂和候选药物是否能形成复合物可能既昂贵又耗时。在之前的一项研究中,作者开发了七种机器学习模型来预测药物-磷脂复合物的形成,其中 lightGBM 模型表现出最好的性能。然而,之前的研究未能充分解决由于训练数据小且存在类不平衡而导致的测试性能退化问题,并且它仅限于考虑机器学习技术。为了克服这些限制,我们提出了一种新的基于深度学习的预测模型,该模型使用变分自动编码器(VAE)和主成分分析(PCA)技术来提高预测性能。该模型使用具有跳过连接的多层一维卷积神经网络(CNN),以有效地捕捉药物和脂质分子之间的复杂关系。计算机模拟结果表明,我们提出的模型在所有性能指标上均优于之前的模型。