基于真实世界研究,使用机器学习和深度学习技术预测拉帕替尼剂量方案
Predicting Lapatinib Dose Regimen Using Machine Learning and Deep Learning Techniques Based on a Real-World Study.
作者信息
Yu Ze, Ye Xuan, Liu Hongyue, Li Huan, Hao Xin, Zhang Jinyuan, Kou Fang, Wang Zeyuan, Wei Hai, Gao Fei, Zhai Qing
机构信息
Institute of Interdisciplinary Integrative Medicine Research, Shanghai University of Traditional Chinese Medicine, Shanghai, China.
Department of Pharmacy, Fudan University Shanghai Cancer Center, Shanghai, China.
出版信息
Front Oncol. 2022 Jun 3;12:893966. doi: 10.3389/fonc.2022.893966. eCollection 2022.
Lapatinib is used for the treatment of metastatic HER2(+) breast cancer. We aim to establish a prediction model for lapatinib dose using machine learning and deep learning techniques based on a real-world study. There were 149 breast cancer patients enrolled from July 2016 to June 2017 at Fudan University Shanghai Cancer Center. The sequential forward selection algorithm based on random forest was applied for variable selection. Twelve machine learning and deep learning algorithms were compared in terms of their predictive abilities (logistic regression, SVM, random forest, Adaboost, XGBoost, GBDT, LightGBM, CatBoost, TabNet, ANN, Super TML, and Wide&Deep). As a result, TabNet was chosen to construct the prediction model with the best performance (accuracy = 0.82 and AUC = 0.83). Afterward, four variables that strongly correlated with lapatinib dose were ranked importance score as follows: treatment protocols, weight, number of chemotherapy treatments, and number of metastases. Finally, the confusion matrix was used to validate the model for a dose regimen of 1,250 mg lapatinib (precision = 81% and recall = 95%), and for a dose regimen of 1,000 mg lapatinib (precision = 87% and recall = 64%). To conclude, we established a deep learning model to predict lapatinib dose based on important influencing variables selected from real-world evidence, to achieve an optimal individualized dose regimen with good predictive performance.
拉帕替尼用于治疗转移性HER2(+)乳腺癌。我们旨在基于一项真实世界研究,运用机器学习和深度学习技术建立一个拉帕替尼剂量预测模型。2016年7月至2017年6月期间,复旦大学附属肿瘤医院招募了149例乳腺癌患者。采用基于随机森林的逐步前向选择算法进行变量选择。比较了12种机器学习和深度学习算法的预测能力(逻辑回归、支持向量机、随机森林、Adaboost、XGBoost、梯度提升决策树、LightGBM、CatBoost、TabNet、人工神经网络、超级TML和宽深模型)。结果,选择TabNet构建性能最佳的预测模型(准确率=0.82,曲线下面积=0.83)。随后,与拉帕替尼剂量密切相关的四个变量按重要性得分排序如下:治疗方案、体重、化疗次数和转移灶数量。最后,使用混淆矩阵对1250mg拉帕替尼剂量方案(精确率=81%,召回率=95%)和1000mg拉帕替尼剂量方案(精确率=87%,召回率=64%)的模型进行验证。总之,我们基于从真实世界证据中选择的重要影响变量建立了一个深度学习模型来预测拉帕替尼剂量,以实现具有良好预测性能的最佳个体化剂量方案。