Laboratory of Pharmaceutical Process Analytical Technology, Department of Pharmaceutical Analysis, Ghent University, Ottergemsesteenweg 460, 9000 Ghent, Belgium.
Pharmaceutical Engineering Research Group, Department of Pharmaceutical Analysis, Ghent University, Ottergemsesteenweg 460, 9000 Ghent, Belgium.
Int J Pharm. 2024 Sep 5;662:124463. doi: 10.1016/j.ijpharm.2024.124463. Epub 2024 Jul 14.
T-shaped partial least squares regression (T-PLSR) is a valuable machine learning technique for the formulation and manufacturing process development of new drug products. An accurate T-PLSR model requires experimental data with multiple formulations and process conditions. However, it is usually challenging to collect comprehensive experimental data using large-scale manufacturing equipment because of the cost, time, and large consumption of raw materials. This study proposes a hybrid modeling of T-PLSR and transfer learning (TL) to enhance the prediction performance of a T-PLSR model for large-scale manufacturing data by exploiting a large amount of small-scale manufacturing data for model building. The proposed method of T-PLSR+TL was applied to a practical case study focusing on scaling up the tableting process from an experienced compaction simulator to a less-experienced rotary tablet press. The T-PLSR+TL models achieved significantly better prediction performance for tablet quality attributes of new drug products than T-PLSR models without using large-scale manufacturing data with new drug products. The results demonstrated that T-PLSR+TL is more capable of addressing new drug products than T-PLSR by using small-scale manufacturing data to cover a scarcity of large-scale manufacturing data. Furthermore, T-PLSR+TL holds the potential to streamline formulation and manufacturing process development activities for new drug products using an extensive database.
T 型偏最小二乘回归(T-PLSR)是一种用于新药产品配方和制造工艺开发的有价值的机器学习技术。一个准确的 T-PLSR 模型需要具有多种配方和工艺条件的实验数据。然而,由于成本、时间和原材料的大量消耗,使用大型制造设备收集全面的实验数据通常具有挑战性。本研究提出了 T-PLSR 和迁移学习(TL)的混合建模,通过利用大量的小规模制造数据来构建模型,从而提高 T-PLSR 模型对大规模制造数据的预测性能。所提出的 T-PLSR+TL 方法应用于一个实际案例研究,重点是将压片工艺从经验丰富的压缩模拟器扩展到经验较少的旋转压片机。与不使用新药物产品的大规模制造数据的 T-PLSR 模型相比,T-PLSR+TL 模型对新药物产品的片剂质量属性具有显著更好的预测性能。结果表明,T-PLSR+TL 通过使用小规模制造数据来弥补大规模制造数据的稀缺性,比 T-PLSR 更能解决新药产品的问题。此外,T-PLSR+TL 有可能使用广泛的数据库来简化新药产品的配方和制造工艺开发活动。