Oncogen Pharma (Malaysia), Sdn Bhd, 3, Jalan Jururancang U1/21, Hicom-glenmarie Industrial Park, 40150, Shah Alam, Selangor, Malaysia.
A2Z4.0 Research and Analytics Private Limited, Old No:810, New No:62, CTH Road, Behind Lenskart, Thirumullaivoil, Chennai, Tamilnadu, India.
AAPS PharmSciTech. 2022 Oct 13;23(7):277. doi: 10.1208/s12249-022-02403-9.
NIR spectroscopy is a non-destructive characterization tool for the blend uniformity (BU) assessment. However, NIR spectra of powder blends often contain overlapping physical and chemical information of the samples. Deconvoluting the information related to chemical properties from that associated with the physical effects is one of the major objectives of this work. We achieve this aim in two ways. Firstly, we identified various sources of variability that might affect the BU results. Secondly, we leverage the machine learning-based sophisticated data analytics processes. To accomplish the aforementioned objectives, calibration samples of amlodipine as an active pharmaceutical ingredient (API) with the concentrations ranging between 67 and 133% w/w (dose ~ 3.6% w/w), in powder blends containing excipients, were prepared using a gravimetric approach and assessed using NIR spectroscopic analysis, followed by HPLC measurements. The bias in NIR results was investigated by employing data quality metrics (DQM) and bias-variance decomposition (BVD). To overcome the bias, the clustered regression (non-parametric and linear) was applied. We assessed the model's performance by employing the hold-out and k-fold internal cross-validation (CV). NIR-based blend homogeneity with low mean absolute error and an interval estimates of 0.674 (mean) ± 0.218 (standard deviation) w/w was established. Additionally, bootstrapping-based CV was leveraged as part of the NIR method lifecycle management that demonstrated the mean absolute error (MAE) of BU ± 3.5% w/w and BU ± 1.5% w/w for model generalizability and model transferability, respectively. A workflow integrating machine learning to NIR spectral analysis was established and implemented. Impact of various data learning approaches on NIR spectral data.
近红外光谱(NIR)是一种用于评估混合物均匀性(BU)的非破坏性特征工具。然而,粉末混合物的 NIR 光谱通常包含样品的物理和化学信息的重叠。从与物理效应相关的信息中解卷积与化学性质相关的信息是这项工作的主要目标之一。我们通过两种方式实现这一目标。首先,我们确定了可能影响 BU 结果的各种变化来源。其次,我们利用基于机器学习的复杂数据分析过程。为了实现上述目标,使用重量法制备了含有赋形剂的氨氯地平(作为活性药物成分(API))浓度在 67%至 133%w/w(剂量~3.6%w/w)之间的氨氯地平校准样品,并使用 NIR 光谱分析和 HPLC 测量进行评估。通过使用数据质量指标(DQM)和偏差方差分解(BVD)来研究 NIR 结果的偏差。为了克服偏差,应用了聚类回归(非参数和线性)。我们通过使用保留和 k 折内部交叉验证(CV)来评估模型的性能。建立了基于 NIR 的低平均绝对误差(0.674(均值)±0.218(标准差)w/w)和低混合均匀性的混合物均匀性。此外,还利用基于引导的 CV 作为 NIR 方法生命周期管理的一部分,该方法分别显示了 BU 的平均绝对误差(MAE)为±3.5%w/w 和 BU 的平均绝对误差(MAE)±1.5%w/w,用于模型通用性和模型可转移性。建立并实施了集成机器学习到 NIR 光谱分析的工作流程。研究了各种数据学习方法对 NIR 光谱数据的影响。