Chen Yu-Cheng, Fioretti Ismaele, Lin Dong-Qiang, Sponchioni Mattia
Key Laboratory of Biomass Chemical Engineering of Ministry of Education, Zhejiang Key Laboratory of Smart Biomaterials, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, China.
Department of Chemistry, Materials and Chemical Engineering "Giulio Natta", Politecnico di Milano, Milano, Italy.
Biotechnol Bioeng. 2025 Aug;122(8):2179-2192. doi: 10.1002/bit.29018. Epub 2025 May 9.
Hybrid models integrate mechanistic and data-driven components, effectively addressing the challenges of limited process understanding and data availability typical of biopharmaceutical processes. In this study, we applied a hybrid modeling framework named differentiable physics solver-in-the-loop (DP-SOL) to describe the reversed-phase chromatographic purification of an oligonucleotide, overcoming the mentioned limitations of purely mechanistic and data-driven models. The framework establishes a connection between neural networks (NNs) and mechanistic models through differentiable physical operators and their gradients. We first collected a data set comprising six linear gradient elution experiments at different resin loadings and gradient slopes, split in three experiments each for training and testing, for few-shot learning. The hyperparameters were determined through a grid search, resulting in a NN with two hidden layers and 14 nodes. Compared to a calibrated mechanistic model used for initialization of NN, the DP-SOL hybrid model showed significant performance improvement on both training and testing sets, with 0.97 for the former. The good predictivity of DP-SOL is attributed to the combination of mechanistic models and NNs at the solver level. As a novel and versatile hybrid modeling paradigm, DP-SOL has the potential to significantly impact modeling approaches in the downstream processing field and the broader biopharmaceutical sector.
混合模型整合了机理模型和数据驱动模型的组件,有效应对了生物制药过程中典型的过程理解有限和数据可用性不足的挑战。在本研究中,我们应用了一种名为可微物理求解器在环(DP-SOL)的混合建模框架来描述寡核苷酸的反相色谱纯化过程,克服了纯机理模型和数据驱动模型的上述局限性。该框架通过可微物理算子及其梯度在神经网络(NN)和机理模型之间建立了联系。我们首先收集了一个数据集,该数据集包含在不同树脂负载量和梯度斜率下的六个线性梯度洗脱实验,每个实验分为三个用于训练和测试的实验,用于少样本学习。通过网格搜索确定超参数,得到一个具有两个隐藏层和14个节点的神经网络。与用于初始化神经网络的校准机理模型相比,DP-SOL混合模型在训练集和测试集上均表现出显著的性能提升,训练集的性能提升为0.97。DP-SOL的良好预测性归因于在求解器层面机理模型和神经网络的结合。作为一种新颖且通用的混合建模范式,DP-SOL有可能对下游加工领域及更广泛的生物制药行业的建模方法产生重大影响。