Delmar Jared A, Buehler Eugen, Chetty Ashwin K, Das Agastya, Quesada Guillermo Miro, Wang Jihong, Chen Xiaoyu
Biopharmaceuticals Development, R&D, AstraZeneca, Gaithersburg, MD 20878, USA.
Data Sciences and AI, R&D, AstraZeneca, Gaithersburg, MD 20878, USA.
Mol Ther Methods Clin Dev. 2021 Apr 1;21:466-477. doi: 10.1016/j.omtm.2021.03.023. eCollection 2021 Jun 11.
Photooxidation of methionine (Met) and tryptophan (Trp) residues is common and includes major degradation pathways that often pose a serious threat to the success of therapeutic proteins. Oxidation impacts all steps of protein production, manufacturing, and shelf life. Prediction of oxidation liability as early as possible in development is important because many more candidate drugs are discovered than can be tested experimentally. Undetected oxidation liabilities necessitate expensive and time-consuming remediation strategies in development and may lead to good drugs reaching patients slowly. Conversely, sites mischaracterized as oxidation liabilities could result in overengineering and lead to good drugs never reaching patients. To our knowledge, no predictive model for photooxidation of Met or Trp is currently available. We applied the random forest machine learning algorithm to in-house liquid chromatography-tandem mass spectrometry (LC-MS/MS) datasets (Met, n = 421; Trp, n = 342) of tryptic therapeutic protein peptides to create computational models for Met and Trp photooxidation. We show that our machine learning models predict Met and Trp photooxidation likelihood with 0.926 and 0.860 area under the curve (AUC), respectively, and Met photooxidation rate with a correlation coefficient (Q) of 0.511 and root-mean-square error (RMSE) of 10.9%. We further identify important physical, chemical, and formulation parameters that influence photooxidation. Improvement of biopharmaceutical liability predictions will result in better, more stable drugs, increasing development throughput, product quality, and likelihood of clinical success.
甲硫氨酸(Met)和色氨酸(Trp)残基的光氧化很常见,且包括一些主要的降解途径,这些途径常常对治疗性蛋白质的成功应用构成严重威胁。氧化会影响蛋白质生产、制造和保质期的各个环节。在研发过程中尽早预测氧化倾向很重要,因为发现的候选药物数量远远超过能够进行实验测试的数量。未检测到的氧化倾向会在研发过程中需要昂贵且耗时的补救策略,并且可能导致优质药物缓慢地推向患者。相反,被错误认定为氧化倾向的位点可能会导致过度设计,从而使优质药物无法推向患者。据我们所知,目前尚无用于预测Met或Trp光氧化的模型。我们将随机森林机器学习算法应用于胰蛋白酶治疗性蛋白质肽的内部液相色谱 - 串联质谱(LC-MS/MS)数据集(Met,n = 421;Trp,n = 342),以创建Met和Trp光氧化的计算模型。我们表明,我们的机器学习模型预测Met和Trp光氧化可能性的曲线下面积(AUC)分别为0.926和0.860,预测Met光氧化速率的相关系数(Q)为0.511,均方根误差(RMSE)为10.9%。我们进一步确定了影响光氧化的重要物理、化学和制剂参数。改善生物制药倾向预测将产生更好、更稳定的药物,提高研发通量、产品质量和临床成功的可能性。