Pharmaceutical Development Biologicals, Boehringer Ingelheim Pharma GmbH & Co. KG, Biberach an der Riss 88397, Federal Republic of Germany.
Analytical Development Biologicals, Boehringer Ingelheim Pharma GmbH & Co. KG, Biberach an der Riss 88397, Federal Republic of Germany.
J Pharm Sci. 2024 Apr;113(4):880-890. doi: 10.1016/j.xphs.2023.10.041. Epub 2023 Nov 3.
Sub-visible particles can be a quality concern in pharmaceutical products, especially parenteral preparations. To quantify and characterize these particles, liquid samples may be passed through a flow-imaging microscopy instrument that also generates images of each detected particle. Machine learning techniques have increasingly been applied to this kind of data to detect changes in experimental conditions or classify specific types of particles, primarily focusing on silicone oil. That technique generally requires manual labeling of particle images by subject matter experts, a time-consuming and complex task. In this study, we created artificial datasets of silicone oil, protein particles, and glass particles that mimicked complex datasets of particles found in biopharmaceutical products. We used unsupervised learning techniques to effectively describe particle composition by sample. We then trained independent one-class classifiers to detect specific particle populations: silicone oil and glass particles. We also studied the consistency of the particle labels used to evaluate these models. Our results show that one-class classifiers are a reasonable choice for handling heterogeneous flow-imaging microscopy data and that unsupervised learning can aid in the labeling process. However, we found agreement among experts to be rather low, especially for smaller particles (< 8 µm for our Micro-Flow Imaging data). Given the fact that particle label confidence is not usually reported in the literature, we recommend more careful assessment of this topic in the future.
亚可见颗粒可能是药物产品(特别是注射制剂)的质量关注点。为了定量和表征这些颗粒,可以将液体样品通过流成像显微镜仪器进行检测,该仪器还会生成每个检测到的颗粒的图像。机器学习技术已越来越多地应用于此类数据,以检测实验条件的变化或对特定类型的颗粒进行分类,主要集中在硅油上。该技术通常需要由主题专家手动标记颗粒图像,这是一项耗时且复杂的任务。在这项研究中,我们创建了模拟生物制药产品中发现的复杂颗粒数据集的硅油、蛋白质颗粒和玻璃颗粒的人工数据集。我们使用无监督学习技术根据样品有效地描述颗粒组成。然后,我们训练独立的一类分类器来检测特定的颗粒群体:硅油和玻璃颗粒。我们还研究了用于评估这些模型的颗粒标签的一致性。我们的结果表明,一类分类器是处理异质流成像显微镜数据的合理选择,并且无监督学习可以辅助标签过程。然而,我们发现专家之间的一致性相当低,尤其是对于较小的颗粒(对于我们的微流成像数据,<8 µm)。鉴于文献中通常不报告颗粒标签置信度的事实,我们建议在未来更仔细地评估这个问题。