一种通用近红外定量模型的训练集选择策略。

A training set selection strategy for a universal near-infrared quantitative model.

机构信息

Institute of Medicinal Biotechnology, Chinese Academy of Medical Science and Peking Union Medical College, Beijing, People's Republic of China.

出版信息

AAPS PharmSciTech. 2011 Jun;12(2):738-45. doi: 10.1208/s12249-011-9638-6. Epub 2011 Jun 4.

DOI:10.1208/s12249-011-9638-6

PMID:21643864

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3134668/

Abstract

The purpose of this article is to propose an empirical solution to the problem of how many clusters of complex samples should be selected to construct the training set for a universal near infrared quantitative model based on the Naes method. The sample spectra were hierarchically classified into clusters by Ward's algorithm and Euclidean distance. If the sample spectra were classified into two clusters, the 1/50 of the largest Heterogeneity value in the cluster with larger variation was set as the threshold to determine the total number of clusters. One sample was then randomly selected from each cluster to construct the training set, and the number of samples in training set equaled the number of clusters. In this study, 98 batches of rifampicin capsules with API contents ranging from 50.1% to 99.4% were studied with this strategy. The root mean square errors of cross validation and prediction were 2.54% and 2.31% for the model for rifampicin capsules, respectively. Then, we evaluated this model in terms of outlier diagnostics, accuracy, precision, and robustness. We also used the strategy of training set sample selection to revalidate the models for cefradine capsules, roxithromycin tablets, and erythromycin ethylsuccinate tablets, and the results were satisfactory. In conclusion, all results showed that this training set sample selection strategy assisted in the quick and accurate construction of quantitative models using near-infrared spectroscopy.

摘要

本文旨在提出一种经验解决方案，以解决基于 Naes 方法构建通用近红外定量模型的训练集应选择多少个复杂样本簇的问题。采用 Ward 算法和欧几里得距离对样品光谱进行层次聚类。如果样品光谱分为两类，则将变化较大的类中最大异质性值的 1/50 设定为阈值，以确定总簇数。然后从每个簇中随机选择一个样品来构建训练集，训练集的样品数等于簇数。本研究采用该策略对 98 批 API 含量为 50.1%至 99.4%的利福平胶囊进行了研究。利福平胶囊模型的交叉验证和预测均方根误差分别为 2.54%和 2.31%。然后，我们从异常值诊断、准确性、精密度和稳健性方面评估了该模型。我们还使用训练集样品选择策略重新验证了头孢拉定胶囊、罗红霉素片和琥乙红霉素片的模型，结果令人满意。总之，所有结果均表明，该训练集样品选择策略有助于快速准确地构建近红外光谱定量模型。

相似文献

A training set selection strategy for a universal near-infrared quantitative model.一种通用近红外定量模型的训练集选择策略。

AAPS PharmSciTech. 2011 Jun;12(2):738-45. doi: 10.1208/s12249-011-9638-6. Epub 2011 Jun 4.

Construction of universal quantitative models for determination of roxithromycin and erythromycin ethylsuccinate in tablets from different manufacturers using near infrared reflectance spectroscopy.使用近红外反射光谱法构建用于测定不同厂家片剂中罗红霉素和琥乙红霉素的通用定量模型。

J Pharm Biomed Anal. 2006 May 3;41(2):373-84. doi: 10.1016/j.jpba.2005.11.027. Epub 2006 Jan 6.

[Update of near-infrared models for testing ceftazidime, water and arginine in ceftazidime for injection].[注射用头孢他啶中头孢他啶、水分及精氨酸检测近红外模型的更新]

Guang Pu Xue Yu Guang Pu Fen Xi. 2014 Oct;34(10):2617-22.

[Application of near infrared spectroscopy in rapid and simultaneous determination of essential components in five varieties of anti-tuberculosis tablets].近红外光谱法在快速同时测定五种抗结核片剂中主要成分的应用

Guang Pu Xue Yu Guang Pu Fen Xi. 2008 Aug;28(8):1814-8.

[Fast measurement method based on near infrared spectroscopy in extraction process of Tianshu capsules].[基于近红外光谱法的天舒胶囊提取过程快速测定方法]

Zhongguo Zhong Yao Za Zhi. 2016 Feb;41(4):677-682. doi: 10.4268/cjcmm20160422.

Assessment of powder blend uniformity: Comparison of real-time NIR blend monitoring with stratified sampling in combination with HPLC and at-line NIR Chemical Imaging.粉末混合均匀性评估：实时近红外混合监测与分层抽样结合高效液相色谱法及在线近红外化学成像的比较

Eur J Pharm Biopharm. 2015 Nov;97(Pt A):78-89. doi: 10.1016/j.ejpb.2015.10.002. Epub 2015 Oct 9.

Dissolution testing of isoniazid, rifampicin, pyrazinamide and ethambutol tablets using near-infrared spectroscopy (NIRS) and multivariate calibration.使用近红外光谱（NIRS）和多元校准技术对异烟肼、利福平、吡嗪酰胺和乙胺丁醇片进行溶出度测试。

J Pharm Biomed Anal. 2012 Jan 5;57:115-9. doi: 10.1016/j.jpba.2011.08.029. Epub 2011 Aug 24.

[Construction of universal quantitative models for determination of cefoperazone sodium for injection from different manufacturers using near infrared reflectance spectroscopy].[利用近红外反射光谱法构建不同厂家注射用头孢哌酮钠含量测定通用定量模型]

Guang Pu Xue Yu Guang Pu Fen Xi. 2006 Dec;26(12):2214-8.

Nondestructive quantitative analysis of erythromycin ethylsuccinate powder drug via short-wave near-infrared spectroscopy combined with radial basis function neural networks.基于径向基函数神经网络的短波近红外光谱法对琥乙红霉素粉末药品进行无损定量分析

Eur J Pharm Sci. 2007 Jul;31(3-4):156-64. doi: 10.1016/j.ejps.2007.03.006. Epub 2007 Mar 19.

[Near infrared determination of sugar content in apples based on GA-iPLS].基于GA-iPLS的苹果糖含量近红外测定法

Guang Pu Xue Yu Guang Pu Fen Xi. 2007 Oct;27(10):2001-4.

引用本文的文献

Compilation of a Near-Infrared Library for Construction of Quantitative Models of Oral Dosage Forms for Amoxicillin and Potassium Clavulanate.用于构建阿莫西林和克拉维酸钾口服剂型定量模型的近红外光谱库的编制

Front Chem. 2018 May 24;6:184. doi: 10.3389/fchem.2018.00184. eCollection 2018.

本文引用的文献

Near-infrared spectroscopy applications in pharmaceutical analysis.近红外光谱在药物分析中的应用。

Talanta. 2007 May 15;72(3):865-83. doi: 10.1016/j.talanta.2006.12.023. Epub 2006 Dec 23.

A review of near infrared spectroscopy and chemometrics in pharmaceutical technologies.药物技术中近红外光谱法与化学计量学综述

J Pharm Biomed Anal. 2007 Jul 27;44(3):683-700. doi: 10.1016/j.jpba.2007.03.023. Epub 2007 Mar 30.

J Pharm Biomed Anal. 2006 May 3;41(2):373-84. doi: 10.1016/j.jpba.2005.11.027. Epub 2006 Jan 6.

Near-infrared spectroscopy and imaging: basic principles and pharmaceutical applications.近红外光谱与成像：基本原理及药物应用

Adv Drug Deliv Rev. 2005 Jun 15;57(8):1109-43. doi: 10.1016/j.addr.2005.01.020.

Meeting the International Conference on Harmonisation's Guidelines on Validation of Analytical Procedures: quantification as exemplified by a near-infrared reflectance assay of paracetamol in intact tablets.符合国际协调会议关于分析方法验证的指导原则：以对完整片剂中扑热息痛的近红外反射率测定法为例进行定量分析

Analyst. 2000 Jul;125(7):1341-51. doi: 10.1039/b002672g.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验